AI Safety & Governance Full-time

Software Engineer, Benchmarking

Epoch AI

Location

Remote, Global

Type

Full-time

Posted

Dec 12, 2025

Mission

What you will drive

  • Help evaluate frontier AI models by developing and maintaining benchmarking infrastructure
  • Implement AI benchmarks within the evaluation infrastructure to expand the set of tracked capabilities
  • Maintain and extend existing benchmarks so that new model releases can be evaluated quickly
  • Contribute to brand-new benchmarks by pitching and prototyping your own ideas

Impact

The difference you'll make

This role supports the evaluation of frontier AI models. Systematic benchmarking helps advance both the understanding and the development of AI capabilities.

Profile

What makes you a great fit

  • Software engineering skills for developing and maintaining benchmarking infrastructure
  • Experience with AI model evaluation and benchmarking
  • Ability to collaborate with researchers and analysts
  • Creative thinking for pitching and prototyping new benchmark ideas

Benefits

What's in it for you

No benefits information provided in the job description.

About

Inside Epoch AI

Epoch AI is an organization focused on AI research and evaluation, including the benchmarking of frontier AI models.