AI Safety & Governance
Software Engineer, Benchmarking
Epoch AI
Location
Remote, Global
Type
Full-time
Posted
Dec 12, 2025
Mission
What you will drive
- Help evaluate frontier AI models by developing and maintaining benchmarking infrastructure
- Implement AI benchmarks within evaluation infrastructure to expand tracked capabilities
- Maintain and extend existing benchmarks so that new model releases can be evaluated quickly
- Contribute to brand-new benchmarks by pitching and prototyping your own ideas
Impact
The difference you'll make
This role supports the evaluation of frontier AI models, helping to advance the understanding and development of AI capabilities through systematic benchmarking.
Profile
What makes you a great fit
- Software engineering skills for developing and maintaining benchmarking infrastructure
- Experience with AI model evaluation and benchmarking
- Ability to collaborate with researchers and analysts
- Creative thinking for pitching and prototyping new benchmark ideas
Benefits
What's in it for you
No benefits information provided in the job description.
About
Inside Epoch AI
Epoch AI appears to be an organization focused on AI research and evaluation, specifically working on benchmarking frontier AI models.