AI Safety & Governance Full-time

Software Engineer, Benchmarking

Epoch AI

Location

Remote, Global

Type

Full-time

Posted

Dec 12, 2025

Mission

What you will drive

Help evaluate frontier AI models by developing and maintaining benchmarking infrastructure
Implement AI benchmarks within evaluation infrastructure to expand tracked capabilities
Develop existing benchmarks to quickly evaluate new model releases
Contribute to brand new benchmarks by pitching and prototyping your own ideas

Impact

The difference you'll make

This role contributes to the evaluation of frontier AI models, helping to advance understanding and development of AI capabilities through systematic benchmarking.

Profile

What makes you a great fit

Software engineering skills for developing and maintaining benchmarking infrastructure
Experience with AI model evaluation and benchmarking
Ability to collaborate with researchers and analysts
Creative thinking for pitching and prototyping new benchmark ideas

Benefits

What's in it for you

No benefits information provided in the job description.

About

Inside Epoch AI

Visit site →

Epoch AI appears to be an organization focused on AI research and evaluation, specifically working on benchmarking frontier AI models.

Apply

Ready to move?

Remote Impact verifies every role with the hiring team. Submissions go directly to Epoch AI.