AI Safety & Governance
Full-time
Software Engineer, Benchmarking
Epoch AI
Location
Remote
Type
Full-time
Posted
Dec 11, 2025
Compensation
USD 150000 – 150000
Mission
What you will drive
Core responsibilities:
- Run and maintain AI benchmarking infrastructure
- Integrate with AI providers and set up existing benchmarks
- Design and develop new AI benchmarks
- Facilitate internal experiments and work with the benchmarking team
Impact
The difference you'll make
This role helps evaluate frontier AI models, enabling researchers, developers, and policymakers to better understand AI development and its consequences.
Profile
What makes you a great fit
Required qualifications:
- Professional level English proficiency
- Ability to overlap with UTC-8 to UTC time zones
- Ability to travel for three retreats per year
- Software engineering skills for benchmarking and infrastructure development
Benefits
What's in it for you
Benefits include:
- Fully remote work with ability to hire in many countries
- Inclusive, equitable, and supportive community environment
- Three retreats per year for team building and communication efforts
- Commitment to diversity and accessibility
About
Inside Epoch AI
Epoch AI is a research institute that investigates trends in machine learning and the economic consequences of AI, developing a comprehensive, publicly accessible knowledge base to inform policymakers, industry leaders, and society.