AI Safety & Governance Full-time

Software Engineer, Benchmarking

Epoch AI

Location

Remote

Type

Full-time

Posted

Dec 11, 2025

Compensation

USD 150000 – 150000

Mission

What you will drive

Core responsibilities:

  • Run and maintain AI benchmarking infrastructure
  • Integrate with AI providers and set up existing benchmarks
  • Design and develop new AI benchmarks
  • Facilitate internal experiments and work with the benchmarking team

Impact

The difference you'll make

This role helps evaluate frontier AI models, enabling researchers, developers, and policymakers to better understand AI development and its consequences.

Profile

What makes you a great fit

Required qualifications:

  • Professional level English proficiency
  • Ability to overlap with UTC-8 to UTC time zones
  • Ability to travel for three retreats per year
  • Software engineering skills for benchmarking and infrastructure development

Benefits

What's in it for you

Benefits include:

  • Fully remote work with ability to hire in many countries
  • Inclusive, equitable, and supportive community environment
  • Three retreats per year for team building and communication efforts
  • Commitment to diversity and accessibility

About

Inside Epoch AI

Visit site →

Epoch AI is a research institute that investigates trends in machine learning and the economic consequences of AI, developing a comprehensive, publicly accessible knowledge base to inform policymakers, industry leaders, and society.