AI Safety & Governance Full-time

Researcher, Evaluations

Epoch AI

Posted

Jul 01, 2026

Location

Remote

Type

Full-time

Compensation

Up to $200000

Mission

What you will drive

  • Evaluate frontier AI models on real-world tasks by curating and refining benchmark suites.
  • Create and curate an evaluation suite with realistic tasks that challenge practical AI capabilities.
  • Design grading rubrics and assess AI performance quantitatively and qualitatively on complex work.
  • Communicate research findings through reports, blog posts, and visualizations to inform stakeholders.

Impact

The difference you'll make

This role helps ensure frontier AI models are rigorously evaluated on real-world tasks, contributing to safer and more reliable AI development.

Profile

What makes you a great fit

  • Experience in AI evaluation, benchmarking, or related research.
  • Strong data analysis skills and ability to automate workflows.
  • Excellent communication skills for reporting and visualization.

Benefits

What's in it for you

Compensation and benefits not specified in the posting.

About

Inside Epoch AI

Visit site →

Epoch AI is a research organization focused on analyzing and forecasting the trajectory of AI development, particularly in the context of AI safety and governance.