AI Safety & Governance Full-time

Researcher, Evaluations

Epoch AI

Posted

Jul 01, 2026

Location

Remote

Type

Full-time

Compensation

Up to $200,000

Mission

What you will drive

Evaluate frontier AI models on real-world tasks by curating and refining benchmark suites.
Create and curate an evaluation suite with realistic tasks that challenge practical AI capabilities.
Design grading rubrics and assess AI performance quantitatively and qualitatively on complex work.
Communicate research findings through reports, blog posts, and visualizations to inform stakeholders.

Impact

The difference you'll make

This role helps ensure frontier AI models are rigorously evaluated on real-world tasks, contributing to safer and more reliable AI development.

Profile

What makes you a great fit

Experience in AI evaluation, benchmarking, or related research.
Strong data analysis skills and ability to automate workflows.
Excellent communication skills for reporting and visualization.

Benefits

What's in it for you

Benefits beyond the stated compensation are not specified in the posting.

About

Inside Epoch AI

Epoch AI is a research organization focused on analyzing and forecasting the trajectory of AI development, particularly in the context of AI safety and governance.
