Research Scientist/Engineer (Evaluations)
Apollo Research
Posted
Feb 13, 2026
Location
UK
Type
Full-time
Compensation
$135,000
Mission
What you will drive
Core responsibilities:
- Run pre-deployment evaluation campaigns on the most capable AI systems in the world, partnering with multiple labs to access a breadth of models
- Dive deep into AI cognition by reviewing thousands of model transcripts to surface behavioral patterns
- Build new evaluations for frontier risks, from designing novel test environments to scaling them across hundreds of distinct scenarios
- Automate and improve the evaluation pipeline, rethinking and reshaping it as new possibilities emerge
Impact
The difference you'll make
This role helps assess the risks posed by scheming AIs. Its evaluations directly inform deployment decisions for the most capable AI systems in the world, contributing to AI safety and responsible development.
Profile
What makes you a great fit
Required skills and qualifications:
- Strong software engineering skills with experience shipping and maintaining production Python code
- Process optimization skills with a focus on improving workflows and reducing friction
- Data analysis and pattern recognition abilities to extract signal from large, messy datasets
- Excellent writing and communication skills to convey findings to both technical and non-technical audiences
- AI power-user with experience using different models and experimenting with new AI workflows
Benefits
What's in it for you
The job posting does not list specific benefits.
About
Inside Apollo Research
Apollo Research focuses on risks from Loss of Control in AI, particularly deceptive alignment and scheming. It works on the detection, science, and mitigation of these risks through evaluations and collaboration with frontier AI companies.