Research Scientist – Science of Evaluation

AI Security Institute (AISI)

Posted

Feb 10, 2026

Location

UK

Type

Full-time

Compensation

£65,000–£145,000

Mission

What you will drive

Core responsibilities:

Conduct applied research on evaluation methodology, including developing new techniques and tools for measuring AI capabilities
Design and conduct experiments that extract deeper signal from evaluation data to uncover underlying capabilities
Run and analyze evaluation results to stress-test claims, characterize model capabilities, and inform policy-relevant reports
Track state of the art in frontier AI evaluation research and contribute to AISI's presence at ML conferences

Impact

The difference you'll make

This role develops rigorous techniques for measuring and forecasting AI capabilities, so that evaluation results are robust, meaningful, and useful for governance. That work directly influences how frontier AI is governed and deployed globally.

Profile

What makes you a great fit

Required qualifications:

Strong track record in applied ML, evaluation science, or experimental fields with significant methodological challenges (PhD in technical field, top-tier publications, or substantial real-world deployments)
Significant hands-on experience with LLMs and agents
Strong motivation for impactful work at the intersection of science, safety, and governance
Self-directed and adaptable; comfortable with ambiguity in a growing team

Benefits

What's in it for you

Compensation and benefits:

Salary range: £65,000–£145,000 depending on level and experience
28.97% employer pension contribution on base salary
At least 25 days annual leave, 8 public holidays, extra team-wide breaks, and 3 days off for volunteering
Generous paid parental leave (36 weeks UK statutory leave + 3 extra paid weeks)
Hybrid working with flexibility for occasional remote work abroad
Modern central London office or option to work in other UK government offices
5 days off for learning and development, annual stipends for learning, and conference funding
Pre-release access to multiple frontier models and ample compute resources
