AI Safety & Governance Full-time

Research Scientist, Societal Impacts

Anthropic

Location

USA

Type

Full-time

Posted

Jan 25, 2026

Compensation

USD 200,000

Mission

What you will drive

Core responsibilities:

  • Using observational tools like Clio to analyze real-world usage patterns and surface insights about how people interact with Claude
  • Building and running evaluations to assess Claude's behavior across key dimensions of its Constitution, such as safety and quality of advice in high-stakes situations
  • Partnering closely with fine-tuning, safeguards, policy, and interpretability teams to translate research insights into model improvements
  • Generating insights about the societal impact of Anthropic's systems and using this understanding to inform company strategy, research priorities, and policy positions

Impact

The difference you'll make

This role generates insights about the societal impact of AI systems and uses them to inform company strategy, research priorities, and policy positions, with the aim of making AI safe and beneficial for society.

Profile

What makes you a great fit

Required skills and qualifications:

  • Experience working with machine learning systems and comfort with the technical infrastructure for interfacing with models
  • Background in machine learning, data science, or another technical field that involves generating insights from complex systems
  • Adaptable and collaborative, able to take direction and contribute to team priorities
  • Skilled at writing up and communicating results, even when they're null or unexpected
  • Passionate about translating research insights into actionable recommendations for improving AI systems and informing policy

Benefits

What's in it for you

Competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues.

About

Inside Anthropic

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. They want AI to be safe and beneficial for users and society as a whole.