AI Safety & Governance Full-time

Program Manager, Responsible Scaling Policy

Anthropic

Location

USA

Type

Full-time

Posted

Jan 20, 2026

Compensation

USD 190000 – 190000

Mission

What you will drive

Core responsibilities:

  • Build relationships with internal experts on Safeguards, Security, Alignment, capability evaluations, and other relevant areas
  • Learn about ongoing questions and challenges faced by these teams, and help to find the best balance between AI risk reduction and pragmatism
  • Gather information from across the company and assist in analyzing relevant risks
  • Help to drive conversations about the ambitious-but-achievable plans we can make to level up our risk mitigations
  • Collaborate with our Policy team, advising on regulation relevant to RSPs
  • Generally work to identify the most promising risk mitigations to be further developed and/or exported to the rest of the AI industry

Impact

The difference you'll make

This role works on the nuts and bolts of AI risk reduction through Anthropic's Responsible Scaling Policy, helping to manage catastrophic risks from advanced AI systems and identify promising risk mitigations that could benefit the entire AI industry.

Profile

What makes you a great fit

Required qualifications:

  • Have (or can quickly develop) a deep understanding of major catastrophic risks potentially posed by advanced AI
  • Are passionate about reducing the harms and improving the benefits of AI and align with Anthropic's safety mission and commitment to responsible AI development
  • Are a pragmatic problem-solver who relishes the opportunity to find the best balance between the goal of reducing catastrophic risks and staying on the frontier of AI development
  • Can write quickly and clearly in a way that gives audiences an easy time understanding your reasoning and justifications for your claims
  • Are a flexible generalist who is excited to learn about many topics and evolve your work in unanticipated directions as we contend with a rapidly changing AI landscape
  • Are highly organized and excel at managing multiple projects simultaneously, maintaining clear priorities and delivering on tight timelines
  • Are comfortable operating in ambiguous environments, taking ownership of problems, and proactively driving initiatives with limited precedent
  • Are scrappy and hard-working, but also humble, genuinely collaborative, and a strong team player
  • Hold at least a Bachelor's degree in a related field

Benefits

What's in it for you

Competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Annual salary range: $190,000 - $285,000 USD.

About

Inside Anthropic

Visit site →

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole.