AI Safety & Governance Full-time

Software Engineer, Safeguards Infrastructure

Anthropic

Location

USA

Type

Full-time

Posted

Jan 25, 2026

Compensation

USD 200000 – 200000

Mission

What you will drive

Core responsibilities:

  • Develop the foundational systems which power Safeguards, including infrastructure for data storage and management, metric and evaluation systems, and tooling for human and agentic review.
  • Ensure the day-to-day running of Safeguards systems and hold a high operational bar which serves both safety and customers while reducing the amount of human intervention and oversight required.
  • Build robust and reliable multi-layered defenses for real-time improvement of safety mechanisms that work at scale.

Impact

The difference you'll make

This role creates positive change by building systems to detect unwanted model behaviors and prevent disallowed use of AI models, working to monitor models, prevent misuse, and ensure user well-being to uphold principles of safety, transparency, and oversight.

Profile

What makes you a great fit

Required qualifications:

  • Bachelor's degree in Computer Science, Software Engineering or comparable experience
  • 4-10+ years of experience in a software engineering position
  • Proficiency in Python
  • Ability to work across the stack
  • Strong communication skills and ability to explain complex technical concepts to non-technical stakeholders

Benefits

What's in it for you

Competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues.

About

Inside Anthropic

Visit site →

Anthropic's mission is to create reliable, interpretable, and steerable AI systems, wanting AI to be safe and beneficial for users and society as a whole.