AI Safety & Governance

Research Engineer, Frontier Red Team (Autonomy)

Anthropic

Location

USA

Type

Full-time

Posted

Jan 25, 2026

Compensation

USD 350,000 – 850,000

Mission

What you will drive

Core responsibilities:

  • Design and build autonomous AI systems that can use tools and operate across diverse environments—creating model organisms that help us understand and defend against advanced adversarial AI
  • Create evals and training environments to understand and shape agent behavior in desirable ways
  • Develop defensive agents that can detect, disrupt, or outcompete adversarial AI systems in realistic scenarios
  • Interface Claude with hardware platforms (e.g. robotics, physical systems) to understand cyberphysical risks and defenses

Impact

The difference you'll make

Your work will inform decisions at the highest levels of the company, contribute to public demonstrations that shape policy discourse, and help build technical defenses that could matter enormously as AI systems become more capable. By establishing what these systems can do and building defenses against them, this research aims to make the world safer in an era of advanced AI.

Profile

What makes you a great fit

Required qualifications:

  • Strong software engineering skills, particularly in Python
  • Experience building and working with LLM-based agents or autonomous systems
  • Drive to find solutions to ambiguously scoped, high-stakes problems
  • Ability to design and run experiments quickly, iterating fast toward useful results
  • Comfort working in collaborative environments (we love pair programming!)
  • Deep care about AI safety and a desire for your work to have real-world impact

Benefits

What's in it for you

Competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Annual salary range: $350,000–$850,000 USD.

About

Inside Anthropic

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.