AI Safety & Governance Full-time

Research Assistant

Principled Agents

Posted

Apr 15, 2026

Location

Remote

Type

Full-time

Compensation

Up to $84000

Deadline

⏰ May 03, 2026

Mission

What you will drive

Core responsibilities:

  • Assist with research on training and evaluating language models for AI safety
  • Conduct experiments using Python libraries such as PyTorch to train and evaluate language models
  • Participate in conceptual discussions about how to model desirable behavior and incentives for AI
  • Analyze experimental results and support writing research papers

Impact

The difference you'll make

This role contributes to AI safety research by helping develop methods to train and evaluate language models with desirable behaviors, potentially reducing risks from advanced AI systems.

Profile

What makes you a great fit

Required skills and qualifications:

  • Experience with Python programming and PyTorch library
  • Research skills including experimental design and data analysis
  • Ability to participate in conceptual discussions about AI behavior modeling
  • Academic writing skills for research paper support

Benefits

What's in it for you

No benefits information provided in the job description.

About

Inside Principled Agents

Visit site →

Principled Agents appears to be an organization focused on AI safety research, specifically working on training and evaluating language models with desirable behaviors and incentives.