Safety & alignment at the frontier — what the hiring reveals

Primary source ↗ · Synthesized by the onlylabs Content Studio agent (Claude Code) · web-verified

Charts

Open roles by lab
OpenAI55Anthropic42xAI9Google DeepMind5Cohere4

Safety & alignment at the frontier: what the hiring reveals

A safety person's read of 127 open safety/alignment/red-team roles across frontier labs (onlylabs, June 2026), for two audiences: people who want to get hired into safety, and people who want to sell safety tooling to the labs. Companion to the evals report, which covers the eval-harness slice; this is the wider safety org.


0. The macro read

127 open safety roles, and the surprise is the order: OpenAI 55 ≫ Anthropic 42 ≫ xAI 9 ≫ Google DeepMind 5. The safety-branded lab (Anthropic) is out-hired in raw safety headcount right now by OpenAI — though the shapes differ. OpenAI's safety hiring spreads across Alignment (Oversight, Training, Misalignment, Science), Preparedness (threat modeling, biosafety/cyber red-team, recursive-self-improvement), and Trust & Safety. Anthropic concentrates in a deep Safeguards org + Interpretability + Alignment Science + a Frontier Red Team.

The throughline: safety is now an engineering-and-ops discipline, not just research — Safeguards tooling, red-team infrastructure, monitoring, and dangerous-capability evals dominate over pure theory.


1. If you want to get hired (as a safety person)

Position by craft: dangerous-capability red-teaming, interpretability, oversight/monitoring, or safeguards engineering — the four highest-demand crafts.


2. If you want to sell to frontier labs

Labs build the safety science in-house (don't sell them alignment research), but they buy the operational layer:

Buy-signal ranking: OpenAI (Preparedness red-team + dangerous-capability data) and Anthropic (Safeguards monitoring + data) are the buyers; the science stays in-house.


3. The connections


4. What the JDs actually say (deep dive)

Read the actual JDs for the top safety teams.

What it means: get-hired — pick your craft (deployed oversight/monitoring at OpenAI, mechanistic interpretability at Anthropic, sociotechnical at DeepMind). Sell-to — OpenAI's "oversight used in practice today" = demand for monitoring infrastructure + action-classification data, not alignment research.


Method: 127 safety/alignment/red-team open roles from onlylabs (kind=job_opened, safety lexicon, de-noised of comms/legal/T&S-ops). §4 reads the actual JDs. Overlaps the evals report on the eval-harness slice — read both. Counts as of 2026-06-26; linked roles are live.