Infra & systems at the frontier — what the hiring reveals

Primary source ↗ · Synthesized by the onlylabs Content Studio agent (Claude Code) · web-verified

Charts

Open roles by lab
OpenAI138CoreWeave96Nebius86Anthropic74Cerebras40DigitalOcean34Together AI34xAI29Baseten24

Infra & systems at the frontier: what the hiring reveals

An infra person's read of 715 open infrastructure/systems roles across frontier labs and neoclouds (onlylabs, June 2026), for two audiences: people who want to get hired, and people who want to sell infra to the labs. Every cited role links to its live posting.


0. The macro read

The frontier infra buildout is physical first. The largest category by far is data center / hardware (199) — the power, cooling, facilities, and networking of the GPU buildout — ahead of platform/SRE (95), inference/serving (87), and GPU/kernels/perf (81). Training systems is the smallest (15) — labs guard the core training stack tightly and hire it rarely and senior.

Two structural reads:

1. Neoclouds are data-center machines; labs are broad-stack. Nebius (53 of 86) and CoreWeave (50 of 96) are overwhelmingly data center/hardware — they're building physical GPU clouds. OpenAI (138) and Anthropic (74) spread across the whole stack (GPU, serving, platform, DC). The chip/serving shops — Cerebras, Baseten, Together — concentrate in inference/serving + kernels (their product). 2. The competitive layer is serving + performance. Inference/serving (87) and GPU/kernels/perf (81) are where Anthropic (15 serving), Cerebras (11+12), OpenAI (13 GPU), Together, and Baseten all hire — this is the layer where cost-per-token is won.

LabTrain sysInference/servingGPU/kernelsData center/HWPlatform/SRETotal
OpenAI48132916138
CoreWeave148501496
Nebius13531186
Anthropic115916774
Cerebras11126640
DigitalOcean831134
Together1864834
xAI1112329
Baseten1753324

1. If you want to get hired (as an infra/systems person)

Position by layer, then by lab:


2. If you want to sell to frontier labs (and neoclouds)

The build-vs-buy map:

Buy-signal ranking: Neoclouds (CoreWeave/Nebius/DigitalOcean) for physical + capacity; frontier labs (OpenAI/Anthropic) for kernels/observability/serving at the edges; serving shops (Cerebras/Baseten/Together) as both buyers of low-level tooling and competitors in serving.


3. The connections (signal → signal)


4. What the JDs actually say (deep dive)

Read the actual JDs for the top infra teams (Greenhouse + Ashby posting APIs).

What it means: the stack is bifurcating — labs go vertical (OpenAI silicon; internal training systems), neoclouds own the physical resale (CoreWeave/Nebius), and a funded serving-product layer (Baseten/Together/Cerebras) fights for the cost-per-token margin. Sell into whichever layer you're not competing with.


Method: 715 infrastructure/systems open roles pulled from onlylabs (kind=job_opened, infra lexicon, de-noised of non-eng roles), classified by layer. §4 reads the actual JDs for the top teams. Dossiers: OpenAI · CoreWeave · Nebius · Anthropic. Counts as of 2026-06-26; every linked role is a live posting.