OpenAI previewed GPT-5.6 on June 26, and the most important fact about it is one you cannot act on. The best version, Sol, is locked to roughly 20 approved companies by a US government order. The model everyone is talking about is one almost no one can use.

Look past the gate and the real news is the shape. GPT-5.6 is not one model, it is a three-tier family built for agentic, tool-heavy work, with a new mode that spins up subagents inside the model itself. Greg Brockman, OpenAI's president, summed it up in three words. "A good model." That is the understatement. Here is what actually changed, what it costs, and the part you need to know before you plan around it.

It Is Three Models, Named for the Sky

GPT-5.6 is not one model. It is a family of three: Sol, Terra, Luna. Sun, Earth, Moon. The naming is not decoration. It tells you exactly where to point each tier, and what each one costs.

Sol
Sun · Frontier
$5 in   $30 out

The frontier. The only tier with max and ultra reasoning, and the only one gated.

Terra
Earth · Everyday
$2.50 in   $15 out

The everyday default. The engine inside most Codex sessions.

Luna
Moon · Volume
$1 in   $6 out

Fast and cheap. High-volume work that runs in the background.

Per million tokens. Frontier to volume, about 5x apart end to end. Cached input reads are 90% off.

The spread is deliberate. Sol is the frontier, the only tier with the new max and ultra reasoning modes, and the only one that is gated. Terra is the everyday default that quietly powers most Codex sessions. Luna is the volume play. You will almost certainly build on Terra and Luna, which is why the gate on Sol matters less than the headlines suggest. More on the gate below.

The Real Headline: Subagents in the Model

Sol ships with two new reasoning effort settings that sit above the old high. The first, max, gives the model the most time it has ever had to reason through a single problem before answering. That is the expected move, more thinking for harder questions.

The second is the one that matters. Ultra goes beyond a single agent. Instead of one model reasoning longer, ultra spins up subagents and parallelizes the work across them, then pulls the results back together. Orchestration that teams used to build by hand, with frameworks and glue code, is now a setting on the model itself.

Fig. 1
The reasoning ladder, and where ultra breaks the pattern
none low medium high max one agent, longest think ultra subagent subagent subagent max thinks longer. ultra thinks wider.
Effort used to mean one agent reasoning longer. Ultra makes it many agents reasoning in parallel.

Caching That Behaves

Less flashy, more useful day to day: prompt caching got predictable. GPT-5.6 adds explicit cache breakpoints, so you decide where the cached prefix ends, and a 30-minute minimum cache life, so a cached context does not silently evaporate between calls. Reads off the cache are 90% cheaper than fresh input.

If you run agents, this is the quiet win. Agent loops re-send the same instructions, tools, and context on every turn. With reliable caching and a real read discount, the standing context an agent carries stops being the expensive part of the bill. The model is finally priced the way agents actually use it.

The Benchmarks

OpenAI points to gains in coding, biology, and cybersecurity. The number worth holding onto is agentic: Sol sets a new state of the art on Terminal-Bench 2.1, the benchmark that tests command-line workflows requiring planning, iteration, and tool coordination. That is the right thing to measure for a model built to drive tools, not just answer questions.

Coding
Terminal-Bench 2.1

Sol takes the top spot on the agentic CLI benchmark: plan, run, read the result, fix, repeat. The workflow real agents live in.

Reasoning
max and ultra

New effort ceilings on Sol. Max for the hardest single problem, ultra for work that splits across subagents.

Safety
Harder guardrails

Strengthened safeguards for higher-risk activity, sensitive cyber requests, and repeated misuse ship with Sol.

The Catch: Sol Is Gated

Here is the part that changes how you should plan. GPT-5.6 launched as a limited preview, and the frontier Sol tier is not open. Per Dan Shipper of Every, a US government directive limited access to Sol to roughly 20 pre-approved companies on temporary national-security grounds. It mirrors the June export restrictions on Anthropic's most capable models. Terra and Luna are not gated, and general availability for all three is promised in the coming weeks.

Dan Shipper
Dan
Shipper

"The top tier is gated to a short list of approved companies, on national-security grounds. The everyday tiers are what most teams will actually build on."

Dan Shipper, Every  ·  June 2026
Gated

Sol, the frontier tier

Who gets it
Roughly 20 approved companies during the preview, under a US directive.
What you lose without it
Max and ultra reasoning, the absolute top of the agentic benchmarks.
Open

Terra and Luna

Who gets it
Broadly available, with full GA promised in the coming weeks.
Why it is enough
Terra is the engine inside most Codex sessions. It is what most companies will ship on anyway.
Greg Brockman
Greg
Brockman

"A good model."

Greg Brockman, OpenAI  ·  June 2026

What This Means for Your Company

Skip the temptation to chase the gated tier. The interesting move in GPT-5.6 is not Sol, it is the direction of travel: a model with subagents built in, a default tier good enough to run real work, and caching priced for agent loops. The frontier and everyday tiers are converging, and you will run whatever is good and available when you build.

So the model is not the decision. The decision is whether your functions are ready for one. Clean context, defined goals, connected data, a clear definition of done. Get that right for one function and the same playbook repeats across the next, whatever model is sitting underneath. That is the work, and it is the work nativefirst does on site.

The tiers will converge. The readiness is on you.

Book a free Diagnostic: 30 to 45 minutes, no deck, no pitch. We map which functions in your company are ready for an agent, what the data and context layer looks like, and which one to ship first so the rest can follow.

Book the Diagnostic →
Sources
1OpenAI, "Previewing GPT-5.6 Sol," June 2026. Three-tier family (Sol, Terra, Luna), new max and ultra reasoning effort, ultra running subagents, predictable prompt caching with explicit breakpoints and a 30-minute minimum cache life, and Sol setting a new state of the art on Terminal-Bench 2.1.
29to5Mac, June 26, 2026. GPT-5.6 limited preview, per-million-token pricing for Sol ($5 / $30), Terra ($2.50 / $15), and Luna ($1 / $6), cache read discount, and general availability planned for the coming weeks.
3Dan Shipper, Every, June 2026. On the GPT-5.6 Sol access restriction: a US government directive limiting the top tier to roughly 20 approved companies, mirroring June restrictions on Anthropic's most capable models.
4Greg Brockman, OpenAI, on X, June 2026. "A good model," on the GPT-5.6 release.
John Tan
John Tan

Founder and CEO of nativefirst.ai. Embeds with scaling founders and CEOs to ship Level-3 agents and AI workflows in production.