OpenAI Runs on Codex Now. The Whole Company, Not Just Engineers.

The headline most people read in June was the model number. OpenAI previewed GPT-5.6, a new three-tier line built for agentic, tool-heavy work. That is the smaller story. The bigger one is what happened to the tool that runs on it.

According to Andrew Ambrosino, who leads the Codex desktop app, nearly 100% of OpenAI employees now use Codex every week. Not the engineers. All of them. Marketing, comms, finance, legal. The company crossed 5 million weekly active users on Codex, growing 6x since January. A product named after code became the way an entire frontier lab does its work.

For a founder watching this, the interesting question is not "how good is the model." It is "how did a coding tool become the default interface for a whole company, and could that happen to mine."

Fig. 1

Not an engineering tool anymore

~100% weekly use across functions, 5M+ weekly active users, 6x growth since January.

What Actually Shipped: GPT-5.6

The model line underneath Codex moved. GPT-5.6 arrived as a three-tier family, each tuned for a different job:

Sol, the frontier tier. The most capable, for the hardest agentic and reasoning work.
Terra, the balanced everyday tier. The one most teams will run by default.
Luna, the fast and affordable tier, built for high-volume, tool-heavy tasks.

Greg Brockman, OpenAI's president, called it simply "a good model." But the rollout came with a catch worth understanding. Per Dan Shipper of Every, a US government directive limited access to the top Sol tier to roughly 20 pre-approved companies, on temporary national-security grounds. It mirrors the June export restrictions on Anthropic's most capable models. The frontier tier is gated. The everyday tiers, Terra and Luna, are not, and they are what most companies will actually build on.

GPT-5.6, three tiers

Sol

Frontier tier for the hardest agentic work. Access currently gated to roughly 20 approved companies.

Terra

Balanced everyday tier. The default for most teams and the engine inside most Codex sessions.

Luna

Fast and affordable. Built for high-volume, tool-heavy tasks that run constantly in the background.

How a CLI Became a Company OS

Codex started as a terminal tool for developers. It should have stayed niche. Engineers live in the command line, finance and legal do not. So why did everyone adopt it?

Because the surface changed. Codex grew from a CLI into a full desktop app with computer use, an in-app browser, and the ability to drive Chrome through an extension. Once an agent can open a browser, read a document, fill a form, and take actions in real apps, the job stops being "writing code" and becomes "doing the work." Three capabilities did most of the lifting:

Record & Replay

Demonstrate a recurring task once, an expense report, a time-off request, and Codex turns the demo into an inspectable, editable skill. Skill capture moves from writing prompts to showing the work.

The loop pattern

One goal-loop prompt enumerates every case, tests it, fixes what breaks, and re-tests. Brockman demoed it running hundreds of user stories against an app autonomously.

Computer use

A built-in browser and app control mean workflows run against the tools you already have, instead of waiting on an API for every integration.

That last point is why non-engineers came aboard. When Peter Yang, a product leader, switched from Claude Code to Codex, the reason was not raw model quality. It was reach.

Peter
Yang

"I built so many workflows relying on those two things, browser and computer use, instead of hunting for APIs."

@petergyang · June 2026

The Record & Replay release landed the same way with practitioners. Dan Shipper, who runs Every and uses Codex daily, reacted in three words.

Dan
Shipper

"Extremely sick."

@danshipper · On Codex turning a demoed task into an editable skill

The Story Underneath the Numbers

Strip the product names away and the pattern is the one that matters for your company. A frontier lab did not roll out agents department by department through a transformation program. It shipped one tool good enough that every function adopted it on its own, then watched the work reorganize around it.

The old read

Codex is a faster IDE

Who uses it

Engineers. A productivity tool for the people who already write code.

What it does

Writes and ships code faster. Stays inside the engineering org.

Scarce skill

Knowing how to code. The bottleneck is implementation.

What June showed

Codex is the company's interface

Who uses it

Everyone. Marketing, finance, legal, comms, ops, alongside engineering.

What it does

Runs the work. Coordination, research, reporting, and operations, not just code.

Scarce skill

Taste. Implementation is cheap, so knowing what good looks like is the bottleneck.

Ambrosino's framing for the shift is worth keeping: when implementation gets cheap, the scarce skill becomes taste, knowing what done looks like and curating toward it. His stated goal for Codex is to build the best desktop app that has ever existed, a home base that orchestrates every other tool. OpenAI is dogfooding that bet on itself, in public, as the clearest live example of an agent-native company.

What This Means for Your Company

OpenAI is a special case. It builds the models, so of course it runs on them. But the mechanism that made adoption stick is not special, and it is the part worth copying.

Codex spread because two conditions were met. The tool could take real actions in real systems, and the context it needed, the goals, the institutional knowledge, the linked data, was within reach. Where those two conditions hold, agents move from engineering into every function. Where they do not, you get a pilot in one team and a stalled rollout everywhere else.

That is the work. Not picking the model, the tiers are converging and you will run whatever is good and available. The work is making each function legible enough that an agent can act inside it: clean context, defined goals, connected data, a clear definition of done. Get that right for one function and the same playbook repeats across the next. That is exactly how the rollout runs, function by function, and it is the work nativefirst does on site.

The model is not the moat. The rollout is.

Book a free Diagnostic: 30 to 45 minutes, no deck, no pitch. We map which functions in your company are ready for an agent, what the data and context layer looks like, and which one to ship first so the rest can follow.

Book the Diagnostic →

Sources

1Andrew Ambrosino (Codex desktop lead) on Lenny Rachitsky's podcast, June 2026. Codex adoption across every department at OpenAI; implementation cheap, taste scarce; goal of "the best desktop app that has ever existed."

2Greg Brockman, OpenAI, internal Codex adoption data shared on X, June 2026. Company-wide use for complex, long-running, cross-functional work; the goal-loop ("/loop") demo for testing every feature in an app.

3Dan Shipper, Every, June 2026. On the GPT-5.6 Sol access restriction (US government directive, roughly 20 approved companies) and on Codex Record & Replay turning a demoed task into an editable skill.

4Peter Yang, June 2026. On switching from Claude Code to Codex for its browser and computer-use workflows.

5OpenAI security push, June 22, 2026: GPT-5.5-Cyber, "Patch The Planet," and Codex Security, moving from finding to solving security problems.

John Tan

Founder and CEO of nativefirst.ai. Embeds with scaling founders and CEOs to ship Level-3 agents and AI workflows in production.