OpenAI's Codex Agent Is Impressive Engineering, But I've Seen This Movie Before

The new cloud-based coding agent represents real technical progress, but let's pump the brakes on the 'end of programming' takes.

By Mark Kowalski

9 hours ago6 min de lecture

Crédit photo: Lottie animation by Centre Robotics (LottieFiles Free, used with credit). · source

Codex-1 is trained using reinforcement learning on real-world coding tasks. It generates code that "closely mirrors human style," adheres to instructions, and runs tests iteratively until they pass. That's the pitch from OpenAI's latest system card addendum, and honestly, it's not nothing.

But here's the thing: I've been covering tech since the 90s, and I've watched at least three generations of tools that were supposed to make programmers obsolete. CASE tools in the early 90s. Visual Basic's drag-and-drop revolution. Low-code platforms that were going to let "citizen developers" build enterprise software. Each time, the tools got absorbed into workflows, made some tasks easier, and programming jobs kept growing. Call me old-fashioned, but I'm skeptical this time is fundamentally different.

What Is Codex Actually Doing?

Let's start with the technical architecture, because OpenAI published a surprisingly detailed breakdown of what they call the "agent loop." The Codex CLI orchestrates models, tools, prompts, and performance using something called the Responses API. That's a lot of jargon, so let me translate.

Codex is essentially a sophisticated automation layer. You give it a task (fix this bug, add this feature, refactor this module), and it spins up a cloud sandbox, writes code, runs your test suite, reads the output, adjusts, and repeats until the tests pass. The "agent" part means it's not just generating code in one shot, it's iterating. Trying things. Failing and trying again.

This is genuinely more sophisticated than what we had two years ago! The reinforcement learning approach, where the model was trained on actual coding tasks rather than just predicting the next token in a code file, seems to produce more coherent multi-step behavior. The system card mentions that codex-1 is "a version of OpenAI o3 optimized for software engineering," which suggests they've done substantial fine-tuning beyond the base reasoning model.

More in AI Models

When a company raising $122 billion suddenly announces a billion-dollar charitable foundation, an old robotics hand can't help but squint a little.

Robert "Bob" Macintosh · 1 hour ago · 3 min

The company published detailed guidelines for how its models should behave. The document is surprisingly thoughtful, but the real test is whether it actually constrains anything.

Aisha Patel · 1 hour ago · 8 min

The AI company is giving away software to lock in government and healthcare customers. I've seen this playbook before.

Robert "Bob" Macintosh · 1 hour ago · 3 min

The company just raised $122 billion and is now pledging at least $1 billion for disease cures and community programs. The numbers are big, but what do they actually mean?

OpenAI's Codex Agent Is Impressive Engineering, But I've Seen This Movie Before

What Is Codex Actually Doing?

More in AI Models

The Self-Driving Car Parallel

What's Actually New Here?

The Young Founders Problem

So What Should We Expect?

Sources