Crédit photo: Lottie animation by Centre Robotics (LottieFiles Free, used with credit). · source
Forty percent. That's the gap OpenAI claims exists between what current AI systems can do and what most organizations actually use them for. I initially thought that number seemed high, maybe a bit self-serving coming from a company that sells AI tools. But after digging into their new reports, I think they might actually be underselling the problem.
The company dropped two related pieces this week: a global adoption study called "How countries can end the capability overhang" and an enterprise-focused report on The state of enterprise AI. Together, they paint a picture of a technology that's moved faster than the institutions trying to use it. Which, honestly, tracks with what I've been hearing from people building in this space.
The capability overhang, explained. OpenAI's framing here is interesting. They're arguing that we've hit a weird moment where the limiting factor isn't the AI itself, it's everything around it: training, integration, regulatory clarity, and frankly, organizational willingness to change how work gets done. The models are ready. The world isn't.
This isn't entirely new thinking. We've seen versions of this argument from economists studying general purpose technologies (electricity took decades to reshape factories, etc.). But OpenAI is making a more urgent claim: that the overhang is actively widening because model capabilities keep improving while adoption crawls along.
You might be wondering why OpenAI, of all companies, is publishing policy-adjacent research about national AI strategies. The cynical read is obvious: they want governments to buy more ChatGPT Enterprise licenses. But I think there's something else going on too. They're trying to shift the conversation from "should we deploy AI" to "how do we deploy AI faster," which is a very different framing than the safety-first discourse that's dominated the past two years.
À lire aussi
More in AI Models
ChatGPT Health looks polished, but anyone who's watched enterprise software enter hospitals knows the real test comes later.
Robert "Bob" Macintosh · 1 hour ago · 4 min
A new study claims to show how ChatGPT creates economic value, though the research design leaves some important questions unanswered.
Aisha Patel · 1 hour ago · 7 min
CyberAgent's rollout of ChatGPT Enterprise reminds me of watching PLCs spread through manufacturing in the 90s, for better and worse.
Robert "Bob" Macintosh · 1 hour ago · 3 min
A single model that handles vision, audio, and language at once sounds great on paper. I've heard that pitch before.
What the enterprise data actually shows. The enterprise report is more concrete, and tbh more interesting to me. OpenAI pulled data from their own customer base (which introduces some selection bias, obviously) and found a few patterns worth noting.
First, organizations that moved past the "experimentation" phase saw productivity gains that compounded. The report doesn't give exact figures here, which is frustrating, but they describe a pattern where initial pilots lead to expanded deployments, which lead to workflow redesigns, which lead to larger gains. This matches what I've heard from enterprise AI teams: the first use cases are rarely the most valuable ones. They're just the door openers.
Second, and this surprised me a bit, the biggest predictor of successful adoption wasn't technical sophistication. It was executive sponsorship combined with clear metrics. Companies that said "we're going to measure X and Y" did better than companies that said "let's see what AI can do." Which sounds obvious when you write it down, but apparently most organizations still start with the vague approach.
Third, integration depth matters more than breadth. Companies using AI in three workflows deeply outperformed companies using it in fifteen workflows superficially. Again, not shocking, but it's useful to see it quantified (or at least described, since the actual numbers aren't public).
The country-level stuff is where it gets political. The global report is where OpenAI is clearly trying to influence policy. They're essentially arguing that national AI strategies need to focus less on building foundation models and more on deploying existing ones. This is, conveniently, exactly what you'd expect OpenAI to say given that they already have the models.
But I don't think that makes them wrong. Most countries aren't going to build competitive frontier models. The compute costs alone are prohibitive. So the question becomes: if you're not building, how do you capture value? OpenAI's answer is basically "make it easier to adopt our stuff," which includes things like regulatory sandboxes, public sector pilots, and workforce training programs.
The report outlines what they're calling new initiatives to help countries close the capability gap. Details are thin, it reads more like a framework than a concrete program. But they're clearly positioning themselves as partners to governments, not just vendors. Whether that's genuine or strategic (probably both?) remains unclear.
What's missing from these reports. I should know this better, but I couldn't find any discussion of the downsides of rapid AI adoption in either document. Job displacement gets a brief mention in the context of "workforce transitions," but there's no serious engagement with the possibility that closing the capability overhang might have costs alongside benefits.
There's also no acknowledgment of the concentration effects. If every country adopts AI faster, but that AI comes from three or four companies in two countries, what does that mean for technological sovereignty? OpenAI doesn't address this, which is understandable (they'd be arguing against their own business model) but notable.
And honestly, the enterprise report suffers from a classic problem: they're measuring their successful customers. We don't know how many companies tried to adopt AI deeply and failed, or what failure modes look like. Selection bias isn't fatal to their conclusions, but it should make us hold them loosely.
The productivity gains question. Here's what I keep coming back to: are the productivity gains real and durable, or are we in a honeymoon period where everything looks promising because it's new?
The enterprise report claims measurable gains, but measurable by whom and how? If a company reports that their customer service team handles 30% more tickets with AI assistance, is that because AI is genuinely productive, or because they've lowered quality standards, or because they've shifted work to customers ("please try our AI chatbot first")? These distinctions matter and they're hard to measure from the outside.
I'm not saying the gains are fake. I think there are real productivity improvements happening in specific use cases. Code generation is the obvious one. Document summarization and search seem legitimate. Some customer service applications work. But the broad claims about economy-wide productivity transformations? It's too early to say. We're maybe 18 months into widespread deployment. The data just isn't there yet.
So what should we make of this? OpenAI is doing something clever with these reports. They're positioning themselves as thought leaders on AI adoption, not just AI capability. That's a smart move as the conversation shifts from "can AI do X" to "should organizations use AI for X and how."
But I think we should read these reports as what they are: advocacy documents from an interested party. The framing of a "capability overhang" that needs to be "ended" assumes that faster adoption is better. That's not obviously true. It might be true. But it's a claim that needs defending, not a premise we should accept.
The most useful thing in these reports is probably the enterprise adoption patterns. Even accounting for selection bias, understanding what separates successful AI deployments from failed ones is genuinely valuable. Executive sponsorship, clear metrics, deep integration over broad experimentation, these are actionable insights.
As for the policy recommendations, I'm more skeptical. Not because they're wrong exactly, but because they're incomplete. A serious conversation about national AI strategy needs to include questions OpenAI isn't asking: about dependency, about labor impacts, about whether "ending the capability overhang" is actually the right goal.
But that's probably asking too much from a corporate research publication. They've given us data and a framework. The harder questions are ours to figure out.