Crédit photo: Lottie animation by Centre Robotics (LottieFiles Free, used with credit). · source
Most coverage of OpenAI's recent enterprise announcements has focused on the headline number: one million business customers. It's a big, round figure that makes for good press releases and investor decks. But to be precise, this number tells us almost nothing about what's actually happening with AI adoption in organizations, and the more interesting developments are buried in the details that most outlets glossed over.
I've spent the past week reading through OpenAI's various announcements, from the BNY partnership to the UK government deal to the Singapore initiative, and what strikes me isn't the scale claims. It's the shift in how OpenAI is positioning itself. This is a company that built its reputation on research papers and model releases. Now it's talking about "company-wide AI agents" and "AI-powered work." That transition deserves more scrutiny than it's getting.
Let's start with the number everyone's citing. One million business customers sounds impressive until you ask what counts as a "business customer." OpenAI hasn't disclosed this, which is, I know I'm being picky here, but a fairly significant omission. Does a solo freelancer with a ChatGPT Plus subscription count? What about a ten-person startup where one employee uses the API for a side project? The company's own blog post mentions "ChatGPT and our APIs" together, suggesting these are aggregated figures.
This matters because the gap between "someone at a company pays for ChatGPT" and "an organization has deployed AI systematically" is enormous. The former is interesting for OpenAI's revenue. The latter is interesting for understanding how work is actually changing.
À lire aussi
More in AI Models
Everyone's covering the parental controls. The real story is how OpenAI is trying to solve an almost impossible problem: age verification without surveillance.
James Chen · 1 hour ago · 7 min
The company is rapidly expanding where customer data can live, but the real question is whether this solves the problems enterprises actually have.
James Chen · 1 hour ago · 5 min
Three announcements in quick succession reveal OpenAI isn't just scaling up, it's building the backbone for AI that needs to think and respond in real-time.
Sarah Williams · 1 hour ago · 6 min
A string of partnerships with Foxconn, the DOE, and governments worldwide suggests OpenAI is becoming something very different from what it started as.
The BNY case is more instructive here. According to OpenAI's announcement, over 20,000 employees at the financial services firm now use "Eliza," an internal platform built on OpenAI's technology. That's roughly 40% of BNY's workforce, if the company's public headcount figures are accurate. More notably, these employees aren't just using ChatGPT for ad hoc queries. They're building AI agents, which is a qualitatively different kind of adoption.
But we should be careful about generalizing from one case study. BNY is a financial services company with significant technical resources and, presumably, a substantial budget for this kind of initiative. Whether similar adoption patterns hold for, say, mid-sized manufacturing firms or healthcare providers remains unclear.
OpenAI's recent blog post on enterprise AI outlines what the company calls "the next phase." Reading between the lines, this appears to be a shift from selling API access to selling something closer to an enterprise platform. The post mentions Frontier (their premium tier), ChatGPT Enterprise, Codex, and "company-wide AI agents" as distinct offerings.
The agent framing is worth unpacking. It's worth noting that "AI agent" has become one of those terms that means different things to different people. In academic contexts, it typically refers to systems that can take autonomous actions over extended time horizons. In enterprise marketing, it often means something closer to "automated workflow with an LLM in the middle."
From what I can gather, OpenAI's enterprise agents seem to fall somewhere between these definitions. The BNY case describes employees "building AI agents," which suggests some level of customization and autonomy, but the examples given (improving efficiency, enhancing client outcomes) are vague enough to encompass anything from sophisticated multi-step reasoning systems to glorified chatbots with extra prompting.
This isn't necessarily a criticism. Incremental improvements to existing workflows can be genuinely valuable. But the language around agents has gotten so inflated that it's hard to know what's actually being deployed. I'd want to see more specific descriptions of what these agents do, what decisions they make autonomously, and what guardrails exist.
Two of OpenAI's recent announcements involve government partnerships, one with the UK and one with Singapore. These have been covered as major policy developments, but the details are thin enough that it's difficult to assess what's actually been agreed to.
The UK partnership is described as a "strategic partnership to deliver AI-driven growth." This includes vague commitments to "boost AI adoption," "drive economic growth," and "enhance public services." The announcement mentions a "thriving AI ecosystem" but provides no specific deliverables, timelines, or funding figures. Actually, the research shows that these kinds of framework agreements often amount to less than the press releases suggest. They establish intent and create a structure for future negotiations, but the substantive decisions happen later, if at all.
The Singapore initiative is slightly more concrete, describing a "multi-year AI partnership" with goals around deployment, talent development, and support for businesses and public services. But again, the specifics are missing. How many people will be trained? What public services will be affected? What does "support" actually mean in practice?
I'm not suggesting these partnerships are meaningless. Government engagement with frontier AI companies is important, and establishing relationships early can shape how regulations develop. But the coverage has treated these announcements as bigger news than the available evidence supports. We don't know yet what these partnerships will produce.
Buried in a separate leadership update, OpenAI acknowledges something interesting: "We remain focused on the same core, pursuing frontier AI research that accelerates human progress, but we now also deliver products used by hundreds of millions of people."
This is a significant admission, even if it's framed positively. OpenAI started as a research lab. It's now a company that has to balance research priorities with the demands of maintaining products at massive scale. These are different organizational challenges, and the history of tech companies suggests that the product side tends to win over time. Resources flow toward what generates revenue.
The sample size is small, but we've seen this pattern before. Google's research division has done groundbreaking work, but the company's priorities are clearly shaped by its advertising business. Facebook's AI research team produced important papers, but the organization's focus is on engagement metrics. Whether OpenAI can maintain its research edge while also running a billion-dollar enterprise business is an open question.
This isn't to say the transition is bad. A sustainable business model might actually enable more research in the long run. But the tension is real, and the leadership update's acknowledgment of it suggests the company is at least aware of the challenge.
Several things I'd want to know that none of these announcements address:
First, deployment outcomes. OpenAI provides plenty of adoption metrics (users, customers, employees) but almost nothing about what happens after deployment. Are these tools actually improving productivity? By how much? The BNY case mentions "efficiency" and "client outcomes" but provides no quantification. Given the significant investment these deployments require, this is a notable gap.
Second, failure modes. Any technology deployed at this scale will have failures. What goes wrong? How often? What are the error rates on these enterprise agents? The absence of this information makes it hard to assess whether the adoption numbers represent successful integration or widespread experimentation with mixed results.
Third, competitive dynamics. OpenAI isn't the only company pursuing enterprise AI. Anthropic, Google, Microsoft (which has a complicated relationship with OpenAI itself), and various startups are all competing for the same customers. The announcements present OpenAI's growth as if it's happening in a vacuum, but enterprise decisions involve trade-offs between vendors. Understanding why companies choose OpenAI over alternatives would be more informative than knowing how many have.
Fourth, and this is something I think about a lot, the research implications. If OpenAI is increasingly focused on enterprise products, what does that mean for the kind of research it prioritizes? Enterprise customers want reliability, consistency, and integration with existing systems. These are different optimization targets than capability improvements or novel architectures. The research agenda may shift accordingly.
The enterprise AI space is moving quickly, and it's genuinely hard to assess what's happening without better data. A few things that would help:
Independent audits of deployment outcomes. Not case studies selected by vendors, but systematic assessments of how AI tools perform across a range of organizations. This would help separate genuine productivity improvements from hype.
More granular adoption metrics. "One million business customers" is a marketing number. Breaking this down by organization size, industry, use case, and depth of integration would be far more informative.
Longitudinal studies. We're still early enough in enterprise AI adoption that most assessments are snapshots. Understanding whether early enthusiasm translates into sustained use requires tracking deployments over time.
Failure case analyses. The AI safety community has pushed for transparency about model failures. Similar transparency about deployment failures would help organizations make better decisions about adoption.
I don't expect OpenAI to provide all of this. Companies have legitimate reasons to protect competitive information. But the gap between the confident claims in these announcements and the evidence available to evaluate them is wider than the coverage suggests.
The enterprise AI story is important. It's where the abstract capabilities of language models meet the messy realities of how organizations actually work. But telling that story well requires more skepticism about headline numbers and more attention to the details that actually matter. One million business customers is a data point. What those customers are doing, whether it's working, and what it means for the future of work, those are the questions we should be asking.