Image credit: Lottie animation by Centre Robotics (LottieFiles Free, used with credit). · source
OpenAI is moving fast, and I'm not convinced anyone is moving fast enough to keep up with the implications. Over the past several months, the company has announced a series of partnerships that, taken individually, seem reasonable enough: training programs for journalists, tools for government agencies, educational initiatives in schools. But when you step back and look at the full picture, what emerges is something that warrants considerably more attention than it's receiving. OpenAI is systematically embedding itself into the institutional infrastructure of democratic societies, and the research community has barely begun to grapple with what that means.
Let me be precise about what's happening. The company has launched OpenAI for Government, an initiative to bring its tools to U.S. public servants. It has partnered with the Greek government to deploy ChatGPT Edu in secondary schools under the banner of "OpenAI for Greece". It has created the OpenAI Academy for News Organizations, developed with the American Journalism Project and The Lenfest Institute, to train journalists and editors. It has expanded partnerships with publishers like Axios. Each of these initiatives comes wrapped in language about responsible use, public good, and supporting existing institutions. And to be fair, some of that language may well be sincere.
But sincerity is not the same as accountability, and good intentions do not substitute for rigorous evaluation. What strikes me most about these announcements is not what they contain but what they omit. Where are the independent assessments? Where are the pre-registered studies examining outcomes? Where is the peer-reviewed research on the effects of deploying large language models in classrooms, newsrooms, and government offices? The answer, to be blunt, is that it largely doesn't exist yet. We are conducting a massive, uncontrolled experiment on critical institutions, and we're doing it without the methodological safeguards that we would demand in almost any other domain.
Related coverage
More in AI Models
Everyone's covering the parental controls. The real story is how OpenAI is trying to solve an almost impossible problem: age verification without surveillance.
James Chen · 47 mins ago · 7 min
The company is rapidly expanding where customer data can live, but the real question is whether this solves the problems enterprises actually have.
James Chen · 47 mins ago · 5 min
Three announcements in quick succession reveal OpenAI isn't just scaling up, it's building the backbone for AI that needs to think and respond in real-time.
Sarah Williams · 48 mins ago · 6 min
A string of partnerships with Foxconn, the DOE, and governments worldwide suggests OpenAI is becoming something very different from what it started as.
I know I'm being picky here, but the details matter. When OpenAI announces a partnership with the Greek government to bring AI into secondary schools, the stated goals are to boost AI literacy, support local startups, and drive economic growth. These are admirable objectives. But the research on AI in education is, at best, preliminary. A handful of studies have examined the effects of AI tutoring systems on student learning outcomes, and the results are mixed. Some show modest gains in specific contexts; others show no significant effect; a few raise concerns about decreased engagement with human instructors. The sample sizes are often small. The interventions are heterogeneous. Most critically, almost none of this research examines the specific case of deploying general-purpose large language models in secondary education at national scale. We simply don't know what happens when you do that.
The situation in journalism is similarly underexplored. OpenAI's Academy for News Organizations offers training, practical use cases, and responsible-use guidance. CNA, a Singaporean news network, has been highlighted as a case study in AI adoption, with Editor-in-Chief Walter Fernandez sharing insights on the transformation of their newsroom. But what does transformation actually mean in practice? Are AI tools being used for research assistance, for drafting, for editing, for fact-checking? How are accuracy rates affected? What happens to the skills of junior reporters who learn their craft alongside AI systems rather than through traditional mentorship? These are empirical questions, and the answers remain unclear.
It's worth noting that OpenAI is not unique in pursuing institutional partnerships. Google, Microsoft, and Anthropic have all made similar moves. But OpenAI's pace and ambition stand out. The company's own leadership updates acknowledge that it has grown significantly, now delivering products used by hundreds of millions of people while remaining focused on frontier AI research. That combination (massive scale plus frontier capabilities) creates a particular kind of leverage. When OpenAI embeds itself into schools, governments, and newsrooms, it's not just selling a product. It's shaping how these institutions understand and use AI, which in turn shapes how they understand and use information itself.
This brings me to what I think is the core issue, and it's one that the research community has been slow to address. The question is not simply whether AI tools are helpful or harmful in specific applications. The question is about the concentration of influence. When a single company, however well-intentioned, becomes the default AI provider for governments, schools, and media organizations across multiple countries, the effects extend far beyond any individual use case. You create dependencies. You create path lock-in. You create situations where institutional knowledge becomes entangled with proprietary systems in ways that are difficult to unwind.
Actually, the research shows that this kind of lock-in is a well-documented phenomenon in technology adoption. Studies of enterprise software, educational technology, and government IT systems have repeatedly found that early choices about platforms and vendors create lasting constraints on institutional behavior. Once an organization has invested in training, workflows, and integrations around a particular system, the costs of switching become prohibitive. This is not a criticism of OpenAI specifically; it's a structural feature of how institutions adopt technology. But it means that the decisions being made right now, often quickly, often without rigorous evaluation, will have consequences that persist for years or decades.
I should acknowledge what I don't know, which is quite a lot. I don't have access to the internal deliberations at OpenAI, the American Journalism Project, the Greek Ministry of Education, or any of the other organizations involved in these partnerships. It's possible that there are careful evaluation frameworks in place that simply haven't been made public. It's possible that the responsible-use guidance being provided is more rigorous than the public announcements suggest. I would genuinely like to be wrong about the lack of independent assessment. But based on the publicly available information, the evidence for systematic evaluation is thin.
What would I want to see? At minimum, pre-registered studies with clear outcome measures. For educational deployments, that means tracking not just AI literacy but also traditional learning outcomes, student engagement, and teacher workload over time. For journalism applications, that means measuring accuracy, productivity, and the development of professional skills. For government use, that means examining decision quality, efficiency, and the distribution of benefits across different populations. These studies should be conducted by independent researchers with full access to relevant data, and the results should be published regardless of whether they favor the technology.
I would also want to see serious attention to the question of alternatives. Right now, the framing of these partnerships tends to assume that AI adoption is inevitable and that the only question is how to do it responsibly. But this forecloses important questions. What would it mean for schools to prioritize human-centered pedagogies that don't rely on AI? What would it mean for newsrooms to invest in traditional reporting infrastructure rather than AI tools? What would it mean for governments to develop in-house technical capacity rather than depending on external vendors? These are not rhetorical questions. They represent genuine policy alternatives that deserve consideration.
The broader context here is that we are in a period of rapid AI deployment that is outpacing our ability to evaluate consequences. This is not a new observation, but it bears repeating. The research community, the policy community, and the journalistic community have all struggled to keep up with the pace of change. Partly this is a resource problem: independent research takes time and money, and the organizations best positioned to study AI deployment are often the same organizations deploying it. Partly it is an access problem: the data needed to evaluate these systems is often proprietary. And partly it is a framing problem: the discourse around AI tends to oscillate between utopian enthusiasm and dystopian fear, neither of which is conducive to careful empirical analysis.
I find myself, somewhat unusually for me, wanting to end on a note of uncertainty rather than conclusion. The initiatives OpenAI is pursuing may turn out to be beneficial. They may turn out to be harmful. They may turn out to be basically neutral, with the AI tools functioning as modest productivity aids that don't fundamentally change institutional dynamics. We don't know yet. And that's precisely the problem. We're making large-scale decisions about the integration of AI into critical social institutions without the evidence base that should inform those decisions. The responsible path forward is not to halt all deployment, which is neither realistic nor necessarily desirable, but to insist on rigorous, independent evaluation as a condition of continued expansion. So far, I'm not seeing that insistence from the institutions that should be demanding it.