Crédit photo: Lottie animation by Centre Robotics (LottieFiles Free, used with credit). · source
Most coverage of the recent agentic AI announcements has focused on the wrong thing. Headlines trumpet GitHub stars and enterprise partnerships, but the actual news is more structural: we're watching the infrastructure layer of agentic AI get carved up in real time, and the decisions being made now will shape this market for years.
Let me be specific about what happened. NVIDIA announced that Hermes Agent crossed 140,000 GitHub stars in under three months. OpenAI co-founded something called the Agentic AI Foundation under the Linux Foundation and donated a specification called AGENTS.md. OpenAI also updated its Agents SDK with sandbox execution capabilities. And Accenture announced a partnership with OpenAI to deploy agentic AI in enterprises.
Read most tech coverage, and you'd think these are four separate stories. They're not. They're four moves in the same game.
OpenAI's decision to create the Agentic AI Foundation under the Linux Foundation is the most consequential announcement here, and it's gotten the least attention. The foundation exists to support "open, interoperable standards for safe agentic AI," which sounds like corporate boilerplate until you think about what it means in practice.
AGENTS.md is a specification. Specifications become standards. Standards become the rules everyone has to follow. OpenAI donating this to a Linux Foundation project is a classic infrastructure play: give away the standard, shape the standard, then build the best implementation of the standard.
I've seen this pattern before in hardware. From my time building industrial systems, I watched companies donate interface specifications to standards bodies while simultaneously developing proprietary extensions that only worked with their hardware. The spec is open. The best implementation is not.
À lire aussi
More in AI Models
Five years after AlphaFold solved protein folding, researchers are engineering heat-tolerant plants by redesigning photosynthesis itself.
Sarah Williams · 1 hour ago · 5 min
Google and OpenAI just released benchmarks showing their best models get basic facts wrong 30-40% of the time. That's... not great.
Sarah Williams · 1 hour ago · 5 min
Three papers in two weeks suggest synthetic training data could replace expensive real-world robot demonstrations. I've seen this movie before, but the ending might be different this time.
Mark Kowalski · 1 hour ago · 6 min
Everyone's focused on AI chatbots manipulating users. The real concern is what happens when these systems control physical hardware.
That's an ambitious framing for what might just be a markdown file, I'll grant. But the timing matters. We're at the stage where agentic AI frameworks are proliferating wildly, and whoever defines how agents communicate, handle permissions, and interact with tools will have enormous influence over the ecosystem.
NVIDIA's blog post leads with the 140,000 GitHub stars figure for Hermes Agent. That's a big number. It's also a weird metric to emphasize.
GitHub stars measure interest, not usage. They measure how many people clicked a button, not how many people deployed something in production. Look, I get why companies use this metric. It's big and impressive and easy to explain. But when I see a press release leading with GitHub stars rather than, say, active deployments or API calls or actual revenue, I start wondering what numbers they're not sharing.
The more interesting detail buried in NVIDIA's announcement is the positioning. Hermes is described as running on "NVIDIA RTX PCs and DGX Spark." That's a hardware play. NVIDIA wants agentic AI workloads running on NVIDIA silicon, whether that's consumer RTX cards or enterprise DGX systems. The agent framework is a means to that end.
This is where the infrastructure war gets tangible. You have:
NVIDIA pushing frameworks that run best on NVIDIA hardware
OpenAI pushing standards and SDKs that work best with OpenAI models
Various open source projects trying to remain hardware and model agnostic
The 140,000 stars suggest Hermes has community momentum. Whether that translates to production deployments is, well, it's too early to say. The real test is whether enterprises actually build on this or treat it as another framework to evaluate and shelve.
The Agents SDK update adds two things worth noting: native sandbox execution and what OpenAI calls a "model-native harness."
Sandbox execution matters for security. If you're building agents that can execute code, modify files, and use tools autonomously, you need isolation. You need to ensure that an agent gone wrong can't access your production database or email your customers. Native sandbox support means developers don't have to build this themselves, which lowers the barrier to building agents that actually do things rather than just chat.
The model-native harness is more opaque. OpenAI describes it as helping developers build "secure, long-running agents across files and tools." That's vague enough that I'm not entirely sure what it means in practice. It sounds like infrastructure for agents that persist over time rather than handling single requests, but the announcement doesn't include enough technical detail to evaluate whether this is genuinely new or just repackaging existing capabilities.
What I can say is that these updates position OpenAI's SDK as the path of least resistance for developers building agentic systems. You can use other frameworks. You can use other models. But if you want sandbox execution and long-running agent support out of the box, OpenAI is making that easy.
The Accenture partnership is the announcement that matters most for near-term revenue and matters least for long-term technology direction.
Accenture is enormous. They have relationships with basically every large enterprise on the planet. A partnership to "bring agentic AI capabilities into the core of their business" means Accenture consultants will be recommending OpenAI-based solutions to their clients. That's distribution.
But it's also enterprise theater in a familiar way. Large consulting firms announce AI partnerships constantly. The question is always what actually gets deployed versus what gets announced, piloted, and quietly shelved. I don't have data on Accenture's track record here (and if anyone does, I'd genuinely like to see it), but my default assumption with enterprise AI partnerships is that announced deployments outnumber production deployments by a significant margin.
The more interesting signal is that OpenAI is pursuing this channel at all. They're not just building APIs and hoping developers find them. They're actively working the enterprise sales motion through partners. That suggests they see the enterprise market as critical for agentic AI, which makes sense. Consumers might play with chatbots, but enterprises will pay for agents that automate actual work.
Several important questions remain unanswered by these announcements:
Actual deployment numbers. GitHub stars, partnership announcements, and foundation launches are all leading indicators at best. We don't know how many agents built on any of these frameworks are running in production. We don't know what tasks they're performing. We don't know failure rates or human oversight requirements.
Interoperability in practice. The Agentic AI Foundation talks about open, interoperable standards. But will agents built on OpenAI's SDK actually interoperate with agents built on Hermes? Will AGENTS.md become a real standard or a reference implementation that everyone ignores? It's too early to say.
Safety and control mechanisms. Every announcement mentions safety. None of them provide detailed technical specifications for how safety is implemented. What happens when an agent takes an action it shouldn't? What are the rollback mechanisms? What audit trails exist? These questions matter more as agents get more capable, and I haven't seen satisfying answers from anyone.
The economics. Running agents is compute-intensive. Long-running agents that persist over time and execute code in sandboxes will cost more than simple API calls. Who pays for that compute? How do the economics work for enterprise deployments? NVIDIA presumably wants to sell hardware. OpenAI presumably wants to sell API calls. But the actual cost structure of production agentic systems remains unclear.
Here's what I think most coverage misses: the interesting competition isn't between individual agent frameworks. It's between visions of how the agentic AI stack should work.
NVIDIA wants a world where agents run on local hardware (their hardware) with cloud optional. Their emphasis on RTX PCs and DGX Spark points toward edge deployment, where the agent runs near the user rather than in a distant data center.
OpenAI wants a world where agents are cloud-native, built on their APIs, following their standards, with their models at the center. The SDK updates, the foundation launch, the Accenture partnership all point toward a centralized, API-driven architecture.
Neither vision is obviously right. Edge deployment offers latency and privacy advantages. Cloud deployment offers easier updates and potentially lower hardware requirements for end users. The answer probably varies by use case.
But the infrastructure decisions being made now will constrain what's possible later. If AGENTS.md becomes the standard for agent communication, OpenAI has shaped the playing field. If Hermes becomes the default framework for NVIDIA hardware, that's a moat. If Accenture standardizes on OpenAI for enterprise deployments, that's distribution that's hard to replicate.
The 140,000 GitHub stars are noise. The foundation launch is signal. The SDK updates are signal. The enterprise partnership is signal. The signal is that the infrastructure layer of agentic AI is being actively contested, and the winners will have enormous influence over what agents can do and who profits from them.
I don't know who wins. I'm not sure anyone does yet. But I know that in a way, the framework and standards battles matter more than the individual agent capabilities everyone's excited about. Capabilities improve every few months. Infrastructure decisions stick around for years.