画像クレジット: Photo by Andrea De Santis on Unsplash · source
Open weights, for robotics, have arrived. Quietly, on a Tuesday, Physical Intelligence dropped π0.5: a foundation model trained directly on robot demonstration data, with the weights released under a permissive license and a substantial slice of the training data alongside them.
Ars Technica was the first to publish details. The model is multi-modal, accepts language, vision and proprioception, and outputs action trajectories for a variety of arm and hand configurations.
We think open is how this field moves forward. — Karol Hausman, Physical Intelligence (via Ars Technica)
Why this is the Llama moment
If you have been watching language models since 2023, you know the rhythm. The first really capable open-weights model arrives, and within a week, hundreds of fine-tunes appear, dozens of evaluation benchmarks emerge, and the conversation about what is possible at every scale of compute shifts permanently.
Robotics has not had that moment. There have been useful research checkpoints, but never anything with both the breadth and the permissive license needed to spawn an ecosystem.
VentureBeat reports three academic groups already planning fine-tunes within the week. That number will be much higher by the time this article is a month old.
What the architecture says about where the field is going
関連記事
More in AI Models
The latest VLA models are starting to fail in a specific, predictable way: they remember the last few seconds, and not much more. Researchers are racing to fix it.
Priya Nair · 23 May · 3 min
A long-running theoretical disagreement inside robotics research is starting to resolve in favour of one side. The implications are bigger than they sound.
Lena Park · 23 May · 3 min
Nvidia's humanoid robotics foundation model has been talked about for two years. The new SDK release is the first time it looks like a serious platform play.
Nadia Rahman · 23 May · 3 min
Open X-Embodiment was supposed to be a research curiosity. A year on, it is the default dataset for serious robot manipulation research.