The sim-to-real gap might actually be closing, and I've been wrong before

Three papers in two weeks suggest synthetic training data could replace expensive real-world robot demonstrations. I've seen this movie before, but the ending might be different this time.

By Mark Kowalski

1 hour ago6 min de lectura

Crédito de imagen: Lottie animation by Centre Robotics (LottieFiles Free, used with credit). · source

Google DeepMind just released a manipulation model trained on zero real demonstrations that actually works in the real world.

Let that sink in for a second. Zero. Not "minimal" or "reduced" or "streamlined." Zero real demonstrations. The model learned entirely in simulation and transferred to physical robots without the usual catastrophic failure we've come to expect from sim-to-real approaches. MIT Tech Review and Ars Technica both covered the release, and the independent expert reactions ranged from cautiously optimistic to genuinely surprised.

Now, I've been covering tech since the 90s, and if there's one thing I've learned it's that breakthrough announcements from big labs rarely survive contact with messy reality. I remember when we were all supposed to have self-driving cars by 2020. I remember when speech recognition was going to eliminate keyboards. I remember a lot of things that were supposed to change everything and then didn't, at least not on the timeline anyone promised. So call me old-fashioned, but my default setting is skepticism.

But here's the thing that's got my attention: DeepMind's paper isn't an isolated result. In the past two weeks, I've tracked down two other papers making similar claims through completely different approaches, and that's the kind of convergence that makes even an old skeptic sit up.

The first is from an academic group, published on arXiv, proposing a method for generating synthetic robot demonstrations from just a small seed of real ones. They use a learned simulator (not a hand-engineered one, which matters) and report performance comparable to training sets with 10x more real demonstrations. The benchmarks are standard manipulation tasks, nothing exotic, which actually makes the result more believable in my book. When researchers test on weird custom setups, I always wonder what they're hiding. Standard benchmarks mean other labs can replicate.

Cobertura relacionada

More in AI Models

Five years after AlphaFold solved protein folding, researchers are engineering heat-tolerant plants by redesigning photosynthesis itself.

Sarah Williams · 1 hour ago · 5 min

Google and OpenAI just released benchmarks showing their best models get basic facts wrong 30-40% of the time. That's... not great.

Sarah Williams · 1 hour ago · 5 min

Everyone's focused on AI chatbots manipulating users. The real concern is what happens when these systems control physical hardware.

James Chen · 1 hour ago · 6 min

DeepMind has released so many Gemini variants in the past few months that I genuinely lost count. Here's what's actually going on.

Fuentes