The Data Problem That Won't Die: Why Robot Learning Still Can't Scale

Six new papers promise to solve robot training bottlenecks. I've seen this movie before, but this time the approaches are actually interesting.

By Mark Kowalski

5 hours ago · 5 min read

Image credit: Lottie animation by Centre Robotics (LottieFiles Free, used with credit).

If you've been covering robotics long enough, you start to recognize patterns. Every few years, someone announces they've cracked the data problem for robot learning. The demo videos look great. The papers get accepted to top conferences. And then, quietly, the approach doesn't quite work outside the lab.

I'm looking at six new papers this week that all tackle the same fundamental issue: robots need enormous amounts of training data, but collecting that data is expensive, dangerous, and slow. The solutions being proposed are clever, I'll give them that. But whether they'll actually move the needle is another question entirely.

The Human Video Gambit

The most ambitious approach comes from the team behind Phantom, which proposes training manipulation policies directly from human video demonstrations, with no robot data required. They convert human demos into robot-compatible observation-action pairs using hand pose estimation, then basically Photoshop the human arm out and paste in a rendered robot arm.
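To make the "observation-action pairs" idea concrete, here's a minimal sketch of the action half: turning estimated hand keypoints into a parallel-gripper command. This is not Phantom's code. The function names, the choice of thumb and index fingertips, and the 8 cm and 2 cm thresholds are all assumptions for illustration, and the hand pose estimator that would produce the keypoints is left out entirely.

```python
from dataclasses import dataclass
from math import dist

Point = tuple[float, float, float]  # (x, y, z) in metres, camera or world frame


@dataclass
class GripperAction:
    position: Point  # target end-effector position
    opening: float   # commanded finger gap in metres
    closed: bool     # True when the fingers are treated as grasping


def hand_to_action(
    thumb_tip: Point,
    index_tip: Point,
    max_opening: float = 0.08,      # hypothetical gripper stroke
    close_threshold: float = 0.02,  # hypothetical "pinch" distance
) -> GripperAction:
    """Map two fingertip keypoints to a gripper command (illustrative only)."""
    # Place the end effector halfway between the two fingertips.
    midpoint = tuple((a + b) / 2 for a, b in zip(thumb_tip, index_tip))
    # Use the fingertip distance as the finger gap, clipped to the gripper's range.
    gap = dist(thumb_tip, index_tip)
    return GripperAction(midpoint, min(gap, max_opening), gap < close_threshold)


def demo_to_actions(frames: list[tuple[Point, Point]]) -> list[GripperAction]:
    """Convert a sequence of (thumb_tip, index_tip) frames into gripper actions."""
    return [hand_to_action(thumb, index) for thumb, index in frames]
```

The observation half, where the human arm is painted out and a rendered robot arm is pasted in, is the harder part and isn't sketched here.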

The results are surprisingly good: success rates of up to 92% on tasks including deformable object manipulation and insertion, with zero-shot deployment on real hardware and no fine-tuning. If this holds up, it's a big deal.

But here's what I keep thinking about: we've been here before with self-driving cars. Remember when everyone thought you could just train on dashcam footage from YouTube? The edge cases killed that approach. Manipulation has even more edge cases than driving, arguably, because contact physics are brutally unforgiving.

The paper shows strong results on its test tasks, but it remains unclear whether this generalizes to the messy, unpredictable objects you'd find in an actual warehouse or kitchen. I only found the one paper on this specific inpainting approach, so we're working with limited data on whether it scales.

