The Robot Navigation Problem Nobody Wants to Talk About: Uncertainty
Two new papers tackle the same uncomfortable truth, that robots don't actually know what they're looking at half the time.
Crédit photo: Lottie animation by Centre Robotics (LottieFiles Free, used with credit). · source
Here's a confession that'll make the young founders squirm: I've been covering autonomous navigation since before most of today's robotics PhDs were born, and the fundamental problem hasn't changed. Robots are still basically guessing. They're just getting better at hiding it.
Two papers dropped on arXiv this week that, for once, actually grapple with this instead of pretending it away. PSG-Nav from one research team and HSAN from another are both trying to solve what I'd call the "is that a chair or a dog" problem, except it's way more consequential when your delivery robot decides a toddler is furniture.
The core issue is semantic uncertainty, and I've seen this movie before. Back in the early days of self-driving cars, everyone was obsessed with making perception systems more confident. More training data! Better sensors! The reality is that the real world is messy and weird and full of objects that don't fit neatly into categories. A beanbag chair. A person in a mascot costume. A pile of laundry that, okay, actually does look like a small animal from certain angles.
Most navigation systems today handle this by just, well, picking the most likely option and running with it. The PSG-Nav researchers call this "local optimal deterministic approaches" which is academic-speak for "making your best guess and hoping for the best." It works until it doesn't, and when it doesn't you get robots bumping into things or, worse, freezing up entirely because they can't reconcile what they're seeing with what they expected.
What PSG-Nav proposes is actually clever, maybe too clever, but clever. Instead of committing to a single interpretation of what the robot sees, it builds what they call a "3D Probabilistic Scene Graph" that maintains full probability distributions. So instead of saying "that's definitely a chair (95% confidence)," it says "that's probably a chair but might be a stool or a small table or seventeen other things, here's the breakdown." Then, and this is the interesting part, it samples multiple possible "world settings" from these distributions and evaluates navigation decisions against all of them.
They're calling this "Multiverse Decision Making" which, call me old-fashioned, sounds like marketing got involved. But the underlying idea is sound! If your robot isn't sure whether that blob is a trash can or a person, plan routes that work reasonably well in both scenarios rather than betting everything on your first guess.
The numbers are actually impressive, though I've learned to be skeptical of benchmark results. PSG-Nav claims 66.1% success rate on MP3D, 44.8% on HM3D, and 67.9% on HSSD. For context, these are standard indoor navigation benchmarks, and those numbers represent state-of-the-art performance. The 44.8% on HM3D might look underwhelming, but that benchmark is notoriously difficult, lots of cluttered residential environments where everything looks like everything else.
Sources
- PSG-Nav: Probabilistic Scene Graph Navigation via Multiverse Decision Making· arXiv — cs.RO (Robotics)
- Hierarchical Semantic-Augmented Navigation: Optimal Transport and Graph-Driven Reasoning for Vision-Language Navigation· arXiv — cs.RO (Robotics)
À lire aussi
More in Autonomy
A wave of new research is pushing multi-modal perception forward, and honestly, the progress is more incremental than revolutionary.
Sarah Williams · 1 hour ago · 4 min
New reinforcement learning techniques tackle the jitter problem that's been plaguing autonomous systems for years, and honestly, it's about time.
Mark Kowalski · 1 hour ago · 6 min
Two new papers tackle the same old problem I've been griping about since my Kuka days: you can have accurate robot control or fast robot control, but getting both is still a pain.
Robert "Bob" Macintosh · 3 hours ago · 3 min
A flurry of new research papers claim big improvements in robot navigation. Some of it's genuinely clever, some of it's solving problems we created for ourselves.