The Robot Navigation Problem Nobody Wants to Talk About: Uncertainty

Two new papers tackle the same uncomfortable truth, that robots don't actually know what they're looking at half the time.

By Mark Kowalski

1 hour ago5 min de lectura

Crédito de imagen: Lottie animation by Centre Robotics (LottieFiles Free, used with credit). · source

Here's a confession that'll make the young founders squirm: I've been covering autonomous navigation since before most of today's robotics PhDs were born, and the fundamental problem hasn't changed. Robots are still basically guessing. They're just getting better at hiding it.

Two papers dropped on arXiv this week that, for once, actually grapple with this instead of pretending it away. PSG-Nav from one research team and HSAN from another are both trying to solve what I'd call the "is that a chair or a dog" problem, except it's way more consequential when your delivery robot decides a toddler is furniture.

The core issue is semantic uncertainty, and I've seen this movie before. Back in the early days of self-driving cars, everyone was obsessed with making perception systems more confident. More training data! Better sensors! The reality is that the real world is messy and weird and full of objects that don't fit neatly into categories. A beanbag chair. A person in a mascot costume. A pile of laundry that, okay, actually does look like a small animal from certain angles.

Most navigation systems today handle this by just, well, picking the most likely option and running with it. The PSG-Nav researchers call this "local optimal deterministic approaches" which is academic-speak for "making your best guess and hoping for the best." It works until it doesn't, and when it doesn't you get robots bumping into things or, worse, freezing up entirely because they can't reconcile what they're seeing with what they expected.

What PSG-Nav proposes is actually clever, maybe too clever, but clever. Instead of committing to a single interpretation of what the robot sees, it builds what they call a "3D Probabilistic Scene Graph" that maintains full probability distributions. So instead of saying "that's definitely a chair (95% confidence)," it says "that's probably a chair but might be a stool or a small table or seventeen other things, here's the breakdown." Then, and this is the interesting part, it samples multiple possible "world settings" from these distributions and evaluates navigation decisions against all of them.

Cobertura relacionada

More in Autonomy

A wave of new research is pushing multi-modal perception forward, and honestly, the progress is more incremental than revolutionary.

Sarah Williams · 1 hour ago · 4 min

New reinforcement learning techniques tackle the jitter problem that's been plaguing autonomous systems for years, and honestly, it's about time.

Mark Kowalski · 1 hour ago · 6 min

Two new papers tackle the same old problem I've been griping about since my Kuka days: you can have accurate robot control or fast robot control, but getting both is still a pain.

Robert "Bob" Macintosh · 3 hours ago · 3 min

A flurry of new research papers claim big improvements in robot navigation. Some of it's genuinely clever, some of it's solving problems we created for ourselves.

Fuentes