The boring sensor fusion papers that will actually make your robot work
While everyone's chasing humanoid hype, two new papers tackle the unsexy problem of knowing where your robot actually is.
Crédito da imagem: Lottie animation by Centre Robotics (LottieFiles Free, used with credit). · source
Most of the robotics coverage I've read this week has been about humanoids doing backflips or foundation models that can supposedly fold laundry. Fine, that stuff gets clicks. But if you've ever actually deployed a robot outside a controlled demo environment, you know the real problem isn't teaching it to fold, it's teaching it to know where it is when the GPS cuts out for three seconds because it drove under a bridge.
Two papers dropped on arXiv this week that nobody's going to write breathless headlines about, and that's exactly why I'm writing about them. They're solving the actual hard problem.
The problem nobody wants to talk about
Here's what happens in the real world: your autonomous vehicle or ground robot has an IMU (the thing that measures acceleration and rotation), GPS, maybe some wheel encoders, maybe a camera running visual SLAM. Each of these sensors lies to you in different ways. The IMU drifts. GPS jumps around or disappears entirely. Wheel encoders slip. Your visual system loses tracking when the sun hits the lens wrong.
The standard approach is to fuse all this data through a Kalman filter, basically a mathematical way of combining noisy measurements to get a better estimate than any single sensor could give you. This has been the bread and butter of autonomous navigation since, well, since before most of the founders raising money for robotics companies were born.
The problem is that standard implementations kind of suck when conditions get hard. I've seen this movie before, by the way, back when everyone was promising self-driving cars by 2020 and then discovered that sensor fusion in the real world is messier than sensor fusion in simulation.
Cobertura relacionada
More in Autonomy
The Luce is weird, expensive, and nobody asked for it. Ferrari doesn't care. I've seen this movie before.
Mark Kowalski · 42 mins ago · 5 min
Two new papers tackle robot navigation with pixel-level maps and dynamic scene graphs. I've seen this kind of progress before, and I'm cautiously optimistic.
Mark Kowalski · 42 mins ago · 5 min
Two new papers show how visual AI can build maps that actually work for navigation, and I'm cautiously optimistic.
Robert "Bob" Macintosh · 43 mins ago · 4 min
New research shows convex-guided neural sampling can cut robot path planning time by up to 98%, though the real-world implications remain murky.
