Two New SLAM Papers Want Robots to See the World More Like We Do

A pair of arXiv papers tackle one of robotics' oldest headaches: getting robots to build accurate maps of the world, even when the lighting is terrible or the geometry is tricky.

7 hours ago · 8 min read

Think about the last time you walked into a dark parking garage. Your eyes adjusted, maybe slowly, but you didn't lose track of where you were. You didn't suddenly forget the shape of the pillars or the slope of the ramp. You just... kept going.

Robots can't do that. Not reliably, anyway. And that gap, between what humans take for granted and what robots can actually manage, is basically the whole problem that SLAM research is trying to close.

SLAM stands for Simultaneous Localization and Mapping. It's the process by which a robot figures out where it is while also building a map of its surroundings. It's been a core challenge in robotics for decades, and honestly, I think a lot of people outside the field assume it's a solved problem by now. It's not. Two new papers posted to arXiv this week suggest researchers are still finding meaningful ways to push it forward, and both of them are leaning on a technique called 3D Gaussian Splatting to do it.

What's Gaussian Splatting, and Why Does It Keep Showing Up?

If you've been following robotics or computer vision for the past couple of years, you've probably seen the term "3D Gaussian Splatting" (3DGS) a lot. I should know this better than I do, but here's my working understanding: instead of representing a scene as a mesh or a point cloud, 3DGS represents it as a collection of overlapping 3D Gaussians, soft-edged blobs that each have their own position, size, orientation, color, and opacity. To draw an image, the renderer projects ("splats") those blobs onto the screen and blends them together. The result is a representation that's fast to render and surprisingly good at capturing fine visual detail.
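If it helps to see the idea in code, here's a rough sketch of what one of those blobs might look like as a data structure, plus the front-to-back blending that turns a stack of them into a color. To be clear about what's assumed: the names `Gaussian3D`, `alpha_at`, and `composite` are mine, not from either paper or any library, and the per-blob opacity field follows the standard 3DGS formulation. Real renderers project each blob onto the image plane before blending; this toy version skips that step and just evaluates blobs directly at a 3D point.

```python
import math
from dataclasses import dataclass


@dataclass
class Gaussian3D:
    """One blob in the scene (hypothetical structure, for illustration only)."""
    position: tuple  # (x, y, z) center of the blob
    scale: tuple     # (sx, sy, sz) standard deviation along each local axis
    rotation: tuple  # unit quaternion (w, x, y, z) giving the orientation
    color: tuple     # (r, g, b), each in [0, 1]
    opacity: float   # peak opacity at the center, in [0, 1]


def quat_to_matrix(q):
    """Convert a unit quaternion (w, x, y, z) to a 3x3 rotation matrix."""
    w, x, y, z = q
    return [
        [1 - 2 * (y * y + z * z), 2 * (x * y - w * z), 2 * (x * z + w * y)],
        [2 * (x * y + w * z), 1 - 2 * (x * x + z * z), 2 * (y * z - w * x)],
        [2 * (x * z - w * y), 2 * (y * z + w * x), 1 - 2 * (x * x + y * y)],
    ]


def alpha_at(g, point):
    """Opacity that blob g contributes at a 3D point (simplified: no projection)."""
    d = [p - c for p, c in zip(point, g.position)]
    R = quat_to_matrix(g.rotation)
    # Express the offset in the blob's own axes: local = R^T d
    local = [sum(R[r][c] * d[r] for r in range(3)) for c in range(3)]
    # Squared distance measured in standard deviations along each local axis
    m2 = sum((l / s) ** 2 for l, s in zip(local, g.scale))
    return g.opacity * math.exp(-0.5 * m2)


def composite(samples):
    """Blend (color, alpha) pairs front to back; samples must be sorted near to far."""
    out = [0.0, 0.0, 0.0]
    transmittance = 1.0  # fraction of light not yet blocked by nearer blobs
    for color, alpha in samples:
        for i in range(3):
            out[i] += transmittance * alpha * color[i]
        transmittance *= 1.0 - alpha
    return tuple(out)
```

The point of the sketch is that each blob is just a handful of numbers, and rendering is mostly multiply-and-add. That's a big part of why the representation is fast enough for robotics people to get interested.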

The technique took off in computer graphics, where people used it to generate photorealistic novel views of a scene from a set of ordinary photos. Then robotics researchers started asking: what if we used this for mapping? What if a robot could build one of these Gaussian representations in real time, as it moves through the world?
