Planetary Rovers Are Learning to Walk on Sand, and the Results Are Surprisingly Practical

New research from NASA JPL and university labs shows reinforcement learning can teach rovers to handle loose soil without getting stuck, cutting energy use by 37% on sandy slopes.

10 June 20266 min read

The rover sits at the base of a 20-degree sandy slope, its wheels half-buried in the kind of loose, dry soil that has stranded countless machines before it. A traditional suspension system would spin uselessly here, digging deeper with every rotation. But this one, a four-wheeled prototype called ERNEST, shifts its weight forward, reconfigures its wheel angles, and climbs. No human is driving. A neural network trained entirely in simulation is making every decision.

I've seen enough spec sheets to know when a lab demo is just that, a demo. But the research coming out of planetary robotics labs this month suggests something more substantial: reinforcement learning is finally solving the granular terrain problem that has plagued off-world rovers for decades.

What makes loose soil so difficult?

Anyone who's driven on a beach knows the feeling. Wheels sink. Traction disappears. The harder you push, the worse it gets.

For planetary rovers, this isn't an inconvenience. It's mission-critical. The lunar surface is covered in regolith, a fine, powdery material that behaves nothing like the rigid ground most robots are designed for. Mars has similar challenges. And the physics involved, something called Bekker-Wong terramechanics, is notoriously difficult to model accurately.

Traditional rover designs address this with wide wheels, low speeds, and conservative path planning. The Curiosity rover on Mars, for instance, moves at a maximum speed of about 0.14 km/h. That's not a typo. It's slower than a garden snail.

The new research takes a different approach: instead of avoiding difficult terrain, teach the rover to adapt to it.

Related coverage

More in Autonomy

A startup called REO says it will sell a pickup truck for $21,500. The price is striking. The evidence for it is less so.

Aisha Patel · 24 Jun · 9 min

Researchers are patching the 'trajectory scoring gap' in sidewalk robots with VLMs and human attention modeling. The ideas are clever. The caveats are real.

Mark Kowalski · 20 Jun · 6 min

Two new papers tackle one of robotics' most stubborn problems: getting a robot to figure out its location using LiDAR, without needing to have visited the place before.

Sarah Williams · 19 Jun · 5 min

The defense tech startup is moving from drones to full autonomous fighters, and it raises questions about where the line between AI autonomy and human oversight actually sits.

Metric	ERNEST (wheeled)	Lunar quadruped	Neuromorphic RL
Platform	4-wheeled rover	Simulated quadruped	A1 quadruped (sim)
Terrain tested	Sand, rocks, slopes	Lunar regolith (simulated)	Uneven rigid terrain
Sim-to-real transfer	Yes	No	No
Energy reduction	37% on sandy slopes	Not demonstrated	Comparable to baseline
Training efficiency	Not reported	Not reported	4.3x memory improvement

Planetary Rovers Are Learning to Walk on Sand, and the Results Are Surprisingly Practical

What makes loose soil so difficult?

More in Autonomy

ERNEST and the active suspension breakthrough

The lunar quadruped problem

Neuromorphic learning: a wildcard approach

What the numbers actually say

The bigger picture

What comes next

Sources