Space robots and kitchen robots are learning the same lesson: humans still need to hold the wheel

Two new papers show reinforcement learning works better when we stop pretending AI can figure everything out alone.

2 hours ago · 6 min read

Image credit: Lottie animation by Centre Robotics (LottieFiles Free, used with credit).

Why do we keep learning the same lesson over and over?

I've been covering tech long enough to remember when neural networks were going to replace programmers entirely (they didn't), when expert systems would automate doctors (nope), and when self-driving cars were five years away (that was fifteen years ago). The pattern is always the same: researchers announce that AI can do X autonomously, then quietly discover that actually, humans need to stay involved, and the real breakthrough is figuring out exactly how much involvement and when.

Two papers crossed my desk this week that, on the surface, have nothing to do with each other. One's about space manipulators, the kind of robotic arms that service satellites. The other's about teaching a Franka robot arm to do contact-rich manipulation tasks in a lab. Different domains, different teams, different continents probably. But they're both grappling with the same fundamental problem, and arriving at remarkably similar conclusions.

The space problem

Researchers from (I'm guessing) a Chinese university have been working on what they call "Dual-Agent Coordinated Manipulation Planning," or DACMP. The setup: you've got a spacecraft with a 6-degree-of-freedom robotic arm attached to it. You want the arm to reach out and grab something, maybe a satellite that needs servicing, maybe debris that needs clearing. Simple enough on Earth. In space? The moment that arm moves, Newton's third law kicks in and your entire spacecraft starts rotating in the opposite direction.
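To put a rough number on that reaction, here's a back-of-the-envelope sketch. It is not from the paper: it's a toy single-axis model that I'm assuming for illustration, where the arm and the spacecraft body each have a fixed moment of inertia about a shared axis and the whole system starts with zero angular momentum. The inertia values and function name are made up.

```python
import math


def base_rotation(arm_sweep_rad, arm_inertia, base_inertia):
    """Rotation of a free-floating base caused by an arm sweep (toy model).

    Assumes one shared rotation axis, constant inertias, and zero total
    angular momentum:
        base_inertia * w_base + arm_inertia * (w_base + w_joint) = 0
    Solving for w_base and integrating over the sweep gives the result.
    """
    return -arm_inertia * arm_sweep_rad / (base_inertia + arm_inertia)


# Hypothetical numbers, chosen only to show the scale of the effect.
ARM_INERTIA = 50.0    # kg*m^2, arm about the shared axis
BASE_INERTIA = 950.0  # kg*m^2, spacecraft body about the same axis

sweep = math.radians(90.0)  # the joint swings the arm through 90 degrees
drift = base_rotation(sweep, ARM_INERTIA, BASE_INERTIA)

print(f"Base drifts {math.degrees(drift):.1f} degrees while the arm sweeps 90")
```

Even in this stripped-down version, an arm carrying 5% of the total inertia turns the spacecraft a few degrees the wrong way, which is plenty to miss a grasp. A real 6-degree-of-freedom arm makes the inertias change with the arm's pose, so the coupling is far messier than this one-line formula.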

This is the dynamic coupling problem, and it's been a headache for space robotics since the Canadarm days. The paper's contribution isn't the problem statement, it's the solution architecture. They use deep reinforcement learning, which is standard these days, but with a twist they call "Timestep-level Expert Switching Guidance" or TESG.
