New Research Shows Robots Learning to Actually Think About Their Mistakes

Two papers from arXiv tackle the same problem I watched engineers struggle with for years: getting robots to learn from failure instead of just failing repeatedly.

27 May 20263 min de lecture

A pair of new research papers are tackling something that's bothered me since my Kuka days: robots that make the same dumb mistake over and over because they can't reflect on what went wrong.

Look, here's the thing. When I was at Kuka, we had a palletizing cell that would occasionally knock a box off the stack. Same box, same position, maybe twice a week. The robot had no idea it had done anything wrong. Next cycle, same motion, same risk. We ended up solving it with better fixturing and some sensor work, but the underlying problem never went away. The robot couldn't learn from its own failures.

These two papers from arXiv are finally attacking that problem head-on. The first one, "Reflective Test-Time Planning," introduces what the researchers call reflection-in-action and reflection-on-action. Basically, the robot thinks before it moves (generating and scoring multiple options) and then actually updates its behaviour after something goes wrong. They even added retrospective reflection, which lets the system go back and reconsider earlier decisions with hindsight. I called my old colleague at Siemens about this, and his reaction was basically "about time."

They tested it on a Franka Panda arm, which is a decent choice for research but, I'll be honest, a far cry from the payload and speed requirements of a real production cell. Still, the approach is sound. The idea of a robot that can assign credit (or blame) to decisions made several steps earlier is something we've needed for years. Anyone who's debugged a 47-step assembly sequence knows what I mean.

More in Industrial

The Apple supplier priced its shares at the maximum and still had to turn away demand, which tells you something about where hardware money is flowing right now.

James Chen · 25 Jun · 5 min

Prime Day deals on Echos and Ring cameras are fine, but let's not confuse consumer gadgets with the serious robotics work happening in warehouses.

Robert "Bob" Macintosh · 25 Jun · 3 min

Amazon's CEO made his first India trip and left behind a $13 billion AI commitment and an aggressive quick-commerce expansion. The numbers are real. The execution is the hard part.

James Chen · 25 Jun · 6 min

A wave of arXiv preprints this week tackles one of manipulation's oldest problems: how do you get a robot to learn from imperfect, incomplete, or just plain missing data?

Sources