Two Papers Just Made Robot Control Smoother. Here's Why That Actually Matters.

New reinforcement learning techniques tackle the jitter problem that's been plaguing autonomous systems for years, and honestly, it's about time.

2 June 2026 · 6 min read

Why do robots still move like they're having a seizure?

If you've ever watched a reinforcement learning demo and thought "that robot moves weird," you're not imagining things. The jerky, twitchy movements you see in lab videos are called action jitter, and it's been a dirty secret of the RL community for longer than most of these young founders have been coding.

I've seen this movie before. Back in the early days of industrial robotics, we had similar problems with motor control, and engineers solved them with filters and dampers and lots of trial and error. The RL crowd has been trying to solve it with math, which, call me old-fashioned, seems like the harder path. But two new papers out this week suggest they might finally be getting somewhere.

The first, from a team publishing on arXiv, introduces something called ZAPS-DA (because of course it needs an acronym). The second tackles the same fundamental problem but for fixed-wing UAVs, using what they call HJB-inspired residual filtering. Both papers are attacking the same enemy: policies that work great in simulation but would destroy physical hardware in about thirty seconds flat.

What's actually causing the jitter?

Here's the thing most people outside the field don't understand: continuous control policies trained with off-policy reinforcement learning tend to output high-frequency noise in their actions. The neural network thinks it's making tiny optimizations, but what it's actually doing is telling a motor to go left-right-left-right-left-right faster than the hardware can physically respond.
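To make the jitter concrete, here is a minimal sketch in Python. It is not the method from either paper. It assumes a made-up one-dimensional steering signal with uniform noise layered on top, measures jitter as the mean absolute change between consecutive actions, and applies the old-school fix: a first-order low-pass filter. The names `action_jitter` and `ema_filter`, the noise level, and the `alpha` value are all illustrative assumptions.

```python
import math
import random


def action_jitter(actions):
    """Mean absolute change between consecutive actions; higher means twitchier."""
    return sum(abs(b - a) for a, b in zip(actions, actions[1:])) / (len(actions) - 1)


def ema_filter(actions, alpha=0.2):
    """Exponential moving average, i.e. a simple first-order low-pass filter."""
    smoothed = [actions[0]]
    for a in actions[1:]:
        smoothed.append(alpha * a + (1 - alpha) * smoothed[-1])
    return smoothed


rng = random.Random(0)  # fixed seed so the run is repeatable

# Hypothetical policy output: a slow intended command plus high-frequency noise.
intended = [math.sin(2 * math.pi * t / 200) for t in range(400)]
raw = [u + rng.uniform(-0.3, 0.3) for u in intended]
filtered = ema_filter(raw)

print(f"jitter (raw):      {action_jitter(raw):.3f}")
print(f"jitter (filtered): {action_jitter(filtered):.3f}")
```

The filtered signal scores far lower on the jitter metric, but a filter like this also adds lag: the smoothed command trails the intended one. Bolting a filter on after training is therefore not a free lunch.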

