The Control Theory Renaissance Nobody's Talking About
Three papers crossed my desk this week that suggest we're finally getting serious about making robots do what we actually tell them to do.
Image credit: Lottie animation by Centre Robotics (LottieFiles Free, used with credit). · source
Look, here's the thing. I've been watching robotics research for longer than some of my readers have been alive, and every few years you see a cluster of papers that tells you the field is quietly pivoting. This week, three preprints hit arXiv that all circle the same problem: how do you get a robot to follow constraints without cheating?
When I was at Kuka, we spent an embarrassing amount of time on what we called "the fudge factor problem." You'd set up a controller, tell it to keep the end effector on a specific path, and it would sort of do that. Mostly. Unless it didn't feel like it. The standard approach was to add cost penalties, basically bribing the algorithm to stay on track. It worked well enough for pick and place. It fell apart the moment you needed precision.
Hard Constraints Are Hard
The first paper that caught my eye was from the RCI Lab, introducing something called Manifold-Constrained MPPI (arXiv). MPPI, Model Predictive Path Integral control, has been around for a while. It's derivative-free, which means you can throw it at messy nonlinear systems without doing calculus. The catch is it uses soft penalties. Tell it "don't hit the wall" and it treats that as a suggestion.
The new approach splits the problem in two. First, a variational autoencoder learns a compressed version of the constraint space, so your samples are already close to valid. Then a quadratic programming layer cleans up the residuals. They're running this on a 14-degree-of-freedom dual-arm system at 100 Hz. That's not trivial. I called my old colleague at Siemens who works on similar setups, and he was genuinely surprised they got it stable.
Now, I'll be honest, I haven't seen the real-world demos yet. The supplementary videos might show a carefully controlled environment. But the architecture is sound.
Teaching Humanoids Without the Geometric Crutch
The second paper tackles imitation learning for humanoids (arXiv). The standard pipeline goes: capture human motion, map it to robot kinematics, then figure out the dynamics. The authors argue this intermediate step introduces what they call "geometric bias." You're basically forcing the robot to move like a human-shaped puppet before it gets to think about physics.
Their Direct Dynamic Retargeting skips the kinematic projection entirely. You go straight from video to dynamically feasible trajectories using sampling-based MPC inside a physics simulator. The claim is that this produces better tracking and faster reinforcement learning convergence.
Sources
- Manifold-Constrained MPPI: Real-Time Sampling-Based Control Under Hard Constraints· arXiv — cs.RO (Robotics)
- Direct Dynamic Retargeting for Humanoid Imitation Learning from Videos· arXiv — cs.RO (Robotics)
- Implicit Null-space Manifold Generation for Redundant Robotic Systems· arXiv — cs.RO (Robotics)
Related coverage
More in Research
Forget the hype about AI designing robots. These researchers are solving the boring, fundamental problem that's held back the field for decades.
Mark Kowalski · 12 mins ago · 5 min
Researchers are building systems where robots evolve their own bodies and controllers together, then save what they learn for next time.
James Chen · 13 mins ago · 6 min
Researchers are finding ways to train robots with far less data, using human corrections and physics simulators instead of millions of demonstrations.
James Chen · 2 hours ago · 6 min
A batch of new research papers suggests we might finally be solving the sample efficiency problem that's plagued robotics for years, and I've seen this inflection point before.