Reinforcement Learning Is Finally Making Drones Fly Better, and I've Seen This Before

Two new papers show RL-based controllers outperforming traditional PID systems, but let's not pretend this is the first time someone promised smarter flight control.

2 June 20265 min de leitura

Two papers dropped on arXiv this month showing reinforcement learning controllers beating traditional PID systems for drone flight control, and if you've been around long enough, you're probably already bracing for the hype cycle.

I've seen this movie before. Back in the early 2010s, everyone was convinced machine learning would replace classical control theory within five years. Then reality happened. But here's the thing, these new results are actually pretty interesting, and I'm not just saying that because I need something to write about.

The first paper, from researchers working with a Twin Rotor Aerodynamic System (TRAS), used something called Twin Delayed Deep Deterministic Policy Gradient, or TD3 if you don't want to sound like you're ordering off a menu. The TRAS is basically a helicopter training rig that's notoriously difficult to control because of its nonlinear dynamics and coupling effects between the rotors. Traditional control algorithms struggle with it. The researchers trained an RL agent to stabilize the system at specific pitch and azimuth angles and track trajectories, and here's what caught my attention: they actually tested it against wind disturbances and ran experiments on real hardware, not just simulations.

That last part matters more than you'd think. A lot of RL control papers stay safely in simulation land where physics is negotiable and motors never fail. These folks actually built the thing and watched it work. Or at least, work better than PID under the same conditions.

The second paper from arXiv takes a different approach to quadrotor control that, call me old-fashioned, I find more elegant. Instead of having the RL agent directly control motor RPMs (which is what most papers do), they have it output a thrust vector, basically telling the drone "go this direction with this much force" and letting a conventional PID controller handle the low-level motor commands.

Cobertura relacionada

More in Drones

A pair of arXiv preprints tackle the same hard problem in drone control from very different angles. The results are promising, but the gap between 'outdoor experiment' and 'deployed system' remains large.

Aisha Patel · 25 Jun · 9 min

The Air 3S discount is real, but the urgency behind it has less to do with Amazon's sale and more to do with where DJI hardware might be headed.

Mark Kowalski · 23 Jun · 6 min

Procurement is the easy headline. Actually building enough drones, fast enough, with American parts, is the problem nobody's solved yet.

Mark Kowalski · 20 Jun · 7 min

The FCC just removed a narrow category of toy drones from its Covered List. It's a small move, but it tells you something about where the regulatory wind is blowing.

Fontes