Two New Papers Tackle the Same Self-Driving Problem. One Uses Brute Force, the Other Uses Brains.

Researchers are finally admitting that training autonomous vehicles on human driving data creates mushy, indecisive systems. The fixes are clever, but I've seen this movie before.

5 June 2026 · 6 min read

Here's the thing about autonomous vehicle research that nobody wants to admit: we've been training these systems wrong for years. Two papers dropped on arXiv this week that say basically the same thing: when you train an end-to-end driving model on lots of human demonstrations, you get a system that drives like the average of all those humans. Not the best human. Not even a competent human. The average. And the average, it turns out, is pretty mushy.

The researchers call this the "style-averaging" dilemma, which is a polite way of saying the car can't commit to a decision. Should it merge aggressively or wait? The training data contains both behaviors, so the model splits the difference and does something weird that neither an aggressive nor a cautious driver would do. Sometimes that weird thing is kinematically unsafe, which is a very academic way of saying the car plans a maneuver it can't physically pull off safely.
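To make the averaging problem concrete, here's a toy sketch. It is not from either paper: the speeds are made-up numbers for a single merge, and the "model" is just the one value that minimizes squared error against all the demonstrations. The point is only that a single-output predictor trained this way lands on the mean, which is a speed no driver in the data actually chose.

```python
from statistics import mean

# Hypothetical target speeds (m/s) picked by human drivers at the same merge.
aggressive = [14.0, 13.5, 14.5, 14.0]  # speed up and take the gap
cautious = [6.0, 5.5, 6.5, 6.0]        # slow down and wait
demos = aggressive + cautious


def mse(prediction, targets):
    """Mean squared error of one predicted speed against all demonstrations."""
    return sum((prediction - t) ** 2 for t in targets) / len(targets)


def nearest_demo_gap(prediction, targets):
    """Distance from the prediction to the closest thing a human actually did."""
    return min(abs(prediction - t) for t in targets)


# Stand-in for training: search 0.0 .. 20.0 m/s for the lowest-error output.
candidates = [x / 10 for x in range(0, 201)]
best = min(candidates, key=lambda p: mse(p, demos))

print(best)                           # 10.0, the mean of the demonstrations
print(mean(demos))                    # 10.0
print(nearest_demo_gap(best, demos))  # 3.5 m/s away from any real driver
```

Ten meters per second is too slow to take the gap and too fast to wait for it, which is the mushy middle in one number.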

So what are they actually proposing?

The first paper, D³-MoE from arXiv, takes what I'd call the brute force approach. Instead of generating one trajectory and hoping it's right, the system generates multiple trajectories in parallel, each representing a different driving "style." Then a downstream module picks the best one based on whatever criteria you care about. Want aggressive? Pick that trajectory. Want grandma-safe? Pick that one instead.
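The generate-then-pick idea is easy to sketch, with the caveat that everything below is my own illustration and not the paper's selector. The `Candidate` fields, the scoring weights, and the numbers are all hypothetical; the real system produces full trajectories, not three summary statistics.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    """Hypothetical summary of one generated trajectory."""
    style: str
    progress_m: float   # distance covered over the planning horizon
    min_gap_m: float    # closest approach to any other vehicle
    peak_accel: float   # harshest acceleration along the way, m/s^2


def score(c, w_progress, w_gap, w_comfort):
    """Higher is better: reward progress and clearance, penalize harshness."""
    return w_progress * c.progress_m + w_gap * c.min_gap_m - w_comfort * c.peak_accel


def select(candidates, **weights):
    """Downstream picker: return the candidate with the best weighted score."""
    return max(candidates, key=lambda c: score(c, **weights))


candidates = [
    Candidate("aggressive", progress_m=60.0, min_gap_m=2.0, peak_accel=3.0),
    Candidate("neutral", progress_m=45.0, min_gap_m=5.0, peak_accel=1.5),
    Candidate("cautious", progress_m=30.0, min_gap_m=9.0, peak_accel=0.8),
]

# Same candidates, different preferences, different winner.
assertive = select(candidates, w_progress=1.0, w_gap=1.0, w_comfort=1.0)
careful = select(candidates, w_progress=0.2, w_gap=5.0, w_comfort=2.0)

print(assertive.style)  # aggressive
print(careful.style)    # cautious
```

The design choice worth noticing is that nothing gets averaged: each candidate stays a coherent plan, and the preference only enters at selection time.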

The clever bit is how they handle the physics. They've decoupled longitudinal motion (speeding up, slowing down) from lateral motion (steering left, steering right) and trained separate expert networks for each. These experts don't need manual labels because they learn from the ground truth kinematics themselves, which is elegant if you think about it. The whole thing uses Diffusion Transformers, which are the hot architecture of the moment, and achieves what they claim is state-of-the-art performance on the NAVSIM benchmark: 88.2 PDMS by default, or 91.3 if you let it generate three options and pick the best.
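Here's one way to picture "learning from the ground truth kinematics." This is a rough sketch under my own assumptions, not the paper's pipeline: the `decompose` helper is hypothetical, and it uses plain finite differences on logged (x, y) waypoints to recover a longitudinal signal (change in speed) and a lateral signal (speed times yaw rate) without anyone hand-labeling a style.

```python
import math


def decompose(waypoints, dt):
    """Split (x, y) waypoints sampled every dt seconds into finite-difference
    estimates of longitudinal and lateral acceleration (m/s^2)."""
    speeds, headings = [], []
    for (x0, y0), (x1, y1) in zip(waypoints, waypoints[1:]):
        dx, dy = x1 - x0, y1 - y0
        speeds.append(math.hypot(dx, dy) / dt)
        headings.append(math.atan2(dy, dx))

    lon, lat = [], []
    for i in range(len(speeds) - 1):
        dpsi = headings[i + 1] - headings[i]
        dpsi = math.atan2(math.sin(dpsi), math.cos(dpsi))  # wrap to [-pi, pi]
        lon.append((speeds[i + 1] - speeds[i]) / dt)  # speeding up or slowing down
        lat.append(speeds[i] * dpsi / dt)             # positive means turning left
    return lon, lat


# Straight-line acceleration: all longitudinal, nothing lateral.
lon, lat = decompose([(0, 0), (1, 0), (3, 0), (6, 0)], dt=1.0)
print(lon)  # [1.0, 1.0]
print(lat)  # [0.0, 0.0]
```

A constant-speed curve gives the opposite picture: near-zero longitudinal values and a steady lateral one. Two separate signals like these are the kind of thing separate experts could be supervised on, though how D³-MoE actually routes them is for the paper to say.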
