Two New Papers Show Why Humanoid Robots Still Can't Reliably Pick Up a Heavy Box

Researchers are making real progress on the 'sim-to-real' gap, but the solutions reveal just how far we are from robots that work outside the lab.

By Sarah Williams

6 hours ago · 5 min read

Image credit: Lottie animation by Centre Robotics (LottieFiles Free, used with credit).

Here's something that should be simple: a humanoid robot picks up a six-kilogram box from the floor and places it on a shelf.

It's the kind of task a warehouse worker does hundreds of times a day without thinking. But for robots, this remains genuinely hard. Two papers published this week on arXiv tackle different pieces of this problem, and honestly, reading them back to back tells you a lot about where humanoid manipulation actually stands.

The weight problem nobody talks about

The first paper, which introduces SplitAdapter, addresses something I initially thought was a solved problem: how do you train a robot in simulation and have it work in the real world when the object it's carrying changes weight?

Turns out, it's messier than I expected.

When a humanoid picks up a 2 kg box versus a 6 kg box, everything changes. The robot's center of mass shifts. Its joints experience different torques. The timing of its steps needs adjustment. And here's the tricky part: these dynamics interact with each other in ways that are hard to disentangle.
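To get a feel for the scale of that change, here's a back-of-the-envelope sketch in Python. The robot mass and the distance at which the box is held are made-up illustrative values, not figures from either paper, and the model is purely static (a point-mass box held in front of a point-mass robot).

```python
G = 9.81  # gravitational acceleration, m/s^2


def com_shift(robot_mass, box_mass, box_offset):
    """Forward shift (m) of the combined center of mass when the box is
    held box_offset metres ahead of the robot's own center of mass."""
    return box_mass * box_offset / (robot_mass + box_mass)


def static_load_torque(box_mass, lever_arm):
    """Static torque (N m) the box's weight exerts about a joint that is
    lever_arm metres away horizontally."""
    return box_mass * G * lever_arm


# Hypothetical values chosen only for illustration.
ROBOT_MASS = 50.0  # kg
BOX_OFFSET = 0.4   # m, horizontal distance from robot COM to box

for box_mass in (2.0, 6.0):
    shift_cm = 100 * com_shift(ROBOT_MASS, box_mass, BOX_OFFSET)
    torque = static_load_torque(box_mass, BOX_OFFSET)
    print(f"{box_mass:.0f} kg box: COM shift {shift_cm:.1f} cm, "
          f"static torque {torque:.1f} N m")
```

Under these assumptions the center of mass moves forward by roughly 1.5 cm with the light box and 4.3 cm with the heavy one, while the static torque triples. A real controller also has to deal with the dynamic effects during walking, which this sketch ignores.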

Previous approaches tried to handle this by compressing all the relevant information (object weight, robot dynamics, contact forces) into a single latent representation. The robot would, in theory, figure out what mattered. But the SplitAdapter team found this breaks down under heavy loads. The representation gets muddy, and the robot's performance degrades exactly when you need it most.

Their solution is to factor the problem explicitly. One encoder handles object and load awareness. Another handles robot dynamics. They train these separately with different objectives, then combine them using Feature-wise Linear Modulation, or FiLM (basically, a learned per-feature scale and shift that lets each factor influence the robot's behavior without interfering with the other).
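For readers who haven't met FiLM before, here's a minimal sketch of the general technique in plain Python. It is not SplitAdapter's actual architecture: the layer sizes, the random weights, the choice to let the load latent modulate the dynamics features (rather than the other way round), and the "1 + scale" centering are all assumptions made for illustration.

```python
import random


def make_linear(in_dim, out_dim, rng):
    """Toy linear layer: small random weights, zero bias."""
    weights = [[rng.gauss(0.0, 0.1) for _ in range(in_dim)]
               for _ in range(out_dim)]
    bias = [0.0] * out_dim
    return weights, bias


def apply_linear(layer, x):
    """Compute W x + b for a layer built by make_linear."""
    weights, bias = layer
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def film(features, conditioning, scale_layer, shift_layer):
    """FiLM: per-feature affine modulation, gamma * features + beta,
    where gamma and beta are predicted from the conditioning vector."""
    # Centering gamma on 1 means zero conditioning leaves features unchanged
    # (a common convention, assumed here).
    gamma = [1.0 + g for g in apply_linear(scale_layer, conditioning)]
    beta = apply_linear(shift_layer, conditioning)
    return [g * f + b for g, f, b in zip(gamma, features, beta)]


# Hypothetical sizes for the two latent vectors.
LOAD_DIM, DYN_DIM = 8, 32
rng = random.Random(0)
scale_layer = make_linear(LOAD_DIM, DYN_DIM, rng)
shift_layer = make_linear(LOAD_DIM, DYN_DIM, rng)

load_latent = [rng.gauss(0.0, 1.0) for _ in range(LOAD_DIM)]    # stand-in for the load encoder output
dyn_features = [rng.gauss(0.0, 1.0) for _ in range(DYN_DIM)]    # stand-in for the dynamics encoder output

modulated = film(dyn_features, load_latent, scale_layer, shift_layer)
print(len(modulated))
```

The point of the design is that the conditioning signal only rescales and offsets each feature. It never gets concatenated into the same vector, which is one plausible reading of why the two factors stay out of each other's way.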
