The Great Humanoid Transfer Problem: Five Papers Point Toward a Post-Training-From-Scratch Future

A cluster of new research suggests we might finally be able to stop retraining humanoid control policies from scratch every time someone builds a new robot. The catch? We're not quite there yet.

By Aisha Patel

2 hours ago · 9 min read

Image credit: Lottie animation by Centre Robotics (LottieFiles Free, used with credit).

If you have ever trained a neural network to control a robot, you know the particular frustration of watching your carefully tuned policy fail completely the moment you swap in a different actuator or change the link lengths by a few centimeters. It is a bit like teaching someone to ride a bicycle, then handing them a unicycle and expecting the same performance. The body is different, the dynamics are different, and all that painstaking training seems to evaporate. This week, a cluster of five papers on arXiv suggests the robotics community is making serious progress on this problem, though (and I want to be precise here) the solutions remain partial and the claims require careful parsing.

The core challenge is what researchers call cross-embodiment transfer. You train a policy on Robot A, and you want it to work on Robot B without starting over. This matters enormously for humanoids specifically because, unlike industrial arms that follow standardized form factors, humanoid platforms are proliferating with wildly different morphologies. Unitree's G1, LimX's Oli and Luna, Figure's robots, Tesla's Optimus: each has different joint configurations, different mass distributions, different actuator characteristics. Training from scratch on each platform is expensive, time-consuming, and fundamentally wasteful if the underlying skills (walking, balancing, manipulating objects) are conceptually similar.

The paper that caught my attention first is Any2Any, which proposes what the authors call a paradigm for transferring whole-body tracking specialists across embodiments. The approach has two stages. First, kinematic alignment maps the input and output spaces between source and target robots so the pretrained policy's outputs are at least geometrically meaningful on the new platform. Second, dynamics adaptation applies parameter-efficient fine-tuning to modules that are particularly sensitive to the physical differences between robots. The headline result is genuinely striking: using only 1% of the compute and data required for full training, the authors claim to successfully transfer Sonic models pretrained on Unitree G1 to both LimX Oli and LimX Luna. That is a substantial efficiency gain if it holds up.
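
To make the two stages concrete, here is a minimal sketch in Python with NumPy. It is not the Any2Any implementation, and nothing in it comes from the paper itself. The joint names, the toy dimensions, the name-matching alignment rule, and the choice of a LoRA-style low-rank adapter as the parameter-efficient fine-tuning method are all my assumptions for illustration.

```python
import numpy as np

# Hypothetical joint orderings (assumed for illustration; real humanoids
# have far more joints, and these are not the G1 or Oli joint lists).
SOURCE_JOINTS = ["hip_pitch", "knee", "ankle_pitch", "waist_yaw"]
TARGET_JOINTS = ["knee", "hip_pitch", "ankle_pitch"]  # target lacks a waist joint


def alignment_matrix(source, target):
    """Stage 1 (kinematic alignment, simplified): build a 0/1 matrix of shape
    (len(target), len(source)) that copies each source joint command to the
    target joint with the same name. Target joints with no match get a zero row."""
    m = np.zeros((len(target), len(source)))
    for i, name in enumerate(target):
        if name in source:
            m[i, source.index(name)] = 1.0
    return m


class LowRankAdapter:
    """Stage 2 (dynamics adaptation, simplified): a frozen pretrained weight
    plus a small trainable low-rank update b @ a, in the style of LoRA."""

    def __init__(self, weight, rank, rng):
        self.weight = weight  # frozen pretrained weights
        out_dim, in_dim = weight.shape
        self.a = rng.normal(scale=0.01, size=(rank, in_dim))  # trainable
        self.b = np.zeros((out_dim, rank))  # trainable; zero init makes the adapter a no-op at first

    def __call__(self, x):
        return self.weight @ x + self.b @ (self.a @ x)

    def trainable_fraction(self):
        trainable = self.a.size + self.b.size
        return trainable / (self.weight.size + trainable)


rng = np.random.default_rng(0)
# Stand-in for the output layer of a pretrained policy (8 features in, 4 joints out).
pretrained = rng.normal(size=(len(SOURCE_JOINTS), 8))
layer = LowRankAdapter(pretrained, rank=1, rng=rng)

obs = rng.normal(size=8)
source_action = layer(obs)  # command in the source robot's joint order
target_action = alignment_matrix(SOURCE_JOINTS, TARGET_JOINTS) @ source_action
```

In this toy setup only the small `a` and `b` matrices would be updated during fine-tuning on the target robot, which is the general idea behind adapting a pretrained policy with a fraction of the original training budget. A real alignment step would also need to handle differing joint limits, axis conventions, and observation layouts, none of which this sketch attempts.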

Sources