Two New Papers Tackle the Same Problem from Opposite Ends: Getting Robots to Move Like They Mean It

A graph diffusion approach to inverse kinematics and an unsupervised motion retargeting framework both dropped this week, and they're more connected than the coverage suggests.

3 June 20268 min de lecture

Most of the discussion I've seen around these two papers treats them as unrelated work. One is about inverse kinematics, the other about motion retargeting. Different problems, different solutions, move along. But actually, the research shows something more interesting: both teams are grappling with the same fundamental challenge of getting robots to produce physically plausible motion without drowning in the combinatorial complexity of articulated systems. They've just approached it from opposite directions.

Let me back up and explain why this matters.

The inverse kinematics problem is deceptively simple to state. You want a robot's hand to be at position X with orientation Y. What joint angles get you there? For a simple arm, this is undergraduate-level math. For a humanoid robot with 30+ degrees of freedom, multiple end-effectors, and the requirement that solutions be physically stable, it becomes genuinely hard. The issue isn't finding a solution; it's that there are infinitely many solutions for redundant systems, most of which will cause the robot to fall over, collide with itself, or move in ways that look obviously wrong to human observers.

Traditional approaches use optimization, basically gradient descent toward a target pose while respecting joint limits. This works, sort of, but it's slow, gets stuck in local minima, and produces solutions that are technically correct but often weird. Learning-based methods have improved things, but most treat the robot as a fixed architecture. Train on one robot, deploy on that robot. Want to use a different platform? Retrain from scratch.

arXiv published GraphDiff-IK this week, and it's worth noting that the core insight here is genuinely new rather than incremental. The authors represent the robot as a kinematic graph constructed directly from the URDF file (the standard robot description format), where nodes are actuated joints and edges encode kinematic dependencies. Then they formulate inverse kinematics as a conditional graph diffusion process that generates joint configurations directly on this graph structure.

More in Humanoids

The headlines are celebrating a $2.5B humanoid robotics deal. I'd pump the brakes a little.

Mark Kowalski · 25 Jun · 6 min

Sometimes the sources don't pan out. Here's what happened when I tried to write a humanoids story this week and ended up with Samsung deals instead.

Sarah Williams · 25 Jun · 3 min

Diffusion models are getting good at imagining robot movements, but 'imaginable' and 'physically possible' aren't the same thing. Researchers are starting to close that gap.

Sarah Williams · 25 Jun · 6 min

A batch of fresh robotics research tackles the same underlying problem from different angles: robots that can see but don't really understand where things are.

Sources