Robots That Know When to Leave Things Alone: Two New Papers Worth Your Time
A pair of recent robotics papers tackle something most automation engineers quietly wrestle with: what should a robot do when it's not sure?
By
So here's the question I keep getting from people who've been in this industry a while: when do we stop trying to make robots do more, and start making them smarter about what they shouldn't do?
Two papers out of arXiv's robotics section this month are poking at exactly that problem, and I'll be honest, they got my attention in a way that a lot of academic work doesn't.
The first one is about household rearrangement, but don't let that put you off. The system is called APOLLO, from a paper titled arXiv cs.RO "Abstention-Aware Personalized Object Rearrangement via Uncertainty-Guided LLM Assistance." The core idea is that a robot tidying up your home needs to know not just where to put things, but when to not put them anywhere at all. They call this "abstention," and it's a more useful concept than it sounds.
When I was at Kuka, we spent a lot of time on the placement side of pick-and-place. The assumption was always that if the robot could identify the object and had a target location, you were done. What we didn't think hard enough about was the cases where the target was wrong, obscured, or just ambiguous. The robot would commit anyway. Sometimes that was fine. Sometimes it wasn't. I watched a KR 150 cheerfully stack a brake component on top of another brake component in a way that made the QA team very unhappy. The robot wasn't wrong exactly, it just had no concept of uncertainty.
APOLLO addresses this with a hybrid approach. There's a lightweight personalised embedding model (they call it PEM) that runs entirely on CPU, which is a practical choice I appreciate. It's trained per user-environment pair with a small number of demonstrations, and crucially, it produces uncertainty estimates. When uncertainty is high, it kicks the decision up to an LLM. When it's low, it handles things locally. That's a sensible architecture, and it keeps LLM usage down substantially compared to prior baselines, which matters if you care about latency or privacy.
Related coverage
More in Industrial
Less than a month after New Glenn exploded on the pad in Florida, Blue Origin says it's back to work and targeting another launch before the end of 2026.
James Chen · 2 hours ago · 5 min
Everyone's talking about phone addiction and attention spans. Bob thinks the real automation story is somewhere else entirely.
Robert "Bob" Macintosh · 5 hours ago · 4 min
A batch of arXiv preprints on simulation environments, 3D scene understanding, and robot memory systems suggests the gap between training robots in sim and deploying them in the real world is finally getting smaller.
Robert "Bob" Macintosh · 13 hours ago · 4 min


