Research2026-04-22
HELM: Harness-Enhanced Long-horizon Memory for Vision-Language-Action Manipulation
Source: Arxiv CS.AI
arXiv:2604.18791v1 Announce Type: cross Abstract: Vision-Language-Action (VLA) models fail systematically on long-horizon manipulation tasks despite strong short-horizon performance. We show that this failure is not resolved by extending context length alone in the current reactive execution...
arxivpapersvision