Policy2026-05-12
RePO-VLA: Recovery-Driven Policy Optimization for Vision-Language-Action Models
Source: Arxiv CS.AI
arXiv:2605.09410v1 Announce Type: cross Abstract: Vision-Language-Action (VLA) models remain brittle in long-horizon, contact-rich manipulation because success-only imitation provides little supervision for execution drift, while failed rollouts are often discarded. We introduce RePO-VLA, a...
arxivpapersvision