Research 2026-04-24
CorridorVLA: Explicit Spatial Constraints for Generative Action Heads via Sparse Anchors
Source: arXiv cs.AI
arXiv:2604.21241v1 Announce Type: cross
Abstract: Vision-Language-Action (VLA) models often use intermediate representations to connect multimodal inputs with continuous control, yet spatial guidance is often injected implicitly through latent features. We propose CorridorVLA, which predicts...