Research2026-05-12
VLADriver-RAG: Retrieval-Augmented Vision-Language-Action Models for Autonomous Driving
Source: Arxiv CS.AI
arXiv:2605.08133v1 Announce Type: cross Abstract: Vision-Language-Action (VLA) models have emerged as a promising paradigm for end-to-end autonomous driving, yet their reliance on implicit parametric knowledge limits generalization in long-tail scenarios. While Retrieval-Augmented Generation (RAG)...
arxivpapersragvision