Research2026-05-12
Agent-X: Full Pipeline Acceleration of On-device AI Agents
Source: Arxiv CS.AI
arXiv:2605.10380v1 Announce Type: new Abstract: LLM-based agents deliver state-of-the-art performance across tasks but incur high end-to-end latency on edge devices. We introduce Agent-X, a software-only, accuracy-preserving framework that accelerates both the prefill and decode stages of on-device...
arxivpapersagents