Research2026-04-22
FASTER: Value-Guided Sampling for Fast RL
Source: Arxiv CS.AI
arXiv:2604.19730v1 Announce Type: cross Abstract: Some of the most performant reinforcement learning algorithms today can be prohibitively expensive as they use test-time scaling methods such as sampling multiple action candidates and selecting the best one. In this work, we propose FASTER, a...
arxivpapers