Research2026-04-22

FASTER: Value-Guided Sampling for Fast RL

arXiv:2604.19730v1 Announce Type: cross Abstract: Some of the most performant reinforcement learning algorithms today can be prohibitively expensive as they use test-time scaling methods such as sampling multiple action candidates and selecting the best one. In this work, we propose FASTER, a...

Read Original Article on Arxiv CS.AI

arxivpapers