Research 2026-05-12
PiCA: Pivot-Based Credit Assignment for Search Agentic Reinforcement Learning
Source: arXiv cs.AI
arXiv:2605.09287v1 Announce Type: new
Abstract: Large Language Model (LLM)-based search agents trained with reinforcement learning (RL) have significantly improved performance on knowledge-intensive tasks. However, existing methods encounter critical challenges in long-horizon credit...
arxiv papers agents rl