Research2026-04-27
ReCast: Recasting Learning Signals for Reinforcement Learning in Generative Recommendation
Source: Arxiv CS.AI
arXiv:2604.22169v1 Announce Type: cross Abstract: Generic group-based RL assumes that sampled rollout groups are already usable learning signals. We show that this assumption breaks down in sparse-hit generative recommendation, where many sampled groups never become learnable at all. We propose...
arxivpapersrl