Research2026-05-12
Auditing Data Membership in Reinforcement Learning With Verifiable Rewards
Source: Arxiv CS.AI
arXiv:2511.14045v2 Announce Type: replace-cross Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) has become a core training stage in recent large language models (LLMs). Its reliance on non-public, high-value prompt sets raises concerns about unauthorized data use, creating a need...
arxivpapersrl