BeClaude
Research2026-05-07

EvoLM: Self-Evolving Language Models through Co-Evolved Discriminative Rubrics

Source: Arxiv CS.AI

arXiv:2605.03871v1 Announce Type: new Abstract: Language models encode substantial evaluative knowledge from pretraining, yet current post-training methods rely on external supervision (human annotations, proprietary models, or scalar reward models) to produce reward signals. Each imposes a...

arxivpapers