Research2026-05-01
ANCORA: Learning to Question via Manifold-Anchored Self-Play for Verifiable Reasoning
Source: Arxiv CS.AI
arXiv:2604.27644v1 Announce Type: cross Abstract: We propose a paradigm shift from learning to answer to learning to question: can a language model generate verifiable problems, solve them, and turn the resulting feedback into self-improvement without human supervision? We introduce ANCORA, an...
arxivpapersreasoning