Research 2026-04-27
Math Takes Two: A test for emergent mathematical reasoning in communication
Source: Arxiv CS.AI
arXiv:2604.21935v1. Abstract: Although language models demonstrate remarkable proficiency on mathematical benchmarks, it remains unclear whether this reflects true mathematical reasoning or statistical pattern matching over learning formal syntax. Most existing evaluations rely on...
arxiv papers reasoning