Research | 2026-04-27
Universal Transformers Need Memory: Depth-State Trade-offs in Adaptive Recursive Reasoning
Source: arXiv cs.AI
arXiv:2604.21999v1 Announce Type: cross
Abstract: We study learned memory tokens as a computational scratchpad for a single-block Universal Transformer (UT) with Adaptive Computation Time (ACT) on Sudoku-Extreme, a combinatorial reasoning benchmark. We find that memory tokens are empirically...
arxiv, papers, reasoning