BeClaude
Research2026-04-27

Universal Transformers Need Memory: Depth-State Trade-offs in Adaptive Recursive Reasoning

Source: Arxiv CS.AI

arXiv:2604.21999v1 Announce Type: cross Abstract: We study learned memory tokens as computational scratchpad for a single-block Universal Transformer (UT) with Adaptive Computation Time (ACT) on Sudoku-Extreme, a combinatorial reasoning benchmark. We find that memory tokens are empirically...

arxivpapersreasoning