BeClaude
Research2026-05-08

Structural Instability of Feature Composition

Source: Arxiv CS.AI

arXiv:2605.05223v1 Announce Type: cross Abstract: Sparse Autoencoders (SAEs) have emerged as a powerful paradigm for disentangling feature superposition in transformer-based architectures, enabling precise control via activation steering. However, the theoretical foundations of compositional...

arxivpapersstability-ai