Research2026-04-24
The Specification Trap: Why Static Value Alignment Alone Is Insufficient for Robust Alignment
Source: Arxiv CS.AI
arXiv:2512.03048v5 Announce Type: replace Abstract: Static content-based AI value alignment is insufficient for robust alignment under capability scaling, distributional shift, and increasing autonomy. This holds for any approach that treats alignment as optimizing toward a fixed formal...
arxivpapers