Research2026-04-28
Value Alignment Tax: Measuring Value Trade-offs in LLM Alignment
Source: Arxiv CS.AI
arXiv:2602.12134v2 Announce Type: replace Abstract: Existing work on value alignment typically characterizes value relations statically, ignoring how alignment interventions, such as prompting, fine-tuning, or preference optimization, reshape the broader value system. In practice, aligning a target...
arxivpapers