Research2026-04-23
Alignment midtraining for animals
Source: Arxiv CS.AI
arXiv:2604.13076v2 Announce Type: replace-cross Abstract: We investigate the robustness of value alignment via finetuning with synthetic documents, using animal compassion as a value that is both important in its own right and orthogonal to existing alignment efforts. To evaluate compassionate...
arxivpapers