BeClaude
Back to News
Research2026-04-17

Document-tuning for robust alignment to animals

Source: Arxiv CS.AI

arXiv:2604.13076v1 Announce Type: cross Abstract: We investigate the robustness of value alignment via finetuning with synthetic documents, using animal compassion as a value that is both important in its own right and orthogonal to existing alignment efforts. To evaluate compassionate reasoning,...

arxivpapers