Research | 2026-04-24

Why Do Language Model Agents Whistleblow?

arXiv:2511.17085v3

Abstract: The deployment of Large Language Models (LLMs) as tool-using agents causes their alignment training to manifest in new ways. Recent work finds that language models can use tools in ways that contradict the interests or explicit instructions...

Read Original Article on Arxiv CS.AI

Tags: arxiv, papers, agents