Research2026-05-11
Tool Calling is Linearly Readable and Steerable in Language Models
Source: Arxiv CS.AI
arXiv:2605.07990v1 Announce Type: cross Abstract: When a tool-calling agent picks the wrong tool, the failure is invisible until execution: the email gets sent, the meeting gets missed. Probing 12 instruction-tuned models across Gemma 3, Qwen 3, Qwen 2.5, and Llama 3.1 (270M to 27B), we find the...
arxivpapers