Research2026-05-01
From surveillance to signalling: escalation channels as environmental controls for agentic AI
Source: Arxiv CS.AI
arXiv:2510.05192v2 Announce Type: replace-cross Abstract: When AI agents operating with access to sensitive information encounter a conflict between completing an assigned task and following rules or ethical constraints, they can resort to unsanctioned behaviour. Existing inference time safety work...
arxivpapersagents