Research | 2026-05-11

CyBiasBench: Benchmarking Bias in LLM Agents for Cyber-Attack Scenarios

arXiv:2605.07830v1 (announce type: cross)

Abstract: Large language models (LLMs) are increasingly deployed as autonomous agents in offensive cybersecurity. In this paper, we reveal an interesting phenomenon: different agents exhibit distinct attack patterns. Specifically, each agent exhibits an...

Source: arXiv cs.AI

Tags: arxiv, papers, agents, benchmark