BeClaude
Policy2026-05-08

SafeHarbor: Hierarchical Memory-Augmented Guardrail for LLM Agent Safety

Source: Arxiv CS.AI

arXiv:2605.05704v1 Announce Type: cross Abstract: With the rapid evolution of foundation models, Large Language Model (LLM) agents have demonstrated increasingly powerful tool-use capabilities. However, this proficiency introduces significant security risks, as malicious actors can manipulate...

arxivpapersagentssafety