BeClaude
Research2026-04-28

Green Shielding: A User-Centric Approach Towards Trustworthy AI

Source: Arxiv CS.AI

arXiv:2604.24700v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly deployed, yet their outputs can be highly sensitive to routine, non-adversarial variation in how users phrase queries, a gap not well addressed by existing red-teaming efforts. We propose Green...

arxivpapers