Research2026-04-28
Green Shielding: A User-Centric Approach Towards Trustworthy AI
Source: Arxiv CS.AI
arXiv:2604.24700v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly deployed, yet their outputs can be highly sensitive to routine, non-adversarial variation in how users phrase queries, a gap not well addressed by existing red-teaming efforts. We propose Green...
arxivpapers