Research2026-04-24
LASA: Language-Agnostic Semantic Alignment at the Semantic Bottleneck for LLM Safety
Source: Arxiv CS.AI
arXiv:2604.12710v2 Announce Type: replace-cross Abstract: Large language models (LLMs) often demonstrate strong safety performance in high-resource languages, yet exhibit severe vulnerabilities when queried in low-resource languages. We attribute this gap to a mismatch between language-agnostic...
arxivpaperssafety