Policy2026-04-28
Patching LLM Like Software: A Lightweight Method for Improving Safety Policy in Large Language Models
Source: Arxiv CS.AI
arXiv:2511.08484v2 Announce Type: replace Abstract: We propose patching for large language models (LLMs) like software versions, a lightweight and modular approach for addressing safety vulnerabilities. While vendors release improved LLM versions, major releases are costly, infrequent, and...
arxivpaperssafety