Research 2026-05-07
Safety and accuracy follow different scaling laws in clinical large language models
Source: arXiv cs.AI
arXiv:2605.04039v1
Announce Type: cross
Abstract: Clinical LLMs are often scaled by increasing model size, context length, retrieval complexity, or inference-time compute, with the implicit expectation that higher accuracy implies safer behavior. This assumption is incomplete in medicine, where a...
arxiv papers safety