Research2026-04-30
Benchmarking the Safety of Large Language Models for Robotic Health Attendant Control
Source: Arxiv CS.AI
arXiv:2604.26577v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly considered for deployment as the control component of robotic health attendants, yet their safety in this context remains poorly characterized. We introduce a dataset of 270 harmful instructions spanning...
arxivpapersbenchmarksafety