Research2026-04-30

Benchmarking the Safety of Large Language Models for Robotic Health Attendant Control

arXiv:2604.26577v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly considered for deployment as the control component of robotic health attendants, yet their safety in this context remains poorly characterized. We introduce a dataset of 270 harmful instructions spanning...

Read Original Article on Arxiv CS.AI

arxivpapersbenchmarksafety