Research 2026-05-14
DisaBench: A Participatory Evaluation Framework for Disability Harms in Language Models
Source: arXiv cs.AI
arXiv:2605.12702v1
Abstract: General-purpose safety benchmarks for large language models do not adequately evaluate disability-related harms. We introduce DisaBench: a taxonomy of twelve disability harm categories co-created with people with disabilities and red teaming experts,...