Research 2026-05-14

Mind the Gap: How Elicitation Protocols Shape the Stated-Revealed Preference Gap in Language Models

arXiv:2601.21975v2

Abstract: Recent work identifies a stated-revealed (SvR) preference gap in language models (LMs): a mismatch between the values models endorse and the choices they make in context. Existing evaluations rely heavily on binary forced-choice prompting, which...

Read Original Article on Arxiv CS.AI
