BeClaude
Research, 2026-05-14

Mind the Gap: How Elicitation Protocols Shape the Stated-Revealed Preference Gap in Language Models

Source: arXiv cs.AI

arXiv:2601.21975v2 (announce type: replace)

Abstract: Recent work identifies a stated-revealed (SvR) preference gap in language models (LMs): a mismatch between the values models endorse and the choices they make in context. Existing evaluations rely heavily on binary forced-choice prompting, which...
