Research2026-04-20
KWBench: Measuring Unprompted Problem Recognition in Knowledge Work
Source: Arxiv CS.AI
arXiv:2604.15760v1 Announce Type: new Abstract: We introduce the first version of KWBench (Knowledge Work Bench), a benchmark for unprompted problem recognition in large language models: can an LLM identify a professional scenario before attempting to solve it. Existing frontier benchmarks have...
arxivpapersprompting