BeClaude
Research2026-04-20

KWBench: Measuring Unprompted Problem Recognition in Knowledge Work

Source: Arxiv CS.AI

arXiv:2604.15760v1 Announce Type: new Abstract: We introduce the first version of KWBench (Knowledge Work Bench), a benchmark for unprompted problem recognition in large language models: can an LLM identify a professional scenario before attempting to solve it. Existing frontier benchmarks have...

arxivpapersprompting