Research2026-05-12
The Gordian Knot for VLMs: Diagrammatic Knot Reasoning as a Hard Benchmark
Source: Arxiv CS.AI
arXiv:2605.09900v1 Announce Type: new Abstract: A vision-language model can look at a knot diagram and report what it sees, yet fail to act on that structure. KnotBench pairs an 858,318-image corpus from 1,951 prime-knot prototypes (crossing numbers 3 to 19) with a protocol whose answers are...
arxivpapersreasoningbenchmark