Research 2026-05-12

The Gordian Knot for VLMs: Diagrammatic Knot Reasoning as a Hard Benchmark

arXiv:2605.09900v1. Abstract: A vision-language model can look at a knot diagram and report what it sees, yet fail to act on that structure. KnotBench pairs an 858,318-image corpus from 1,951 prime-knot prototypes (crossing numbers 3 to 19) with a protocol whose answers are...

Read the original article on arXiv cs.AI

arxiv, papers, reasoning, benchmark