BeClaude
Research2026-05-11

ScrapeGraphAI-100k: Dataset for Schema-Constrained LLM Generation

Source: Arxiv CS.AI

arXiv:2602.15189v2 Announce Type: replace-cross Abstract: Producing output that conforms to a specified JSON schema underlies tool use, structured extraction, and knowledge base construction in modern large language models. Despite this centrality, public datasets for the task remain small,...

arxivpapers