Research 2026-05-11
ScrapeGraphAI-100k: Dataset for Schema-Constrained LLM Generation
Source: arXiv cs.AI
arXiv:2602.15189v2. Abstract: Producing output that conforms to a specified JSON schema underlies tool use, structured extraction, and knowledge base construction in modern large language models. Despite this centrality, public datasets for the task remain small,...
arxiv, papers