BeClaude
Research2026-04-30

ChinaTravel: An Open-Ended Travel Planning Benchmark with Compositional Constraint Validation for Language Agents

Source: Arxiv CS.AI

arXiv:2412.13682v5 Announce Type: replace Abstract: Travel planning stands out among real-world applications of \emph{Language Agents} because it couples significant practical demand with a rigorous constraint-satisfaction challenge. However, existing benchmarks primarily operate on a slot-filling...

arxivpapersagentsbenchmark