BeClaude
Partnership2026-04-28

MTRouter: Cost-Aware Multi-Turn LLM Routing with History-Model Joint Embeddings

Source: Arxiv CS.AI

arXiv:2604.23530v1 Announce Type: cross Abstract: Multi-turn, long-horizon tasks are increasingly common for large language models (LLMs), but solving them typically requires many sequential model invocations, accumulating substantial inference costs. Here, we study cost-aware multi-turn LLM...

arxivpapers