Research, 2026-05-05
Budget-Aware Routing for Long Clinical Text
Source: arXiv cs.AI
arXiv:2605.00336v1 Announce Type: cross Abstract: A key challenge for large language models is token cost per query and overall deployment cost. Clinical inputs are long, heterogeneous, and often redundant, while downstream tasks are short and high stakes. We study budgeted context selection, where...