Research2026-04-28

How Much Heavy Lifting Can an Agent Harness Do?: Measuring the LLM's Residual Role in a Planning Agent

arXiv:2604.07236v3 Announce Type: replace Abstract: Agent harnesses -- the stateful programs that wrap a language model and decide what it sees at each step -- are now known to change end-to-end performance on a fixed model by as much as six times. That observation raises a question asked less...

Read Original Article on Arxiv CS.AI

arxivpapersagents