Research2026-05-12

Metacognitive Behavioral Tuning of Large Language Models for Multi-Hop Question Answering

arXiv:2602.22508v2 Announce Type: replace Abstract: Large Language Models (LLMs) often produce incorrect answers on multi-hop question answering even when the reasoning trace already contains a correct intermediate conclusion. We attribute this gap to weak self-regulation rather than insufficient...

Read Original Article on Arxiv CS.AI

arxivpapers