BeClaude
Research2026-05-06

SCPRM: A Schema-aware Cumulative Process Reward Model for Knowledge Graph Question Answering

Source: Arxiv CS.AI

arXiv:2605.02819v1 Announce Type: new Abstract: Large language models excel at complex reasoning, yet evaluating their intermediate steps remains challenging. Although process reward models provide step-wise supervision, they often suffer from a risk compensation effect, where incorrect steps are...

arxivpapers