Research, 2026-05-06
SCPRM: A Schema-aware Cumulative Process Reward Model for Knowledge Graph Question Answering
Source: arXiv cs.AI
arXiv:2605.02819v1 (announce type: new)
Abstract: Large language models excel at complex reasoning, yet evaluating their intermediate steps remains challenging. Although process reward models provide step-wise supervision, they often suffer from a risk compensation effect, where incorrect steps are...