Research 2026-04-23
CodeRL+: Improving Code Generation via Reinforcement with Execution Semantics Alignment
Source: arXiv cs.AI
arXiv:2510.18471v2 (announce type: replace-cross)
Abstract: While Large Language Models (LLMs) excel at code generation by learning from vast code corpora, a fundamental semantic gap remains between their training on textual patterns and the goal of functional correctness, which is governed by formal...
arxiv papers rl