Research2026-05-12
Knowledge is Not Enough: Injecting RL Skills for Continual Adaptation
Source: Arxiv CS.AI
arXiv:2601.11258v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) face the "knowledge cutoff" challenge, where their frozen parametric memory prevents direct internalization of new information. While Supervised Fine-Tuning (SFT) is commonly used to update model knowledge, it...
arxivpapers