Research 2026-05-07
Neuron-Anchored Rule Extraction for Large Language Models via Contrastive Hierarchical Ablation
Source: arXiv cs.AI
arXiv:2605.03058v1 Announce Type: cross Abstract: A key goal of explainable AI (XAI) is to express the decision logic of large language models (LLMs) in symbolic form and link it to internal mechanisms. Global rule-extraction methods typically learn symbolic surrogates without grounding rules in...