Research2026-05-07

Neuron-Anchored Rule Extraction for Large Language Models via Contrastive Hierarchical Ablation

arXiv:2605.03058v1 Announce Type: cross Abstract: A key goal of explainable AI (XAI) is to express the decision logic of large language models (LLMs) in symbolic form and link it to internal mechanisms. Global rule-extraction methods typically learn symbolic surrogates without grounding rules in...

Read Original Article on Arxiv CS.AI

arxivpapers