Research2026-05-12
E-TCAV: Formalizing Penultimate Proxies for Efficient Concept Based Interpretability
Source: Arxiv CS.AI
arXiv:2605.10261v1 Announce Type: new Abstract: TCAV (Testing with Concept Activation Vectors) is an interpretability method that assesses the alignment between the internal representations of a trained neural network and human-understandable, high-level concepts. Though effective, TCAV suffers...
arxivpapers