BeClaude
Research | 2026-04-27

Mechanistic Interpretability of Antibody Language Models Using SAEs

Source: arXiv cs.AI

arXiv:2512.05794v2. Abstract: Sparse autoencoders (SAEs) are a mechanistic interpretability technique that has been used to provide insight into learned concepts within large protein language models. Here, we employ TopK and Ordered SAEs to investigate autoregressive...
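
For readers unfamiliar with the TopK SAEs named in the abstract, the following is a minimal sketch of a generic TopK sparse autoencoder forward pass in NumPy. It is not the paper's implementation: the function name `topk_sae_forward`, the weight shapes, the shared pre-encoder/decoder bias `b_dec`, the ReLU on the surviving latents, and the random toy inputs are all assumptions made for illustration. The idea is that only the `k` largest encoder pre-activations per input are kept, so each activation vector is reconstructed from at most `k` latent features.

```python
import numpy as np

def topk_sae_forward(x, W_enc, b_enc, W_dec, b_dec, k):
    """Generic TopK SAE forward pass (illustrative, not the paper's code).

    x:     (batch, d_model) model activations
    W_enc: (d_model, n_latents), W_dec: (n_latents, d_model)
    """
    # Encoder pre-activations, computed after subtracting the decoder bias
    # (an assumed but common convention).
    pre = (x - b_dec) @ W_enc + b_enc
    # Column indices of the k largest pre-activations in each row.
    top_idx = np.argpartition(pre, -k, axis=1)[:, -k:]
    rows = np.arange(pre.shape[0])[:, None]
    # Keep only the top-k entries (clipped at zero); all others stay zero.
    z = np.zeros_like(pre)
    z[rows, top_idx] = np.maximum(pre[rows, top_idx], 0.0)
    # Linear decoder reconstructs the activation from the sparse code.
    x_hat = z @ W_dec + b_dec
    return z, x_hat

# Toy usage with random data (hypothetical sizes).
rng = np.random.default_rng(0)
d_model, n_latents, k = 16, 64, 4
x = rng.normal(size=(8, d_model))
W_enc = rng.normal(size=(d_model, n_latents)) * 0.1
W_dec = W_enc.T.copy()  # tied initialisation, an assumption
b_enc = np.zeros(n_latents)
b_dec = np.zeros(d_model)
z, x_hat = topk_sae_forward(x, W_enc, b_enc, W_dec, b_dec, k)
```

In this sketch, sparsity is enforced directly by the choice of `k` rather than by a penalty term in the loss; training (minimising the reconstruction error between `x` and `x_hat`) is omitted.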

arxiv, papers