Research2026-04-22
SimDiff: Depth Pruning via Similarity and Difference
Source: Arxiv CS.AI
arXiv:2604.19520v1 Announce Type: new Abstract: Depth pruning improves the deployment efficiency of large language models (LLMs) by identifying and removing redundant layers. A widely accepted standard for this identification process is to measure the similarity between layers using cosine...
arxivpapers