Research 2026-04-22

QSLM: A Performance- and Memory-aware Quantization Framework with Tiered Search Strategy for Spike-driven Language Models

arXiv:2601.00679v2
Announce Type: replace-cross

Abstract: Large Language Models (LLMs) have been emerging as prominent AI models for solving many natural language tasks due to their high performance (e.g., accuracy) and capabilities in generating high-quality responses to the given inputs. However,...

Read Original Article on arXiv cs.AI
