Research | 2026-04-22
QSLM: A Performance- and Memory-aware Quantization Framework with Tiered Search Strategy for Spike-driven Language Models
Source: arXiv cs.AI
arXiv:2601.00679v2 Announce Type: replace-cross
Abstract: Large Language Models (LLMs) have emerged as prominent AI models for many natural language tasks, owing to their high performance (e.g., accuracy) and their ability to generate high-quality responses to given inputs. However,...
arxiv, papers