BeClaude
Research2026-04-22

QSLM: A Performance- and Memory-aware Quantization Framework with Tiered Search Strategy for Spike-driven Language Models

Source: Arxiv CS.AI

arXiv:2601.00679v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) have been emerging as prominent AI models for solving many natural language tasks due to their high performance (e.g., accuracy) and capabilities in generating high-quality responses to the given inputs. However,...

arxivpapers