Research 2026-04-22
Modular Representation Compression: Adapting LLMs for Efficient and Effective Recommendations
Source: arXiv cs.AI
arXiv:2604.18146v2 (announce type: replace-cross)
Abstract: Recently, large language models (LLMs) have advanced recommendation systems (RSs), and recent works have begun to explore how to integrate LLMs into industrial RSs. While most approaches deploy LLMs offline to generate and pre-cache...