AUTHOR=Liu Defu, Zhu Yixiao, Liu Zhe, Liu Yi, Han Changlin, Tian Jinkai, Li Ruihao, Yi Wei
TITLE=A survey of model compression techniques: past, present, and future
JOURNAL=Frontiers in Robotics and AI
VOLUME=12
YEAR=2025
URL=https://www.frontiersin.org/journals/robotics-and-ai/articles/10.3389/frobt.2025.1518965
DOI=10.3389/frobt.2025.1518965
ISSN=2296-9144
ABSTRACT=The exceptional performance of general-purpose large models has driven various industries to develop domain-specific models. However, large models are not only time-consuming and labor-intensive to train but also impose steep hardware requirements at inference time, such as large memory footprints and high computational power. These demands pose considerable challenges for practical deployment, and model compression has consequently become a vital research focus. This paper presents a comprehensive review of the evolution of model compression techniques, from their inception to future directions. To meet the urgent demand for efficient deployment, we examine several compression methods, including quantization, pruning, low-rank decomposition, and knowledge distillation, emphasizing their fundamental principles, recent advancements, and innovative strategies. By offering insights into the latest developments and their implications for practical applications, this review serves as a valuable technical resource for researchers and practitioners, providing a range of strategies for model deployment and laying the groundwork for future advancements in model compression.