Alert button

Model Compression and Efficient Inference for Large Language Models: A Survey

Feb 15, 2024
Wenxiao Wang, Wei Chen, Yicong Luo, Yongliu Long, Zhengkai Lin, Liye Zhang, Binbin Lin, Deng Cai, Xiaofei He

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: