Alert button
Picture for Shengyu Liu

Shengyu Liu

Alert button

LoongServe: Efficiently Serving Long-context Large Language Models with Elastic Sequence Parallelism

Add code
Bookmark button
Alert button
Apr 15, 2024
Bingyang Wu, Shengyu Liu, Yinmin Zhong, Peng Sun, Xuanzhe Liu, Xin Jin

Viaarxiv icon

Learning Accurate Performance Predictors for Ultrafast Automated Model Compression

Add code
Bookmark button
Alert button
Apr 13, 2023
Ziwei Wang, Jiwen Lu, Han Xiao, Shengyu Liu, Jie Zhou

Viaarxiv icon