Alert button
Picture for Tingxuan Zhong

Tingxuan Zhong

Alert button

Towards End-to-end 4-Bit Inference on Generative Large Language Models

Add code
Bookmark button
Alert button
Oct 13, 2023
Saleh Ashkboos, Ilia Markov, Elias Frantar, Tingxuan Zhong, Xincheng Wang, Jie Ren, Torsten Hoefler, Dan Alistarh

Viaarxiv icon