Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Feb 03, 2024
Yichao Fu, Peter Bailis, Ion Stoica, Hao Zhang