Alert button

Rethinking Kullback-Leibler Divergence in Knowledge Distillation for Large Language Models

Add code
Bookmark button
Alert button
Apr 03, 2024
Taiqiang Wu, Chaofan Tao, Jiahao Wang, Zhe Zhao, Ngai Wong

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: