Alert button

MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models

Add code
Bookmark button
Alert button
Jan 30, 2024
Wai-Chung Kwan, Xingshan Zeng, Yuxin Jiang, Yufei Wang, Liangyou Li, Lifeng Shang, Xin Jiang, Qun Liu, Kam-Fai Wong

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: