Alert button

RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning

Add code
Bookmark button
Alert button
Jan 19, 2024
Junjie Ye, Yilong Wu, Songyang Gao, Caishuang Huang, Sixian Li, Guanyu Li, Xiaoran Fan, Qi Zhang, Tao Gui, Xuanjing Huang

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: