Alert button

Benchmark Self-Evolving: A Multi-Agent Framework for Dynamic LLM Evaluation

Add code
Bookmark button
Alert button
Feb 18, 2024
Siyuan Wang, Zhuohan Long, Zhihao Fan, Zhongyu Wei, Xuanjing Huang

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: