Alert button
Picture for Zhen Leng Thai

Zhen Leng Thai

Alert button

$\infty$Bench: Extending Long Context Evaluation Beyond 100K Tokens

Add code
Bookmark button
Alert button
Feb 24, 2024
Xinrong Zhang, Yingfa Chen, Shengding Hu, Zihang Xu, Junhao Chen, Moo Khai Hao, Xu Han, Zhen Leng Thai, Shuo Wang, Zhiyuan Liu, Maosong Sun

Figure 1 for $\infty$Bench: Extending Long Context Evaluation Beyond 100K Tokens
Figure 2 for $\infty$Bench: Extending Long Context Evaluation Beyond 100K Tokens
Figure 3 for $\infty$Bench: Extending Long Context Evaluation Beyond 100K Tokens
Figure 4 for $\infty$Bench: Extending Long Context Evaluation Beyond 100K Tokens
Viaarxiv icon

OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems

Add code
Bookmark button
Alert button
Feb 21, 2024
Chaoqun He, Renjie Luo, Yuzhuo Bai, Shengding Hu, Zhen Leng Thai, Junhao Shen, Jinyi Hu, Xu Han, Yujie Huang, Yuxiang Zhang, Jie Liu, Lei Qi, Zhiyuan Liu, Maosong Sun

Figure 1 for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems
Figure 2 for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems
Figure 3 for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems
Figure 4 for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems
Viaarxiv icon