Alert button
Picture for Maosong Sun

Maosong Sun

Alert button

Beyond Human Norms: Unveiling Unique Values of Large Language Models through Interdisciplinary Approaches

Add code
Bookmark button
Alert button
Apr 19, 2024
Pablo Biedma, Xiaoyuan Yi, Linus Huang, Maosong Sun, Xing Xie

Viaarxiv icon

UltraEval: A Lightweight Platform for Flexible and Comprehensive Evaluation for LLMs

Add code
Bookmark button
Alert button
Apr 11, 2024
Chaoqun He, Renjie Luo, Shengding Hu, Yuanqian Zhao, Jie Zhou, Hanghao Wu, Jiajie Zhang, Xu Han, Zhiyuan Liu, Maosong Sun

Viaarxiv icon

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

Add code
Bookmark button
Alert button
Apr 09, 2024
Shengding Hu, Yuge Tu, Xu Han, Chaoqun He, Ganqu Cui, Xiang Long, Zhi Zheng, Yewei Fang, Yuxiang Huang, Weilin Zhao, Xinrong Zhang, Zheng Leng Thai, Kaihuo Zhang, Chongyi Wang, Yuan Yao, Chenyang Zhao, Jie Zhou, Jie Cai, Zhongwu Zhai, Ning Ding, Chao Jia, Guoyang Zeng, Dahai Li, Zhiyuan Liu, Maosong Sun

Viaarxiv icon

Personality-affected Emotion Generation in Dialog Systems

Add code
Bookmark button
Alert button
Apr 03, 2024
Zhiyuan Wen, Jiannong Cao, Jiaxing Shen, Ruosong Yang, Shuaiqi Liu, Maosong Sun

Viaarxiv icon

Advancing LLM Reasoning Generalists with Preference Trees

Add code
Bookmark button
Alert button
Apr 02, 2024
Lifan Yuan, Ganqu Cui, Hanbin Wang, Ning Ding, Xingyao Wang, Jia Deng, Boji Shan, Huimin Chen, Ruobing Xie, Yankai Lin, Zhenghao Liu, Bowen Zhou, Hao Peng, Zhiyuan Liu, Maosong Sun

Viaarxiv icon

Robust and Scalable Model Editing for Large Language Models

Add code
Bookmark button
Alert button
Mar 26, 2024
Yingfa Chen, Zhengyan Zhang, Xu Han, Chaojun Xiao, Zhiyuan Liu, Chen Chen, Kuai Li, Tao Yang, Maosong Sun

Viaarxiv icon

LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images

Add code
Bookmark button
Alert button
Mar 18, 2024
Ruyi Xu, Yuan Yao, Zonghao Guo, Junbo Cui, Zanlin Ni, Chunjiang Ge, Tat-Seng Chua, Zhiyuan Liu, Maosong Sun, Gao Huang

Figure 1 for LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images
Figure 2 for LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images
Figure 3 for LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images
Figure 4 for LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images
Viaarxiv icon

Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models

Add code
Bookmark button
Alert button
Mar 18, 2024
Ning Ding, Yulin Chen, Ganqu Cui, Xingtai Lv, Weilin Zhao, Ruobing Xie, Bowen Zhou, Zhiyuan Liu, Maosong Sun

Figure 1 for Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models
Figure 2 for Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models
Figure 3 for Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models
Figure 4 for Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models
Viaarxiv icon

BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences

Add code
Bookmark button
Alert button
Mar 14, 2024
Sun Ao, Weilin Zhao, Xu Han, Cheng Yang, Zhiyuan Liu, Chuan Shi, Maosong Sun, Shengnan Wang, Teng Su

Figure 1 for BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences
Figure 2 for BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences
Figure 3 for BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences
Figure 4 for BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences
Viaarxiv icon