Alert button
Picture for Zuchao Li

Zuchao Li

Alert button

Cross-Modal Adapter: Parameter-Efficient Transfer Learning Approach for Vision-Language Models

Add code
Bookmark button
Alert button
Apr 19, 2024
Juncheng Yang, Zuchao Li, Shuai Xie, Weiping Zhu, Wei Yu, Shijun Li

Viaarxiv icon

Soft-Prompting with Graph-of-Thought for Multi-modal Representation Learning

Add code
Bookmark button
Alert button
Apr 06, 2024
Juncheng Yang, Zuchao Li, Shuai Xie, Wei Yu, Shijun Li, Bo Du

Viaarxiv icon

Multi-modal Auto-regressive Modeling via Visual Words

Add code
Bookmark button
Alert button
Mar 12, 2024
Tianshuo Peng, Zuchao Li, Lefei Zhang, Hai Zhao, Ping Wang, Bo Du

Figure 1 for Multi-modal Auto-regressive Modeling via Visual Words
Figure 2 for Multi-modal Auto-regressive Modeling via Visual Words
Figure 3 for Multi-modal Auto-regressive Modeling via Visual Words
Figure 4 for Multi-modal Auto-regressive Modeling via Visual Words
Viaarxiv icon

Sparse is Enough in Fine-tuning Pre-trained Large Language Model

Add code
Bookmark button
Alert button
Dec 19, 2023
Weixi Song, Zuchao Li, Lefei Zhang, Hai Zhao, Bo Du

Viaarxiv icon

A Novel Energy based Model Mechanism for Multi-modal Aspect-Based Sentiment Analysis

Add code
Bookmark button
Alert button
Dec 15, 2023
Tianshuo Peng, Zuchao Li, Ping Wang, Lefei Zhang, Hai Zhao

Figure 1 for A Novel Energy based Model Mechanism for Multi-modal Aspect-Based Sentiment Analysis
Figure 2 for A Novel Energy based Model Mechanism for Multi-modal Aspect-Based Sentiment Analysis
Figure 3 for A Novel Energy based Model Mechanism for Multi-modal Aspect-Based Sentiment Analysis
Figure 4 for A Novel Energy based Model Mechanism for Multi-modal Aspect-Based Sentiment Analysis
Viaarxiv icon

N-Gram Unsupervised Compoundation and Feature Injection for Better Symbolic Music Understanding

Add code
Bookmark button
Alert button
Dec 15, 2023
Jinhao Tian, Zuchao Li, Jiajia Li, Ping Wang

Figure 1 for N-Gram Unsupervised Compoundation and Feature Injection for Better Symbolic Music Understanding
Figure 2 for N-Gram Unsupervised Compoundation and Feature Injection for Better Symbolic Music Understanding
Figure 3 for N-Gram Unsupervised Compoundation and Feature Injection for Better Symbolic Music Understanding
Figure 4 for N-Gram Unsupervised Compoundation and Feature Injection for Better Symbolic Music Understanding
Viaarxiv icon

Multi-modal Latent Space Learning for Chain-of-Thought Reasoning in Language Models

Add code
Bookmark button
Alert button
Dec 14, 2023
Liqi He, Zuchao Li, Xiantao Cai, Ping Wang

Figure 1 for Multi-modal Latent Space Learning for Chain-of-Thought Reasoning in Language Models
Figure 2 for Multi-modal Latent Space Learning for Chain-of-Thought Reasoning in Language Models
Figure 3 for Multi-modal Latent Space Learning for Chain-of-Thought Reasoning in Language Models
Figure 4 for Multi-modal Latent Space Learning for Chain-of-Thought Reasoning in Language Models
Viaarxiv icon

Bootstrapping Interactive Image-Text Alignment for Remote Sensing Image Captioning

Add code
Bookmark button
Alert button
Dec 02, 2023
Cong Yang, Zuchao Li, Lefei Zhang

Viaarxiv icon

ArcMMLU: A Library and Information Science Benchmark for Large Language Models

Add code
Bookmark button
Alert button
Nov 30, 2023
Shitou Zhang, Zuchao Li, Xingshen Liu, Liming Yang, Ping Wang

Figure 1 for ArcMMLU: A Library and Information Science Benchmark for Large Language Models
Figure 2 for ArcMMLU: A Library and Information Science Benchmark for Large Language Models
Figure 3 for ArcMMLU: A Library and Information Science Benchmark for Large Language Models
Figure 4 for ArcMMLU: A Library and Information Science Benchmark for Large Language Models
Viaarxiv icon

Enhancing Visually-Rich Document Understanding via Layout Structure Modeling

Add code
Bookmark button
Alert button
Aug 15, 2023
Qiwei Li, Zuchao Li, Xiantao Cai, Bo Du, Hai Zhao

Figure 1 for Enhancing Visually-Rich Document Understanding via Layout Structure Modeling
Figure 2 for Enhancing Visually-Rich Document Understanding via Layout Structure Modeling
Figure 3 for Enhancing Visually-Rich Document Understanding via Layout Structure Modeling
Figure 4 for Enhancing Visually-Rich Document Understanding via Layout Structure Modeling
Viaarxiv icon