Alert button
Picture for Zhiliang Peng

Zhiliang Peng

Alert button

Kosmos-G: Generating Images in Context with Multimodal Large Language Models

Add code
Bookmark button
Alert button
Oct 04, 2023
Xichen Pan, Li Dong, Shaohan Huang, Zhiliang Peng, Wenhu Chen, Furu Wei

Figure 1 for Kosmos-G: Generating Images in Context with Multimodal Large Language Models
Figure 2 for Kosmos-G: Generating Images in Context with Multimodal Large Language Models
Figure 3 for Kosmos-G: Generating Images in Context with Multimodal Large Language Models
Figure 4 for Kosmos-G: Generating Images in Context with Multimodal Large Language Models
Viaarxiv icon

Kosmos-2: Grounding Multimodal Large Language Models to the World

Add code
Bookmark button
Alert button
Jul 13, 2023
Zhiliang Peng, Wenhui Wang, Li Dong, Yaru Hao, Shaohan Huang, Shuming Ma, Furu Wei

Figure 1 for Kosmos-2: Grounding Multimodal Large Language Models to the World
Figure 2 for Kosmos-2: Grounding Multimodal Large Language Models to the World
Figure 3 for Kosmos-2: Grounding Multimodal Large Language Models to the World
Figure 4 for Kosmos-2: Grounding Multimodal Large Language Models to the World
Viaarxiv icon

Generic-to-Specific Distillation of Masked Autoencoders

Add code
Bookmark button
Alert button
Feb 28, 2023
Wei Huang, Zhiliang Peng, Li Dong, Furu Wei, Jianbin Jiao, Qixiang Ye

Figure 1 for Generic-to-Specific Distillation of Masked Autoencoders
Figure 2 for Generic-to-Specific Distillation of Masked Autoencoders
Figure 3 for Generic-to-Specific Distillation of Masked Autoencoders
Figure 4 for Generic-to-Specific Distillation of Masked Autoencoders
Viaarxiv icon

A Unified View of Masked Image Modeling

Add code
Bookmark button
Alert button
Oct 19, 2022
Zhiliang Peng, Li Dong, Hangbo Bao, Qixiang Ye, Furu Wei

Figure 1 for A Unified View of Masked Image Modeling
Figure 2 for A Unified View of Masked Image Modeling
Figure 3 for A Unified View of Masked Image Modeling
Figure 4 for A Unified View of Masked Image Modeling
Viaarxiv icon

Foundation Transformers

Add code
Bookmark button
Alert button
Oct 19, 2022
Hongyu Wang, Shuming Ma, Shaohan Huang, Li Dong, Wenhui Wang, Zhiliang Peng, Yu Wu, Payal Bajaj, Saksham Singhal, Alon Benhaim, Barun Patra, Zhun Liu, Vishrav Chaudhary, Xia Song, Furu Wei

Figure 1 for Foundation Transformers
Figure 2 for Foundation Transformers
Figure 3 for Foundation Transformers
Figure 4 for Foundation Transformers
Viaarxiv icon

Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks

Add code
Bookmark button
Alert button
Aug 31, 2022
Wenhui Wang, Hangbo Bao, Li Dong, Johan Bjorck, Zhiliang Peng, Qiang Liu, Kriti Aggarwal, Owais Khan Mohammed, Saksham Singhal, Subhojit Som, Furu Wei

Figure 1 for Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks
Figure 2 for Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks
Figure 3 for Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks
Figure 4 for Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks
Viaarxiv icon

BEiT v2: Masked Image Modeling with Vector-Quantized Visual Tokenizers

Add code
Bookmark button
Alert button
Aug 12, 2022
Zhiliang Peng, Li Dong, Hangbo Bao, Qixiang Ye, Furu Wei

Figure 1 for BEiT v2: Masked Image Modeling with Vector-Quantized Visual Tokenizers
Figure 2 for BEiT v2: Masked Image Modeling with Vector-Quantized Visual Tokenizers
Figure 3 for BEiT v2: Masked Image Modeling with Vector-Quantized Visual Tokenizers
Figure 4 for BEiT v2: Masked Image Modeling with Vector-Quantized Visual Tokenizers
Viaarxiv icon

Integral Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection

Add code
Bookmark button
Alert button
May 19, 2022
Xiaosong Zhang, Feng Liu, Zhiliang Peng, Zonghao Guo, Fang Wan, Xiangyang Ji, Qixiang Ye

Figure 1 for Integral Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection
Figure 2 for Integral Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection
Figure 3 for Integral Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection
Figure 4 for Integral Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection
Viaarxiv icon

Long-tailed Distribution Adaptation

Add code
Bookmark button
Alert button
Oct 06, 2021
Zhiliang Peng, Wei Huang, Zonghao Guo, Xiaosong Zhang, Jianbin Jiao, Qixiang Ye

Figure 1 for Long-tailed Distribution Adaptation
Figure 2 for Long-tailed Distribution Adaptation
Figure 3 for Long-tailed Distribution Adaptation
Figure 4 for Long-tailed Distribution Adaptation
Viaarxiv icon

Conformer: Local Features Coupling Global Representations for Visual Recognition

Add code
Bookmark button
Alert button
May 09, 2021
Zhiliang Peng, Wei Huang, Shanzhi Gu, Lingxi Xie, Yaowei Wang, Jianbin Jiao, Qixiang Ye

Figure 1 for Conformer: Local Features Coupling Global Representations for Visual Recognition
Figure 2 for Conformer: Local Features Coupling Global Representations for Visual Recognition
Figure 3 for Conformer: Local Features Coupling Global Representations for Visual Recognition
Figure 4 for Conformer: Local Features Coupling Global Representations for Visual Recognition
Viaarxiv icon