Alert button
Picture for Yu Liu

Yu Liu

Alert button

MoVA: Adapting Mixture of Vision Experts to Multimodal Context

Add code
Bookmark button
Alert button
Apr 19, 2024
Zhuofan Zong, Bingqi Ma, Dazhong Shen, Guanglu Song, Hao Shao, Dongzhi Jiang, Hongsheng Li, Yu Liu

Viaarxiv icon

InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior

Add code
Bookmark button
Alert button
Apr 17, 2024
Zhiheng Liu, Hao Ouyang, Qiuyu Wang, Ka Leong Cheng, Jie Xiao, Kai Zhu, Nan Xue, Yu Liu, Yujun Shen, Yang Cao

Viaarxiv icon

GLID: Pre-training a Generalist Encoder-Decoder Vision Model

Add code
Bookmark button
Alert button
Apr 11, 2024
Jihao Liu, Jinliang Zheng, Yu Liu, Hongsheng Li

Viaarxiv icon

Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance

Add code
Bookmark button
Alert button
Apr 08, 2024
Dazhong Shen, Guanglu Song, Zeyue Xue, Fu-Yun Wang, Yu Liu

Viaarxiv icon

CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching

Add code
Bookmark button
Alert button
Apr 04, 2024
Dongzhi Jiang, Guanglu Song, Xiaoshi Wu, Renrui Zhang, Dazhong Shen, Zhuofan Zong, Yu Liu, Hongsheng Li

Viaarxiv icon

Genetic Quantization-Aware Approximation for Non-Linear Operations in Transformers

Add code
Bookmark button
Alert button
Mar 29, 2024
Pingcheng Dong, Yonghao Tan, Dong Zhang, Tianwei Ni, Xuejiao Liu, Yu Liu, Peng Luo, Luhong Liang, Shih-Yang Liu, Xijie Huang, Huaiyu Zhu, Yun Pan, Fengwei An, Kwang-Ting Cheng

Figure 1 for Genetic Quantization-Aware Approximation for Non-Linear Operations in Transformers
Figure 2 for Genetic Quantization-Aware Approximation for Non-Linear Operations in Transformers
Figure 3 for Genetic Quantization-Aware Approximation for Non-Linear Operations in Transformers
Figure 4 for Genetic Quantization-Aware Approximation for Non-Linear Operations in Transformers
Viaarxiv icon

FlashFace: Human Image Personalization with High-fidelity Identity Preservation

Add code
Bookmark button
Alert button
Mar 25, 2024
Shilong Zhang, Lianghua Huang, Xi Chen, Yifei Zhang, Zhi-Fan Wu, Yutong Feng, Wei Wang, Yujun Shen, Yu Liu, Ping Luo

Viaarxiv icon

Visual CoT: Unleashing Chain-of-Thought Reasoning in Multi-Modal Language Models

Add code
Bookmark button
Alert button
Mar 25, 2024
Hao Shao, Shengju Qian, Han Xiao, Guanglu Song, Zhuofan Zong, Letian Wang, Yu Liu, Hongsheng Li

Viaarxiv icon

Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation

Add code
Bookmark button
Alert button
Mar 20, 2024
Fu-Yun Wang, Xiaoshi Wu, Zhaoyang Huang, Xiaoyu Shi, Dazhong Shen, Guanglu Song, Yu Liu, Hongsheng Li

Figure 1 for Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation
Figure 2 for Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation
Figure 3 for Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation
Figure 4 for Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation
Viaarxiv icon

FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis

Add code
Bookmark button
Alert button
Mar 19, 2024
Linjiang Huang, Rongyao Fang, Aiping Zhang, Guanglu Song, Si Liu, Yu Liu, Hongsheng Li

Figure 1 for FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis
Figure 2 for FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis
Figure 3 for FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis
Figure 4 for FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis
Viaarxiv icon