Alert button
Picture for Jiannan Wu

Jiannan Wu

Alert button

Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models

Add code
Bookmark button
Alert button
Apr 19, 2024
Chuofan Ma, Yi Jiang, Jiannan Wu, Zehuan Yuan, Xiaojuan Qi

Viaarxiv icon

InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

Add code
Bookmark button
Alert button
Jan 15, 2024
Zhe Chen, Jiannan Wu, Wenhai Wang, Weijie Su, Guo Chen, Sen Xing, Muyan Zhong, Qinglong Zhang, Xizhou Zhu, Lewei Lu, Bin Li, Ping Luo, Tong Lu, Yu Qiao, Jifeng Dai

Viaarxiv icon

UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces

Add code
Bookmark button
Alert button
Dec 25, 2023
Jiannan Wu, Yi Jiang, Bin Yan, Huchuan Lu, Zehuan Yuan, Ping Luo

Viaarxiv icon

Exploring Transformers for Open-world Instance Segmentation

Add code
Bookmark button
Alert button
Aug 08, 2023
Jiannan Wu, Yi Jiang, Bin Yan, Huchuan Lu, Zehuan Yuan, Ping Luo

Figure 1 for Exploring Transformers for Open-world Instance Segmentation
Figure 2 for Exploring Transformers for Open-world Instance Segmentation
Figure 3 for Exploring Transformers for Open-world Instance Segmentation
Figure 4 for Exploring Transformers for Open-world Instance Segmentation
Viaarxiv icon

VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks

Add code
Bookmark button
Alert button
May 25, 2023
Wenhai Wang, Zhe Chen, Xiaokang Chen, Jiannan Wu, Xizhou Zhu, Gang Zeng, Ping Luo, Tong Lu, Jie Zhou, Yu Qiao, Jifeng Dai

Figure 1 for VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks
Figure 2 for VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks
Figure 3 for VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks
Figure 4 for VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks
Viaarxiv icon

Multi-Level Contrastive Learning for Dense Prediction Task

Add code
Bookmark button
Alert button
Apr 04, 2023
Qiushan Guo, Yizhou Yu, Yi Jiang, Jiannan Wu, Zehuan Yuan, Ping Luo

Figure 1 for Multi-Level Contrastive Learning for Dense Prediction Task
Figure 2 for Multi-Level Contrastive Learning for Dense Prediction Task
Figure 3 for Multi-Level Contrastive Learning for Dense Prediction Task
Figure 4 for Multi-Level Contrastive Learning for Dense Prediction Task
Viaarxiv icon

Universal Instance Perception as Object Discovery and Retrieval

Add code
Bookmark button
Alert button
Mar 12, 2023
Bin Yan, Yi Jiang, Jiannan Wu, Dong Wang, Ping Luo, Zehuan Yuan, Huchuan Lu

Figure 1 for Universal Instance Perception as Object Discovery and Retrieval
Figure 2 for Universal Instance Perception as Object Discovery and Retrieval
Figure 3 for Universal Instance Perception as Object Discovery and Retrieval
Figure 4 for Universal Instance Perception as Object Discovery and Retrieval
Viaarxiv icon

Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders

Add code
Bookmark button
Alert button
Oct 09, 2022
Haosen Yang, Deng Huang, Bin Wen, Jiannan Wu, Hongxun Yao, Yi Jiang, Xiatian Zhu, Zehuan Yuan

Figure 1 for Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders
Figure 2 for Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders
Figure 3 for Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders
Figure 4 for Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders
Viaarxiv icon

Language as Queries for Referring Video Object Segmentation

Add code
Bookmark button
Alert button
Jan 03, 2022
Jiannan Wu, Yi Jiang, Peize Sun, Zehuan Yuan, Ping Luo

Figure 1 for Language as Queries for Referring Video Object Segmentation
Figure 2 for Language as Queries for Referring Video Object Segmentation
Figure 3 for Language as Queries for Referring Video Object Segmentation
Figure 4 for Language as Queries for Referring Video Object Segmentation
Viaarxiv icon

Towards High-Quality Temporal Action Detection with Sparse Proposals

Add code
Bookmark button
Alert button
Sep 18, 2021
Jiannan Wu, Peize Sun, Shoufa Chen, Jiewen Yang, Zihao Qi, Lan Ma, Ping Luo

Figure 1 for Towards High-Quality Temporal Action Detection with Sparse Proposals
Figure 2 for Towards High-Quality Temporal Action Detection with Sparse Proposals
Figure 3 for Towards High-Quality Temporal Action Detection with Sparse Proposals
Figure 4 for Towards High-Quality Temporal Action Detection with Sparse Proposals
Viaarxiv icon