Alert button
Picture for Hengduo Li

Hengduo Li

Alert button

MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding

Add code
Bookmark button
Alert button
Apr 08, 2024
Bo He, Hengduo Li, Young Kyun Jang, Menglin Jia, Xuefei Cao, Ashish Shah, Abhinav Shrivastava, Ser-Nam Lim

Viaarxiv icon

Object Recognition as Next Token Prediction

Add code
Bookmark button
Alert button
Dec 04, 2023
Kaiyu Yue, Bor-Chun Chen, Jonas Geiping, Hengduo Li, Tom Goldstein, Ser-Nam Lim

Viaarxiv icon

SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation

Add code
Bookmark button
Alert button
Nov 24, 2023
Lingchen Meng, Shiyi Lan, Hengduo Li, Jose M. Alvarez, Zuxuan Wu, Yu-Gang Jiang

Viaarxiv icon

BMB: Balanced Memory Bank for Imbalanced Semi-supervised Learning

Add code
Bookmark button
Alert button
May 22, 2023
Wujian Peng, Zejia Weng, Hengduo Li, Zuxuan Wu

Figure 1 for BMB: Balanced Memory Bank for Imbalanced Semi-supervised Learning
Figure 2 for BMB: Balanced Memory Bank for Imbalanced Semi-supervised Learning
Figure 3 for BMB: Balanced Memory Bank for Imbalanced Semi-supervised Learning
Figure 4 for BMB: Balanced Memory Bank for Imbalanced Semi-supervised Learning
Viaarxiv icon

Semi-Supervised Single-View 3D Reconstruction via Prototype Shape Priors

Add code
Bookmark button
Alert button
Sep 30, 2022
Zhen Xing, Hengduo Li, Zuxuan Wu, Yu-Gang Jiang

Figure 1 for Semi-Supervised Single-View 3D Reconstruction via Prototype Shape Priors
Figure 2 for Semi-Supervised Single-View 3D Reconstruction via Prototype Shape Priors
Figure 3 for Semi-Supervised Single-View 3D Reconstruction via Prototype Shape Priors
Figure 4 for Semi-Supervised Single-View 3D Reconstruction via Prototype Shape Priors
Viaarxiv icon

AdaViT: Adaptive Vision Transformers for Efficient Image Recognition

Add code
Bookmark button
Alert button
Nov 30, 2021
Lingchen Meng, Hengduo Li, Bor-Chun Chen, Shiyi Lan, Zuxuan Wu, Yu-Gang Jiang, Ser-Nam Lim

Figure 1 for AdaViT: Adaptive Vision Transformers for Efficient Image Recognition
Figure 2 for AdaViT: Adaptive Vision Transformers for Efficient Image Recognition
Figure 3 for AdaViT: Adaptive Vision Transformers for Efficient Image Recognition
Figure 4 for AdaViT: Adaptive Vision Transformers for Efficient Image Recognition
Viaarxiv icon

Efficient Video Transformers with Spatial-Temporal Token Selection

Add code
Bookmark button
Alert button
Nov 23, 2021
Junke Wang, Xitong Yang, Hengduo Li, Zuxuan Wu, Yu-Gang Jiang

Figure 1 for Efficient Video Transformers with Spatial-Temporal Token Selection
Figure 2 for Efficient Video Transformers with Spatial-Temporal Token Selection
Figure 3 for Efficient Video Transformers with Spatial-Temporal Token Selection
Figure 4 for Efficient Video Transformers with Spatial-Temporal Token Selection
Viaarxiv icon

Rethinking Pseudo Labels for Semi-Supervised Object Detection

Add code
Bookmark button
Alert button
Jun 01, 2021
Hengduo Li, Zuxuan Wu, Abhinav Shrivastava, Larry S. Davis

Figure 1 for Rethinking Pseudo Labels for Semi-Supervised Object Detection
Figure 2 for Rethinking Pseudo Labels for Semi-Supervised Object Detection
Figure 3 for Rethinking Pseudo Labels for Semi-Supervised Object Detection
Figure 4 for Rethinking Pseudo Labels for Semi-Supervised Object Detection
Viaarxiv icon

HMS: Hierarchical Modality Selection for Efficient Video Recognition

Add code
Bookmark button
Alert button
Apr 21, 2021
Zejia Weng, Zuxuan Wu, Hengduo Li, Yu-Gang Jiang

Figure 1 for HMS: Hierarchical Modality Selection for Efficient Video Recognition
Figure 2 for HMS: Hierarchical Modality Selection for Efficient Video Recognition
Figure 3 for HMS: Hierarchical Modality Selection for Efficient Video Recognition
Figure 4 for HMS: Hierarchical Modality Selection for Efficient Video Recognition
Viaarxiv icon

2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition

Add code
Bookmark button
Alert button
Dec 29, 2020
Hengduo Li, Zuxuan Wu, Abhinav Shrivastava, Larry S. Davis

Figure 1 for 2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition
Figure 2 for 2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition
Figure 3 for 2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition
Figure 4 for 2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition
Viaarxiv icon