Ximeng Sun

Koala: Key frame-conditioned long video-LLM

Apr 05, 2024
Reuben Tan, Ximeng Sun, Ping Hu, Jui-hsien Wang, Hanieh Deilamsalehy, Bryan A. Plummer, Bryan Russell, Kate Saenko

Via arXiv

CLAMP: Contrastive LAnguage Model Prompt-tuning

Dec 04, 2023
Piotr Teterwak, Ximeng Sun, Bryan A. Plummer, Kate Saenko, Ser-Nam Lim

Via arXiv

Label Budget Allocation in Multi-Task Learning

Aug 24, 2023
Ximeng Sun, Kihyuk Sohn, Kate Saenko, Clayton Mellina, Xiao Bian

Via arXiv

DualCoOp++: Fast and Effective Adaptation to Multi-Label Recognition with Limited Annotations

Aug 03, 2023
Ping Hu, Ximeng Sun, Stan Sclaroff, Kate Saenko

Via arXiv

DIME-FM: DIstilling Multimodal and Efficient Foundation Models

Mar 31, 2023
Ximeng Sun, Pengchuan Zhang, Peizhao Zhang, Hardik Shah, Kate Saenko, Xide Xia

Via arXiv

DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited Annotations

Jun 20, 2022
Ximeng Sun, Ping Hu, Kate Saenko

Via arXiv

Dynamic Network Quantization for Efficient Video Inference

Aug 23, 2021
Ximeng Sun, Rameswar Panda, Chun-Fu Chen, Aude Oliva, Rogerio Feris, Kate Saenko

Via arXiv

AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition

May 12, 2021
Rameswar Panda, Chun-Fu Chen, Quanfu Fan, Ximeng Sun, Kate Saenko, Aude Oliva, Rogerio Feris

Via arXiv

All at Once Network Quantization via Collaborative Knowledge Transfer

Mar 02, 2021
Ximeng Sun, Rameswar Panda, Chun-Fu Chen, Naigang Wang, Bowen Pan, Kailash Gopalakrishnan, Aude Oliva, Rogerio Feris, Kate Saenko

Via arXiv

Revisiting Few-shot Activity Detection with Class Similarity Control

Mar 31, 2020
Huijuan Xu, Ximeng Sun, Eric Tzeng, Abir Das, Kate Saenko, Trevor Darrell

Via arXiv