Picture for Salman Khan

Salman Khan

Efficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learning

Add code
Jun 06, 2024
Viaarxiv icon

Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation

Add code
Jun 04, 2024
Viaarxiv icon

Dual Hyperspectral Mamba for Efficient Spectral Compressive Imaging

Add code
Jun 01, 2024
Viaarxiv icon

Multi-modal Generation via Cross-Modal In-Context Learning

Add code
May 28, 2024
Viaarxiv icon

Adapting Large Multimodal Models to Distribution Shifts: The Role of In-Context Learning

Add code
May 20, 2024
Viaarxiv icon

How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs

Add code
May 08, 2024
Figure 1 for How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs
Figure 2 for How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs
Figure 3 for How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs
Figure 4 for How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs
Viaarxiv icon

Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs

Add code
May 06, 2024
Figure 1 for Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs
Figure 2 for Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs
Figure 3 for Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs
Figure 4 for Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs
Viaarxiv icon

Visual-Augmented Dynamic Semantic Prototype for Generative Zero-Shot Learning

Apr 23, 2024
Viaarxiv icon

Cross-Modal Self-Training: Aligning Images and Pointclouds to Learn Classification without Labels

Add code
Apr 15, 2024
Viaarxiv icon

Progressive Semantic-Guided Vision Transformer for Zero-Shot Learning

Apr 11, 2024
Viaarxiv icon