Picture for Kaining Ying

Kaining Ying

MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI

Add code
Apr 24, 2024
Figure 1 for MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
Figure 2 for MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
Figure 3 for MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
Figure 4 for MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
Viaarxiv icon

CTVIS: Consistent Training for Online Video Instance Segmentation

Add code
Jul 24, 2023
Figure 1 for CTVIS: Consistent Training for Online Video Instance Segmentation
Figure 2 for CTVIS: Consistent Training for Online Video Instance Segmentation
Figure 3 for CTVIS: Consistent Training for Online Video Instance Segmentation
Figure 4 for CTVIS: Consistent Training for Online Video Instance Segmentation
Viaarxiv icon

Human-to-Human Interaction Detection

Jul 02, 2023
Figure 1 for Human-to-Human Interaction Detection
Figure 2 for Human-to-Human Interaction Detection
Figure 3 for Human-to-Human Interaction Detection
Figure 4 for Human-to-Human Interaction Detection
Viaarxiv icon

ISDA: Position-Aware Instance Segmentation with Deformable Attention

Feb 23, 2022
Figure 1 for ISDA: Position-Aware Instance Segmentation with Deformable Attention
Figure 2 for ISDA: Position-Aware Instance Segmentation with Deformable Attention
Figure 3 for ISDA: Position-Aware Instance Segmentation with Deformable Attention
Figure 4 for ISDA: Position-Aware Instance Segmentation with Deformable Attention
Viaarxiv icon