Picture for Xuming He

Xuming He

From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models

Add code
Apr 06, 2024
Figure 1 for From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models
Figure 2 for From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models
Figure 3 for From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models
Figure 4 for From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models
Viaarxiv icon

SP$^2$OT: Semantic-Regularized Progressive Partial Optimal Transport for Imbalanced Clustering

Add code
Apr 04, 2024
Viaarxiv icon

Learning by Correction: Efficient Tuning Task for Zero-Shot Generative Vision-Language Reasoning

Add code
Apr 01, 2024
Viaarxiv icon

DSGG: Dense Relation Transformer for an End-to-end Scene Graph Generation

Add code
Mar 21, 2024
Viaarxiv icon

RealDex: Towards Human-like Grasping for Robotic Dexterous Hand

Add code
Feb 21, 2024
Viaarxiv icon

SGTR+: End-to-end Scene Graph Generation with Transformer

Add code
Jan 23, 2024
Viaarxiv icon

P$^2$OT: Progressive Partial Optimal Transport for Deep Imbalanced Clustering

Add code
Jan 17, 2024
Figure 1 for P$^2$OT: Progressive Partial Optimal Transport for Deep Imbalanced Clustering
Figure 2 for P$^2$OT: Progressive Partial Optimal Transport for Deep Imbalanced Clustering
Figure 3 for P$^2$OT: Progressive Partial Optimal Transport for Deep Imbalanced Clustering
Figure 4 for P$^2$OT: Progressive Partial Optimal Transport for Deep Imbalanced Clustering
Viaarxiv icon

Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training

Add code
Jan 04, 2024
Viaarxiv icon

GenEM: Physics-Informed Generative Cryo-Electron Microscopy

Dec 04, 2023
Viaarxiv icon

Gradient-Map-Guided Adaptive Domain Generalization for Cross Modality MRI Segmentation

Add code
Nov 16, 2023
Viaarxiv icon