Alert button
Picture for Hang Su

Hang Su

Alert button

Omniview-Tuning: Boosting Viewpoint Invariance of Vision-Language Pre-training Models

Add code
Bookmark button
Alert button
Apr 18, 2024
Shouwei Ruan, Yinpeng Dong, Hanqing Liu, Yao Huang, Hang Su, Xingxing Wei

Viaarxiv icon

Exploring the Transferability of Visual Prompting for Multimodal Large Language Models

Add code
Bookmark button
Alert button
Apr 17, 2024
Yichi Zhang, Yinpeng Dong, Siyuan Zhang, Tianzan Min, Hang Su, Jun Zhu

Viaarxiv icon

FaceCat: Enhancing Face Recognition Security with a Unified Generative Model Framework

Add code
Bookmark button
Alert button
Apr 14, 2024
Jiawei Chen, Xiao Yang, Yinpeng Dong, Hang Su, Jianteng Peng, Zhaoxia Yin

Viaarxiv icon

An N-Point Linear Solver for Line and Motion Estimation with Event Cameras

Add code
Bookmark button
Alert button
Apr 01, 2024
Ling Gao, Daniel Gehrig, Hang Su, Davide Scaramuzza, Laurent Kneip

Viaarxiv icon

Embodied Active Defense: Leveraging Recurrent Feedback to Counter Adversarial Patches

Add code
Bookmark button
Alert button
Mar 31, 2024
Lingxuan Wu, Xiao Yang, Yinpeng Dong, Liuwei Xie, Hang Su, Jun Zhu

Viaarxiv icon

CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model

Add code
Bookmark button
Alert button
Mar 08, 2024
Zhengyi Wang, Yikai Wang, Yifei Chen, Chendong Xiang, Shuo Chen, Dajiang Yu, Chongxuan Li, Hang Su, Jun Zhu

Figure 1 for CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model
Figure 2 for CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model
Figure 3 for CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model
Figure 4 for CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model
Viaarxiv icon

DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-Training

Add code
Bookmark button
Alert button
Mar 08, 2024
Zhongkai Hao, Chang Su, Songming Liu, Julius Berner, Chengyang Ying, Hang Su, Anima Anandkumar, Jian Song, Jun Zhu

Figure 1 for DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-Training
Figure 2 for DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-Training
Figure 3 for DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-Training
Figure 4 for DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-Training
Viaarxiv icon

Can Your Model Tell a Negation from an Implicature? Unravelling Challenges With Intent Encoders

Add code
Bookmark button
Alert button
Mar 07, 2024
Yuwei Zhang, Siffi Singh, Sailik Sengupta, Igor Shalyminov, Hang Su, Hwanjun Song, Saab Mansour

Figure 1 for Can Your Model Tell a Negation from an Implicature? Unravelling Challenges With Intent Encoders
Figure 2 for Can Your Model Tell a Negation from an Implicature? Unravelling Challenges With Intent Encoders
Figure 3 for Can Your Model Tell a Negation from an Implicature? Unravelling Challenges With Intent Encoders
Figure 4 for Can Your Model Tell a Negation from an Implicature? Unravelling Challenges With Intent Encoders
Viaarxiv icon

Semi-Supervised Dialogue Abstractive Summarization via High-Quality Pseudolabel Selection

Add code
Bookmark button
Alert button
Mar 06, 2024
Jianfeng He, Hang Su, Jason Cai, Igor Shalyminov, Hwanjun Song, Saab Mansour

Figure 1 for Semi-Supervised Dialogue Abstractive Summarization via High-Quality Pseudolabel Selection
Figure 2 for Semi-Supervised Dialogue Abstractive Summarization via High-Quality Pseudolabel Selection
Figure 3 for Semi-Supervised Dialogue Abstractive Summarization via High-Quality Pseudolabel Selection
Figure 4 for Semi-Supervised Dialogue Abstractive Summarization via High-Quality Pseudolabel Selection
Viaarxiv icon

MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets

Add code
Bookmark button
Alert button
Mar 05, 2024
Hossein Aboutalebi, Hwanjun Song, Yusheng Xie, Arshit Gupta, Justin Sun, Hang Su, Igor Shalyminov, Nikolaos Pappas, Siffi Singh, Saab Mansour

Figure 1 for MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets
Figure 2 for MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets
Figure 3 for MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets
Figure 4 for MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets
Viaarxiv icon