Yicong Hong

NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation

Mar 01, 2024
Jiazhao Zhang, Kunyu Wang, Rongtao Xu, Gengze Zhou, Yicong Hong, Xiaomeng Fang, Qi Wu, Zhizheng Zhang, Wang He

Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model

Nov 23, 2023
Jiahao Li, Hao Tan, Kai Zhang, Zexiang Xu, Fujun Luan, Yinghao Xu, Yicong Hong, Kalyan Sunkavalli, Greg Shakhnarovich, Sai Bi

LRM: Large Reconstruction Model for Single Image to 3D

Nov 08, 2023
Yicong Hong, Kai Zhang, Jiuxiang Gu, Sai Bi, Yang Zhou, Difan Liu, Feng Liu, Kalyan Sunkavalli, Trung Bui, Hao Tan

Scaling Data Generation in Vision-and-Language Navigation

Aug 09, 2023
Zun Wang, Jialu Li, Yicong Hong, Yi Wang, Qi Wu, Mohit Bansal, Stephen Gould, Hao Tan, Yu Qiao

Learning Navigational Visual Representations with Semantic Map Supervision

Jul 23, 2023
Yicong Hong, Yang Zhou, Ruiyi Zhang, Franck Dernoncourt, Trung Bui, Stephen Gould, Hao Tan

NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models

May 29, 2023
Gengze Zhou, Yicong Hong, Qi Wu

Bi-directional Training for Composed Image Retrieval via Text Prompt Learning

Mar 29, 2023
Zheyuan Liu, Weixuan Sun, Yicong Hong, Damien Teney, Stephen Gould

1st Place Solutions for RxR-Habitat Vision-and-Language Navigation Competition (CVPR 2022)

Jun 26, 2022
Dong An, Zun Wang, Yangguang Li, Yi Wang, Yicong Hong, Yan Huang, Liang Wang, Jing Shao

HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation

Mar 22, 2022
Yanyuan Qiao, Yuankai Qi, Yicong Hong, Zheng Yu, Peng Wang, Qi Wu

Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation

Mar 05, 2022
Yicong Hong, Zun Wang, Qi Wu, Stephen Gould
