Picture for Yuxing Long

Yuxing Long

InstructNav: Zero-shot System for Generic Instruction Navigation in Unexplored Environment

Jun 07, 2024
Viaarxiv icon

ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation

Dec 24, 2023
Viaarxiv icon

Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill

Add code
Sep 21, 2023
Figure 1 for Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill
Figure 2 for Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill
Figure 3 for Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill
Figure 4 for Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill
Viaarxiv icon

Discuss Before Moving: Visual Language Navigation via Multi-expert Discussions

Sep 20, 2023
Figure 1 for Discuss Before Moving: Visual Language Navigation via Multi-expert Discussions
Figure 2 for Discuss Before Moving: Visual Language Navigation via Multi-expert Discussions
Figure 3 for Discuss Before Moving: Visual Language Navigation via Multi-expert Discussions
Figure 4 for Discuss Before Moving: Visual Language Navigation via Multi-expert Discussions
Viaarxiv icon

VDialogUE: A Unified Evaluation Benchmark for Visually-grounded Dialogue

Add code
Sep 14, 2023
Figure 1 for VDialogUE: A Unified Evaluation Benchmark for Visually-grounded Dialogue
Figure 2 for VDialogUE: A Unified Evaluation Benchmark for Visually-grounded Dialogue
Figure 3 for VDialogUE: A Unified Evaluation Benchmark for Visually-grounded Dialogue
Figure 4 for VDialogUE: A Unified Evaluation Benchmark for Visually-grounded Dialogue
Viaarxiv icon

Whether you can locate or not? Interactive Referring Expression Generation

Add code
Aug 19, 2023
Figure 1 for Whether you can locate or not? Interactive Referring Expression Generation
Figure 2 for Whether you can locate or not? Interactive Referring Expression Generation
Figure 3 for Whether you can locate or not? Interactive Referring Expression Generation
Figure 4 for Whether you can locate or not? Interactive Referring Expression Generation
Viaarxiv icon

Multimodal Recommendation Dialog with Subjective Preference: A New Challenge and Benchmark

Add code
May 26, 2023
Figure 1 for Multimodal Recommendation Dialog with Subjective Preference: A New Challenge and Benchmark
Figure 2 for Multimodal Recommendation Dialog with Subjective Preference: A New Challenge and Benchmark
Figure 3 for Multimodal Recommendation Dialog with Subjective Preference: A New Challenge and Benchmark
Figure 4 for Multimodal Recommendation Dialog with Subjective Preference: A New Challenge and Benchmark
Viaarxiv icon

SPRING: Situated Conversation Agent Pretrained with Multimodal Questions from Incremental Layout Graph

Add code
Jan 05, 2023
Figure 1 for SPRING: Situated Conversation Agent Pretrained with Multimodal Questions from Incremental Layout Graph
Figure 2 for SPRING: Situated Conversation Agent Pretrained with Multimodal Questions from Incremental Layout Graph
Figure 3 for SPRING: Situated Conversation Agent Pretrained with Multimodal Questions from Incremental Layout Graph
Figure 4 for SPRING: Situated Conversation Agent Pretrained with Multimodal Questions from Incremental Layout Graph
Viaarxiv icon