Picture for Zhongyu Wei

Zhongyu Wei

VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models

Add code
May 28, 2024
Viaarxiv icon

Beyond ESM2: Graph-Enhanced Protein Sequence Modeling with Efficient Clustering

Add code
Apr 24, 2024
Viaarxiv icon

DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning

Add code
Apr 02, 2024
Figure 1 for DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning
Figure 2 for DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning
Figure 3 for DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning
Figure 4 for DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning
Viaarxiv icon

ALaRM: Align Language Models via Hierarchical Rewards Modeling

Add code
Mar 16, 2024
Figure 1 for ALaRM: Align Language Models via Hierarchical Rewards Modeling
Figure 2 for ALaRM: Align Language Models via Hierarchical Rewards Modeling
Figure 3 for ALaRM: Align Language Models via Hierarchical Rewards Modeling
Figure 4 for ALaRM: Align Language Models via Hierarchical Rewards Modeling
Viaarxiv icon

Debatrix: Multi-dimensinal Debate Judge with Iterative Chronological Analysis Based on LLM

Add code
Mar 12, 2024
Figure 1 for Debatrix: Multi-dimensinal Debate Judge with Iterative Chronological Analysis Based on LLM
Figure 2 for Debatrix: Multi-dimensinal Debate Judge with Iterative Chronological Analysis Based on LLM
Figure 3 for Debatrix: Multi-dimensinal Debate Judge with Iterative Chronological Analysis Based on LLM
Figure 4 for Debatrix: Multi-dimensinal Debate Judge with Iterative Chronological Analysis Based on LLM
Viaarxiv icon

Android in the Zoo: Chain-of-Action-Thought for GUI Agents

Add code
Mar 05, 2024
Figure 1 for Android in the Zoo: Chain-of-Action-Thought for GUI Agents
Figure 2 for Android in the Zoo: Chain-of-Action-Thought for GUI Agents
Figure 3 for Android in the Zoo: Chain-of-Action-Thought for GUI Agents
Figure 4 for Android in the Zoo: Chain-of-Action-Thought for GUI Agents
Viaarxiv icon

Unveiling the Truth and Facilitating Change: Towards Agent-based Large-scale Social Movement Simulation

Add code
Feb 26, 2024
Viaarxiv icon

AI Hospital: Interactive Evaluation and Collaboration of LLMs as Intern Doctors for Clinical Diagnosis

Add code
Feb 21, 2024
Viaarxiv icon

SoMeLVLM: A Large Vision Language Model for Social Media Processing

Add code
Feb 20, 2024
Viaarxiv icon

Can LLMs Reason with Rules? Logic Scaffolding for Stress-Testing and Improving LLMs

Add code
Feb 18, 2024
Viaarxiv icon