Bo Zhang

MLVU: A Comprehensive Benchmark for Multi-Task Long Video Understanding

Jun 06, 2024
Via arXiv

Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving

May 24, 2024
Via arXiv

DiffCalib: Reformulating Monocular Camera Calibration as Diffusion-Based Dense Incident Map Generation

May 24, 2024
Via arXiv

Integer Scale: A Free Lunch for Faster Fine-grained Quantization of LLMs

May 23, 2024
Via arXiv

ECLIPSE: Semantic Entropy-LCS for Cross-Lingual Industrial Log Parsing

May 22, 2024
Via arXiv

Exploring and Exploiting the Asymmetric Valley of Deep Neural Networks

May 21, 2024
Via arXiv

Discovering Physics-Informed Neural Networks Model for Solving Partial Differential Equations through Evolutionary Computation

May 18, 2024
Via arXiv

Distilling Implicit Multimodal Knowledge into LLMs for Zero-Resource Dialogue Generation

May 16, 2024
Via arXiv

DiffMap: Enhancing Map Segmentation with Map Prior Using Diffusion Model

May 03, 2024
Via arXiv

How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites

Apr 29, 2024
Via arXiv