Picture for Shota Nakada

Shota Nakada

On the Audio Hallucinations in Large Audio-Video Language Models

Add code
Jan 18, 2024
Viaarxiv icon

Large-scale Vision-Language Models Learn Super Images for Efficient and High-Performance Partially Relevant Video Retrieval

Add code
Dec 01, 2023
Viaarxiv icon