Alert button
Picture for Jia-Hong Huang

Jia-Hong Huang

Alert button

Enhancing Interactive Image Retrieval With Query Rewriting Using Large Language Models and Vision Language Models

Add code
Bookmark button
Alert button
Apr 29, 2024
Hongyi Zhu, Jia-Hong Huang, Stevan Rudinac, Evangelos Kanoulas

Viaarxiv icon

Conditional Modeling Based Automatic Video Summarization

Add code
Bookmark button
Alert button
Nov 20, 2023
Jia-Hong Huang, Chao-Han Huck Yang, Pin-Yu Chen, Min-Hung Chen, Marcel Worring

Viaarxiv icon

Causal Video Summarizer for Video Exploration

Add code
Bookmark button
Alert button
Jul 04, 2023
Jia-Hong Huang, Chao-Han Huck Yang, Pin-Yu Chen, Andrew Brown, Marcel Worring

Figure 1 for Causal Video Summarizer for Video Exploration
Figure 2 for Causal Video Summarizer for Video Exploration
Figure 3 for Causal Video Summarizer for Video Exploration
Figure 4 for Causal Video Summarizer for Video Exploration
Viaarxiv icon

Query-based Video Summarization with Pseudo Label Supervision

Add code
Bookmark button
Alert button
Jul 04, 2023
Jia-Hong Huang, Luka Murn, Marta Mrak, Marcel Worring

Figure 1 for Query-based Video Summarization with Pseudo Label Supervision
Figure 2 for Query-based Video Summarization with Pseudo Label Supervision
Figure 3 for Query-based Video Summarization with Pseudo Label Supervision
Figure 4 for Query-based Video Summarization with Pseudo Label Supervision
Viaarxiv icon

Causalainer: Causal Explainer for Automatic Video Summarization

Add code
Bookmark button
Alert button
Apr 30, 2023
Jia-Hong Huang, Chao-Han Huck Yang, Pin-Yu Chen, Min-Hung Chen, Marcel Worring

Figure 1 for Causalainer: Causal Explainer for Automatic Video Summarization
Figure 2 for Causalainer: Causal Explainer for Automatic Video Summarization
Figure 3 for Causalainer: Causal Explainer for Automatic Video Summarization
Figure 4 for Causalainer: Causal Explainer for Automatic Video Summarization
Viaarxiv icon

Improving Visual Question Answering Models through Robustness Analysis and In-Context Learning with a Chain of Basic Questions

Add code
Bookmark button
Alert button
Apr 06, 2023
Jia-Hong Huang, Modar Alfadly, Bernard Ghanem, Marcel Worring

Figure 1 for Improving Visual Question Answering Models through Robustness Analysis and In-Context Learning with a Chain of Basic Questions
Figure 2 for Improving Visual Question Answering Models through Robustness Analysis and In-Context Learning with a Chain of Basic Questions
Figure 3 for Improving Visual Question Answering Models through Robustness Analysis and In-Context Learning with a Chain of Basic Questions
Figure 4 for Improving Visual Question Answering Models through Robustness Analysis and In-Context Learning with a Chain of Basic Questions
Viaarxiv icon

The Dawn of Quantum Natural Language Processing

Add code
Bookmark button
Alert button
Oct 13, 2021
Riccardo Di Sipio, Jia-Hong Huang, Samuel Yen-Chi Chen, Stefano Mangini, Marcel Worring

Figure 1 for The Dawn of Quantum Natural Language Processing
Figure 2 for The Dawn of Quantum Natural Language Processing
Figure 3 for The Dawn of Quantum Natural Language Processing
Figure 4 for The Dawn of Quantum Natural Language Processing
Viaarxiv icon

Longer Version for "Deep Context-Encoding Network for Retinal Image Captioning"

Add code
Bookmark button
Alert button
May 30, 2021
Jia-Hong Huang, Ting-Wei Wu, Chao-Han Huck Yang, Marcel Worring

Figure 1 for Longer Version for "Deep Context-Encoding Network for Retinal Image Captioning"
Figure 2 for Longer Version for "Deep Context-Encoding Network for Retinal Image Captioning"
Figure 3 for Longer Version for "Deep Context-Encoding Network for Retinal Image Captioning"
Figure 4 for Longer Version for "Deep Context-Encoding Network for Retinal Image Captioning"
Viaarxiv icon

Contextualized Keyword Representations for Multi-modal Retinal Image Captioning

Add code
Bookmark button
Alert button
Apr 26, 2021
Jia-Hong Huang, Ting-Wei Wu, Marcel Worring

Figure 1 for Contextualized Keyword Representations for Multi-modal Retinal Image Captioning
Figure 2 for Contextualized Keyword Representations for Multi-modal Retinal Image Captioning
Figure 3 for Contextualized Keyword Representations for Multi-modal Retinal Image Captioning
Figure 4 for Contextualized Keyword Representations for Multi-modal Retinal Image Captioning
Viaarxiv icon

GPT2MVS: Generative Pre-trained Transformer-2 for Multi-modal Video Summarization

Add code
Bookmark button
Alert button
Apr 26, 2021
Jia-Hong Huang, Luka Murn, Marta Mrak, Marcel Worring

Figure 1 for GPT2MVS: Generative Pre-trained Transformer-2 for Multi-modal Video Summarization
Figure 2 for GPT2MVS: Generative Pre-trained Transformer-2 for Multi-modal Video Summarization
Figure 3 for GPT2MVS: Generative Pre-trained Transformer-2 for Multi-modal Video Summarization
Figure 4 for GPT2MVS: Generative Pre-trained Transformer-2 for Multi-modal Video Summarization
Viaarxiv icon