Picture for Fuhai Chen

Fuhai Chen

3SHNet: Boosting Image-Sentence Retrieval via Visual Semantic-Spatial Self-Highlighting

Add code
Apr 26, 2024
Viaarxiv icon

Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval

Oct 17, 2022
Figure 1 for Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval
Figure 2 for Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval
Figure 3 for Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval
Figure 4 for Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval
Viaarxiv icon

Global2Local: A Joint-Hierarchical Attention for Video Captioning

Mar 13, 2022
Figure 1 for Global2Local: A Joint-Hierarchical Attention for Video Captioning
Figure 2 for Global2Local: A Joint-Hierarchical Attention for Video Captioning
Figure 3 for Global2Local: A Joint-Hierarchical Attention for Video Captioning
Figure 4 for Global2Local: A Joint-Hierarchical Attention for Video Captioning
Viaarxiv icon

Factored Attention and Embedding for Unstructured-view Topic-related Ultrasound Report Generation

Mar 12, 2022
Figure 1 for Factored Attention and Embedding for Unstructured-view Topic-related Ultrasound Report Generation
Figure 2 for Factored Attention and Embedding for Unstructured-view Topic-related Ultrasound Report Generation
Figure 3 for Factored Attention and Embedding for Unstructured-view Topic-related Ultrasound Report Generation
Viaarxiv icon

Differentiated Relevances Embedding for Group-based Referring Expression Comprehension

Add code
Mar 12, 2022
Figure 1 for Differentiated Relevances Embedding for Group-based Referring Expression Comprehension
Figure 2 for Differentiated Relevances Embedding for Group-based Referring Expression Comprehension
Figure 3 for Differentiated Relevances Embedding for Group-based Referring Expression Comprehension
Figure 4 for Differentiated Relevances Embedding for Group-based Referring Expression Comprehension
Viaarxiv icon

Weakly-Supervised Dense Action Anticipation

Add code
Nov 15, 2021
Figure 1 for Weakly-Supervised Dense Action Anticipation
Figure 2 for Weakly-Supervised Dense Action Anticipation
Figure 3 for Weakly-Supervised Dense Action Anticipation
Figure 4 for Weakly-Supervised Dense Action Anticipation
Viaarxiv icon

Structured Multi-modal Feature Embedding and Alignment for Image-Sentence Retrieval

Aug 05, 2021
Figure 1 for Structured Multi-modal Feature Embedding and Alignment for Image-Sentence Retrieval
Figure 2 for Structured Multi-modal Feature Embedding and Alignment for Image-Sentence Retrieval
Figure 3 for Structured Multi-modal Feature Embedding and Alignment for Image-Sentence Retrieval
Figure 4 for Structured Multi-modal Feature Embedding and Alignment for Image-Sentence Retrieval
Viaarxiv icon

Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network

Dec 13, 2020
Figure 1 for Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network
Figure 2 for Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network
Figure 3 for Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network
Figure 4 for Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network
Viaarxiv icon

Semantic-aware Image Deblurring

Add code
Oct 09, 2019
Figure 1 for Semantic-aware Image Deblurring
Figure 2 for Semantic-aware Image Deblurring
Figure 3 for Semantic-aware Image Deblurring
Figure 4 for Semantic-aware Image Deblurring
Viaarxiv icon

Scene-based Factored Attention for Image Captioning

Add code
Sep 02, 2019
Figure 1 for Scene-based Factored Attention for Image Captioning
Figure 2 for Scene-based Factored Attention for Image Captioning
Figure 3 for Scene-based Factored Attention for Image Captioning
Figure 4 for Scene-based Factored Attention for Image Captioning
Viaarxiv icon