Alert button
Picture for Wenze Hu

Wenze Hu

Alert button

Guiding Instruction-based Image Editing via Multimodal Large Language Models

Add code
Bookmark button
Alert button
Sep 29, 2023
Tsu-Jui Fu, Wenze Hu, Xianzhi Du, William Yang Wang, Yinfei Yang, Zhe Gan

Figure 1 for Guiding Instruction-based Image Editing via Multimodal Large Language Models
Figure 2 for Guiding Instruction-based Image Editing via Multimodal Large Language Models
Figure 3 for Guiding Instruction-based Image Editing via Multimodal Large Language Models
Figure 4 for Guiding Instruction-based Image Editing via Multimodal Large Language Models
Viaarxiv icon

Million-scale Object Detection with Large Vision Model

Add code
Bookmark button
Alert button
Dec 19, 2022
Feng Lin, Wenze Hu, Yaowei Wang, Yonghong Tian, Guangming Lu, Fanglin Chen, Yong Xu, Xiaoyu Wang

Figure 1 for Million-scale Object Detection with Large Vision Model
Figure 2 for Million-scale Object Detection with Large Vision Model
Figure 3 for Million-scale Object Detection with Large Vision Model
Figure 4 for Million-scale Object Detection with Large Vision Model
Viaarxiv icon

NAR-Former: Neural Architecture Representation Learning towards Holistic Attributes Prediction

Add code
Bookmark button
Alert button
Nov 15, 2022
Yun Yi, Haokui Zhang, Wenze Hu, Nannan Wang, Xiaoyu Wang

Figure 1 for NAR-Former: Neural Architecture Representation Learning towards Holistic Attributes Prediction
Figure 2 for NAR-Former: Neural Architecture Representation Learning towards Holistic Attributes Prediction
Figure 3 for NAR-Former: Neural Architecture Representation Learning towards Holistic Attributes Prediction
Figure 4 for NAR-Former: Neural Architecture Representation Learning towards Holistic Attributes Prediction
Viaarxiv icon

CabViT: Cross Attention among Blocks for Vision Transformer

Add code
Bookmark button
Alert button
Nov 14, 2022
Haokui Zhang, Wenze Hu, Xiaoyu Wang

Figure 1 for CabViT: Cross Attention among Blocks for Vision Transformer
Figure 2 for CabViT: Cross Attention among Blocks for Vision Transformer
Figure 3 for CabViT: Cross Attention among Blocks for Vision Transformer
Figure 4 for CabViT: Cross Attention among Blocks for Vision Transformer
Viaarxiv icon

ParCNetV2: Oversized Kernel with Enhanced Attention

Add code
Bookmark button
Alert button
Nov 14, 2022
Ruihan Xu, Haokui Zhang, Wenze Hu, Shiliang Zhang, Xiaoyu Wang

Figure 1 for ParCNetV2: Oversized Kernel with Enhanced Attention
Figure 2 for ParCNetV2: Oversized Kernel with Enhanced Attention
Figure 3 for ParCNetV2: Oversized Kernel with Enhanced Attention
Figure 4 for ParCNetV2: Oversized Kernel with Enhanced Attention
Viaarxiv icon

Fast-ParC: Position Aware Global Kernel for ConvNets and ViTs

Add code
Bookmark button
Alert button
Oct 08, 2022
Tao Yang, Haokui Zhang, Wenze Hu, Changwen Chen, Xiaoyu Wang

Figure 1 for Fast-ParC: Position Aware Global Kernel for ConvNets and ViTs
Figure 2 for Fast-ParC: Position Aware Global Kernel for ConvNets and ViTs
Figure 3 for Fast-ParC: Position Aware Global Kernel for ConvNets and ViTs
Figure 4 for Fast-ParC: Position Aware Global Kernel for ConvNets and ViTs
Viaarxiv icon

ALBench: A Framework for Evaluating Active Learning in Object Detection

Add code
Bookmark button
Alert button
Aug 10, 2022
Zhanpeng Feng, Shiliang Zhang, Rinyoichi Takezoe, Wenze Hu, Manmohan Chandraker, Li-Jia Li, Vijay K. Narayanan, Xiaoyu Wang

Figure 1 for ALBench: A Framework for Evaluating Active Learning in Object Detection
Figure 2 for ALBench: A Framework for Evaluating Active Learning in Object Detection
Figure 3 for ALBench: A Framework for Evaluating Active Learning in Object Detection
Figure 4 for ALBench: A Framework for Evaluating Active Learning in Object Detection
Viaarxiv icon

Implementation of an Automated Learning System for Non-experts

Add code
Bookmark button
Alert button
Mar 26, 2022
Phoenix X. Huang, Zhiwei Zhao, Chao Liu, Jingyi Liu, Wenze Hu, Xiaoyu Wang

Figure 1 for Implementation of an Automated Learning System for Non-experts
Figure 2 for Implementation of an Automated Learning System for Non-experts
Figure 3 for Implementation of an Automated Learning System for Non-experts
Figure 4 for Implementation of an Automated Learning System for Non-experts
Viaarxiv icon

EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers

Add code
Bookmark button
Alert button
Mar 15, 2022
Haokui Zhang, Wenze Hu, Xiaoyu Wang

Figure 1 for EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers
Figure 2 for EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers
Figure 3 for EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers
Figure 4 for EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers
Viaarxiv icon

YMIR: A Rapid Data-centric Development Platform for Vision Applications

Add code
Bookmark button
Alert button
Nov 27, 2021
Phoenix X. Huang, Wenze Hu, William Brendel, Manmohan Chandraker, Li-Jia Li, Xiaoyu Wang

Figure 1 for YMIR: A Rapid Data-centric Development Platform for Vision Applications
Figure 2 for YMIR: A Rapid Data-centric Development Platform for Vision Applications
Figure 3 for YMIR: A Rapid Data-centric Development Platform for Vision Applications
Figure 4 for YMIR: A Rapid Data-centric Development Platform for Vision Applications
Viaarxiv icon