Alert button
Picture for Jun-Yan He

Jun-Yan He

Alert button

MM-TTS: A Unified Framework for Multimodal, Prompt-Induced Emotional Text-to-Speech Synthesis

Add code
Bookmark button
Alert button
Apr 29, 2024
Xiang Li, Zhi-Qi Cheng, Jun-Yan He, Xiaojiang Peng, Alexander G. Hauptmann

Viaarxiv icon

Exploring Dynamic Transformer for Efficient Object Tracking

Add code
Bookmark button
Alert button
Mar 26, 2024
Jiawen Zhu, Xin Chen, Haiwen Diao, Shuai Li, Jun-Yan He, Chenyang Li, Bin Luo, Dong Wang, Huchuan Lu

Figure 1 for Exploring Dynamic Transformer for Efficient Object Tracking
Figure 2 for Exploring Dynamic Transformer for Efficient Object Tracking
Figure 3 for Exploring Dynamic Transformer for Efficient Object Tracking
Figure 4 for Exploring Dynamic Transformer for Efficient Object Tracking
Viaarxiv icon

DyRoNet: A Low-Rank Adapter Enhanced Dynamic Routing Network for Streaming Perception

Add code
Bookmark button
Alert button
Mar 15, 2024
Xiang Huang, Zhi-Qi Cheng, Jun-Yan He, Chenyang Li, Wangmeng Xiang, Baigui Sun, Xiao Wu

Figure 1 for DyRoNet: A Low-Rank Adapter Enhanced Dynamic Routing Network for Streaming Perception
Figure 2 for DyRoNet: A Low-Rank Adapter Enhanced Dynamic Routing Network for Streaming Perception
Figure 3 for DyRoNet: A Low-Rank Adapter Enhanced Dynamic Routing Network for Streaming Perception
Figure 4 for DyRoNet: A Low-Rank Adapter Enhanced Dynamic Routing Network for Streaming Perception
Viaarxiv icon

Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception

Add code
Bookmark button
Alert button
Mar 05, 2024
Junwen He, Yifan Wang, Lijun Wang, Huchuan Lu, Jun-Yan He, Jin-Peng Lan, Bin Luo, Xuansong Xie

Figure 1 for Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception
Figure 2 for Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception
Figure 3 for Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception
Figure 4 for Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception
Viaarxiv icon

WordArt Designer API: User-Driven Artistic Typography Synthesis with Large Language Models on ModelScope

Add code
Bookmark button
Alert button
Jan 12, 2024
Jun-Yan He, Zhi-Qi Cheng, Chenyang Li, Jingdong Sun, Wangmeng Xiang, Yusen Hu, Xianhui Lin, Xiaoyang Kang, Zengke Jin, Bin Luo, Yifeng Geng, Xuansong Xie, Jingren Zhou

Viaarxiv icon

Tracking with Human-Intent Reasoning

Add code
Bookmark button
Alert button
Dec 29, 2023
Jiawen Zhu, Zhi-Qi Cheng, Jun-Yan He, Chenyang Li, Bin Luo, Huchuan Lu, Yifeng Geng, Xuansong Xie

Viaarxiv icon

AnyText: Multilingual Visual Text Generation And Editing

Add code
Bookmark button
Alert button
Nov 07, 2023
Yuxiang Tuo, Wangmeng Xiang, Jun-Yan He, Yifeng Geng, Xuansong Xie

Viaarxiv icon

WordArt Designer: User-Driven Artistic Typography Synthesis using Large Language Models

Add code
Bookmark button
Alert button
Oct 20, 2023
Jun-Yan He, Zhi-Qi Cheng, Chenyang Li, Jingdong Sun, Wangmeng Xiang, Xianhui Lin, Xiaoyang Kang, Zengke Jin, Yusen Hu, Bin Luo, Yifeng Geng, Xuansong Xie, Jingren Zhou

Viaarxiv icon

DCPT: Darkness Clue-Prompted Tracking in Nighttime UAVs

Add code
Bookmark button
Alert button
Sep 19, 2023
Jiawen Zhu, Huayi Tang, Zhi-Qi Cheng, Jun-Yan He, Bin Luo, Shihao Qiu, Shengming Li, Huchuan Lu

Figure 1 for DCPT: Darkness Clue-Prompted Tracking in Nighttime UAVs
Figure 2 for DCPT: Darkness Clue-Prompted Tracking in Nighttime UAVs
Figure 3 for DCPT: Darkness Clue-Prompted Tracking in Nighttime UAVs
Figure 4 for DCPT: Darkness Clue-Prompted Tracking in Nighttime UAVs
Viaarxiv icon

Refined Temporal Pyramidal Compression-and-Amplification Transformer for 3D Human Pose Estimation

Add code
Bookmark button
Alert button
Sep 06, 2023
Hanbing Liu, Wangmeng Xiang, Jun-Yan He, Zhi-Qi Cheng, Bin Luo, Yifeng Geng, Xuansong Xie

Figure 1 for Refined Temporal Pyramidal Compression-and-Amplification Transformer for 3D Human Pose Estimation
Figure 2 for Refined Temporal Pyramidal Compression-and-Amplification Transformer for 3D Human Pose Estimation
Figure 3 for Refined Temporal Pyramidal Compression-and-Amplification Transformer for 3D Human Pose Estimation
Figure 4 for Refined Temporal Pyramidal Compression-and-Amplification Transformer for 3D Human Pose Estimation
Viaarxiv icon