Picture for Zhi-Qi Cheng

Zhi-Qi Cheng

MM-TTS: A Unified Framework for Multimodal, Prompt-Induced Emotional Text-to-Speech Synthesis

Add code
Apr 29, 2024
Viaarxiv icon

LEAF: Unveiling Two Sides of the Same Coin in Semi-supervised Facial Expression Recognition

Add code
Apr 26, 2024
Figure 1 for LEAF: Unveiling Two Sides of the Same Coin in Semi-supervised Facial Expression Recognition
Figure 2 for LEAF: Unveiling Two Sides of the Same Coin in Semi-supervised Facial Expression Recognition
Figure 3 for LEAF: Unveiling Two Sides of the Same Coin in Semi-supervised Facial Expression Recognition
Figure 4 for LEAF: Unveiling Two Sides of the Same Coin in Semi-supervised Facial Expression Recognition
Viaarxiv icon

MIPS at SemEval-2024 Task 3: Multimodal Emotion-Cause Pair Extraction in Conversations with Multimodal Language Models

Add code
Apr 11, 2024
Figure 1 for MIPS at SemEval-2024 Task 3: Multimodal Emotion-Cause Pair Extraction in Conversations with Multimodal Language Models
Figure 2 for MIPS at SemEval-2024 Task 3: Multimodal Emotion-Cause Pair Extraction in Conversations with Multimodal Language Models
Figure 3 for MIPS at SemEval-2024 Task 3: Multimodal Emotion-Cause Pair Extraction in Conversations with Multimodal Language Models
Figure 4 for MIPS at SemEval-2024 Task 3: Multimodal Emotion-Cause Pair Extraction in Conversations with Multimodal Language Models
Viaarxiv icon

IVAC-P2L: Leveraging Irregular Repetition Priors for Improving Video Action Counting

Add code
Mar 20, 2024
Figure 1 for IVAC-P2L: Leveraging Irregular Repetition Priors for Improving Video Action Counting
Figure 2 for IVAC-P2L: Leveraging Irregular Repetition Priors for Improving Video Action Counting
Figure 3 for IVAC-P2L: Leveraging Irregular Repetition Priors for Improving Video Action Counting
Figure 4 for IVAC-P2L: Leveraging Irregular Repetition Priors for Improving Video Action Counting
Viaarxiv icon

DyRoNet: A Low-Rank Adapter Enhanced Dynamic Routing Network for Streaming Perception

Add code
Mar 15, 2024
Figure 1 for DyRoNet: A Low-Rank Adapter Enhanced Dynamic Routing Network for Streaming Perception
Figure 2 for DyRoNet: A Low-Rank Adapter Enhanced Dynamic Routing Network for Streaming Perception
Figure 3 for DyRoNet: A Low-Rank Adapter Enhanced Dynamic Routing Network for Streaming Perception
Figure 4 for DyRoNet: A Low-Rank Adapter Enhanced Dynamic Routing Network for Streaming Perception
Viaarxiv icon

FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio

Add code
Mar 04, 2024
Figure 1 for FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio
Figure 2 for FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio
Figure 3 for FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio
Figure 4 for FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio
Viaarxiv icon

WordArt Designer API: User-Driven Artistic Typography Synthesis with Large Language Models on ModelScope

Add code
Jan 12, 2024
Viaarxiv icon

Tracking with Human-Intent Reasoning

Add code
Dec 29, 2023
Viaarxiv icon

ProS: Prompting-to-simulate Generalized knowledge for Universal Cross-Domain Retrieval

Add code
Dec 19, 2023
Viaarxiv icon

MotionEditor: Editing Video Motion via Content-Aware Diffusion

Add code
Nov 30, 2023
Viaarxiv icon