Alert button
Picture for Xie Chen

Xie Chen

Alert button

AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding

Add code
Bookmark button
Alert button
May 06, 2024
Tao Liu, Feilong Chen, Shuai Fan, Chenpeng Du, Qi Chen, Xie Chen, Kai Yu

Viaarxiv icon

Attention-Constrained Inference for Robust Decoder-Only Text-to-Speech

Add code
Bookmark button
Alert button
Apr 30, 2024
Hankun Wang, Chenpeng Du, Yiwei Guo, Shuai Wang, Xie Chen, Kai Yu

Viaarxiv icon

GSTalker: Real-time Audio-Driven Talking Face Generation via Deformable Gaussian Splatting

Add code
Bookmark button
Alert button
Apr 29, 2024
Bo Chen, Shoukang Hu, Qi Chen, Chenpeng Du, Ran Yi, Yanmin Qian, Xie Chen

Viaarxiv icon

MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition

Add code
Bookmark button
Alert button
Apr 29, 2024
Zheng Lian, Haiyang Sun, Licai Sun, Zhuofan Wen, Siyuan Zhang, Shun Chen, Hao Gu, Jinming Zhao, Ziyang Ma, Xie Chen, Jiangyan Yi, Rui Liu, Kele Xu, Bin Liu, Erik Cambria, Guoying Zhao, Björn W. Schuller, Jianhua Tao

Viaarxiv icon

StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations

Add code
Bookmark button
Alert button
Apr 23, 2024
Sen Liu, Yiwei Guo, Xie Chen, Kai Yu

Viaarxiv icon

The X-LANCE Technical Report for Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge

Add code
Bookmark button
Alert button
Apr 10, 2024
Yiwei Guo, Chenrun Wang, Yifan Yang, Hankun Wang, Ziyang Ma, Chenpeng Du, Shuai Wang, Hanzheng Li, Shuai Fan, Hui Zhang, Xie Chen, Kai Yu

Viaarxiv icon

Quantum State Generation with Structure-Preserving Diffusion Model

Add code
Bookmark button
Alert button
Apr 09, 2024
Yuchen Zhu, Tianrong Chen, Evangelos A. Theodorou, Xie Chen, Molei Tao

Viaarxiv icon

Advanced Long-Content Speech Recognition With Factorized Neural Transducer

Add code
Bookmark button
Alert button
Mar 20, 2024
Xun Gong, Yu Wu, Jinyu Li, Shujie Liu, Rui Zhao, Xie Chen, Yanmin Qian

Figure 1 for Advanced Long-Content Speech Recognition With Factorized Neural Transducer
Figure 2 for Advanced Long-Content Speech Recognition With Factorized Neural Transducer
Figure 3 for Advanced Long-Content Speech Recognition With Factorized Neural Transducer
Figure 4 for Advanced Long-Content Speech Recognition With Factorized Neural Transducer
Viaarxiv icon

An Embarrassingly Simple Approach for LLM with Strong ASR Capacity

Add code
Bookmark button
Alert button
Feb 13, 2024
Ziyang Ma, Guanrou Yang, Yifan Yang, Zhifu Gao, Jiaming Wang, Zhihao Du, Fan Yu, Qian Chen, Siqi Zheng, Shiliang Zhang, Xie Chen

Viaarxiv icon

BAT: Learning to Reason about Spatial Sounds with Large Language Models

Add code
Bookmark button
Alert button
Feb 02, 2024
Zhisheng Zheng, Puyuan Peng, Ziyang Ma, Xie Chen, Eunsol Choi, David Harwath

Viaarxiv icon