Alert button
Picture for Haohe Liu

Haohe Liu

Alert button

SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound

Add code
Bookmark button
Alert button
Apr 30, 2024
Haohe Liu, Xuenan Xu, Yi Yuan, Mengyue Wu, Wenwu Wang, Mark D. Plumbley

Viaarxiv icon

T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining

Add code
Bookmark button
Alert button
Apr 27, 2024
Yi Yuan, Zhuo Chen, Xubo Liu, Haohe Liu, Xuenan Xu, Dongya Jia, Yuanzhe Chen, Mark D. Plumbley, Wenwu Wang

Viaarxiv icon

FlashSpeech: Efficient Zero-Shot Speech Synthesis

Add code
Bookmark button
Alert button
Apr 25, 2024
Zhen Ye, Zeqian Ju, Haohe Liu, Xu Tan, Jianyi Chen, Yiwen Lu, Peiwen Sun, Jiahao Pan, Weizhen Bian, Shulin He, Qifeng Liu, Yike Guo, Wei Xue

Viaarxiv icon

WavCraft: Audio Editing and Generation with Natural Language Prompts

Add code
Bookmark button
Alert button
Mar 15, 2024
Jinhua Liang, Huan Zhang, Haohe Liu, Yin Cao, Qiuqiang Kong, Xubo Liu, Wenwu Wang, Mark D. Plumbley, Huy Phan, Emmanouil Benetos

Figure 1 for WavCraft: Audio Editing and Generation with Natural Language Prompts
Figure 2 for WavCraft: Audio Editing and Generation with Natural Language Prompts
Figure 3 for WavCraft: Audio Editing and Generation with Natural Language Prompts
Figure 4 for WavCraft: Audio Editing and Generation with Natural Language Prompts
Viaarxiv icon

Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift

Add code
Bookmark button
Alert button
Feb 05, 2024
Jisheng Bai, Mou Wang, Haohe Liu, Han Yin, Yafei Jia, Siwei Huang, Yutong Du, Dongzhe Zhang, Mark D. Plumbley, Dongyuan Shi, Woon-Seng Gan, Susanto Rahardja, Bin Xiang, Jianfeng Chen

Figure 1 for Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift
Figure 2 for Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift
Figure 3 for Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift
Figure 4 for Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift
Viaarxiv icon

Balanced SNR-Aware Distillation for Guided Text-to-Audio Generation

Add code
Bookmark button
Alert button
Dec 25, 2023
Bingzhi Liu, Yin Cao, Haohe Liu, Yi Zhou

Viaarxiv icon

First-Shot Unsupervised Anomalous Sound Detection With Unknown Anomalies Estimated by Metadata-Assisted Audio Generation

Add code
Bookmark button
Alert button
Oct 22, 2023
Hejing Zhang, Qiaoxi Zhu, Jian Guan, Haohe Liu, Feiyang Xiao, Jiantong Tian, Xinhao Mei, Xubo Liu, Wenwu Wang

Viaarxiv icon

Synth-AC: Enhancing Audio Captioning with Synthetic Supervision

Add code
Bookmark button
Alert button
Sep 18, 2023
Feiyang Xiao, Qiaoxi Zhu, Jian Guan, Xubo Liu, Haohe Liu, Kejia Zhang, Wenwu Wang

Figure 1 for Synth-AC: Enhancing Audio Captioning with Synthetic Supervision
Figure 2 for Synth-AC: Enhancing Audio Captioning with Synthetic Supervision
Figure 3 for Synth-AC: Enhancing Audio Captioning with Synthetic Supervision
Figure 4 for Synth-AC: Enhancing Audio Captioning with Synthetic Supervision
Viaarxiv icon

Retrieval-Augmented Text-to-Audio Generation

Add code
Bookmark button
Alert button
Sep 14, 2023
Yi Yuan, Haohe Liu, Xubo Liu, Qiushi Huang, Mark D. Plumbley, Wenwu Wang

Figure 1 for Retrieval-Augmented Text-to-Audio Generation
Figure 2 for Retrieval-Augmented Text-to-Audio Generation
Figure 3 for Retrieval-Augmented Text-to-Audio Generation
Figure 4 for Retrieval-Augmented Text-to-Audio Generation
Viaarxiv icon

AudioSR: Versatile Audio Super-resolution at Scale

Add code
Bookmark button
Alert button
Sep 13, 2023
Haohe Liu, Ke Chen, Qiao Tian, Wenwu Wang, Mark D. Plumbley

Figure 1 for AudioSR: Versatile Audio Super-resolution at Scale
Figure 2 for AudioSR: Versatile Audio Super-resolution at Scale
Figure 3 for AudioSR: Versatile Audio Super-resolution at Scale
Figure 4 for AudioSR: Versatile Audio Super-resolution at Scale
Viaarxiv icon