Alert button
Picture for Mark D. Plumbley

Mark D. Plumbley

Alert button

SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound

Add code
Bookmark button
Alert button
Apr 30, 2024
Haohe Liu, Xuenan Xu, Yi Yuan, Mengyue Wu, Wenwu Wang, Mark D. Plumbley

Viaarxiv icon

T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining

Add code
Bookmark button
Alert button
Apr 27, 2024
Yi Yuan, Zhuo Chen, Xubo Liu, Haohe Liu, Xuenan Xu, Dongya Jia, Yuanzhe Chen, Mark D. Plumbley, Wenwu Wang

Viaarxiv icon

WavCraft: Audio Editing and Generation with Natural Language Prompts

Add code
Bookmark button
Alert button
Mar 15, 2024
Jinhua Liang, Huan Zhang, Haohe Liu, Yin Cao, Qiuqiang Kong, Xubo Liu, Wenwu Wang, Mark D. Plumbley, Huy Phan, Emmanouil Benetos

Figure 1 for WavCraft: Audio Editing and Generation with Natural Language Prompts
Figure 2 for WavCraft: Audio Editing and Generation with Natural Language Prompts
Figure 3 for WavCraft: Audio Editing and Generation with Natural Language Prompts
Figure 4 for WavCraft: Audio Editing and Generation with Natural Language Prompts
Viaarxiv icon

Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift

Add code
Bookmark button
Alert button
Feb 05, 2024
Jisheng Bai, Mou Wang, Haohe Liu, Han Yin, Yafei Jia, Siwei Huang, Yutong Du, Dongzhe Zhang, Mark D. Plumbley, Dongyuan Shi, Woon-Seng Gan, Susanto Rahardja, Bin Xiang, Jianfeng Chen

Figure 1 for Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift
Figure 2 for Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift
Figure 3 for Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift
Figure 4 for Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift
Viaarxiv icon

Selective-Memory Meta-Learning with Environment Representations for Sound Event Localization and Detection

Add code
Bookmark button
Alert button
Dec 27, 2023
Jinbo Hu, Yin Cao, Ming Wu, Qiuqiang Kong, Feiran Yang, Mark D. Plumbley, Jun Yang

Viaarxiv icon

Acoustic Prompt Tuning: Empowering Large Language Models with Audition Capabilities

Add code
Bookmark button
Alert button
Nov 30, 2023
Jinhua Liang, Xubo Liu, Wenwu Wang, Mark D. Plumbley, Huy Phan, Emmanouil Benetos

Viaarxiv icon

Retrieval-Augmented Text-to-Audio Generation

Add code
Bookmark button
Alert button
Sep 14, 2023
Yi Yuan, Haohe Liu, Xubo Liu, Qiushi Huang, Mark D. Plumbley, Wenwu Wang

Figure 1 for Retrieval-Augmented Text-to-Audio Generation
Figure 2 for Retrieval-Augmented Text-to-Audio Generation
Figure 3 for Retrieval-Augmented Text-to-Audio Generation
Figure 4 for Retrieval-Augmented Text-to-Audio Generation
Viaarxiv icon

AudioSR: Versatile Audio Super-resolution at Scale

Add code
Bookmark button
Alert button
Sep 13, 2023
Haohe Liu, Ke Chen, Qiao Tian, Wenwu Wang, Mark D. Plumbley

Figure 1 for AudioSR: Versatile Audio Super-resolution at Scale
Figure 2 for AudioSR: Versatile Audio Super-resolution at Scale
Figure 3 for AudioSR: Versatile Audio Super-resolution at Scale
Figure 4 for AudioSR: Versatile Audio Super-resolution at Scale
Viaarxiv icon

META-SELD: Meta-Learning for Fast Adaptation to the new environment in Sound Event Localization and Detection

Add code
Bookmark button
Alert button
Aug 17, 2023
Jinbo Hu, Yin Cao, Ming Wu, Feiran Yang, Ziying Yu, Wenwu Wang, Mark D. Plumbley, Jun Yang

Figure 1 for META-SELD: Meta-Learning for Fast Adaptation to the new environment in Sound Event Localization and Detection
Figure 2 for META-SELD: Meta-Learning for Fast Adaptation to the new environment in Sound Event Localization and Detection
Figure 3 for META-SELD: Meta-Learning for Fast Adaptation to the new environment in Sound Event Localization and Detection
Viaarxiv icon

AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining

Add code
Bookmark button
Alert button
Aug 10, 2023
Haohe Liu, Qiao Tian, Yi Yuan, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Yuping Wang, Wenwu Wang, Yuxuan Wang, Mark D. Plumbley

Figure 1 for AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining
Figure 2 for AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining
Figure 3 for AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining
Figure 4 for AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining
Viaarxiv icon