Alert button
Picture for Wenwu Wang

Wenwu Wang

Alert button

SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound

Add code
Bookmark button
Alert button
Apr 30, 2024
Haohe Liu, Xuenan Xu, Yi Yuan, Mengyue Wu, Wenwu Wang, Mark D. Plumbley

Viaarxiv icon

ComposerX: Multi-Agent Symbolic Music Composition with LLMs

Add code
Bookmark button
Alert button
Apr 30, 2024
Qixin Deng, Qikai Yang, Ruibin Yuan, Yipeng Huang, Yi Wang, Xubo Liu, Zeyue Tian, Jiahao Pan, Ge Zhang, Hanfeng Lin, Yizhi Li, Yinghao Ma, Jie Fu, Chenghua Lin, Emmanouil Benetos, Wenwu Wang, Guangyu Xia, Wei Xue, Yike Guo

Viaarxiv icon

T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining

Add code
Bookmark button
Alert button
Apr 27, 2024
Yi Yuan, Zhuo Chen, Xubo Liu, Haohe Liu, Xuenan Xu, Dongya Jia, Yuanzhe Chen, Mark D. Plumbley, Wenwu Wang

Viaarxiv icon

WavCraft: Audio Editing and Generation with Natural Language Prompts

Add code
Bookmark button
Alert button
Mar 15, 2024
Jinhua Liang, Huan Zhang, Haohe Liu, Yin Cao, Qiuqiang Kong, Xubo Liu, Wenwu Wang, Mark D. Plumbley, Huy Phan, Emmanouil Benetos

Figure 1 for WavCraft: Audio Editing and Generation with Natural Language Prompts
Figure 2 for WavCraft: Audio Editing and Generation with Natural Language Prompts
Figure 3 for WavCraft: Audio Editing and Generation with Natural Language Prompts
Figure 4 for WavCraft: Audio Editing and Generation with Natural Language Prompts
Viaarxiv icon

Multi-level graph learning for audio event classification and human-perceived annoyance rating prediction

Add code
Bookmark button
Alert button
Dec 15, 2023
Yuanbo Hou, Qiaoqiao Ren, Siyang Song, Yuxin Song, Wenwu Wang, Dick Botteldooren

Viaarxiv icon

Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection

Add code
Bookmark button
Alert button
Dec 14, 2023
Davide Berghi, Peipei Wu, Jinzheng Zhao, Wenwu Wang, Philip J. B. Jackson

Figure 1 for Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection
Figure 2 for Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection
Figure 3 for Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection
Viaarxiv icon

Acoustic Prompt Tuning: Empowering Large Language Models with Audition Capabilities

Add code
Bookmark button
Alert button
Nov 30, 2023
Jinhua Liang, Xubo Liu, Wenwu Wang, Mark D. Plumbley, Huy Phan, Emmanouil Benetos

Viaarxiv icon

Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions

Add code
Bookmark button
Alert button
Oct 23, 2023
Jinzheng Zhao, Yong Xu, Xinyuan Qian, Davide Berghi, Peipei Wu, Meng Cui, Jianyuan Sun, Philip J. B. Jackson, Wenwu Wang

Figure 1 for Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Figure 2 for Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Figure 3 for Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Figure 4 for Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Viaarxiv icon

First-Shot Unsupervised Anomalous Sound Detection With Unknown Anomalies Estimated by Metadata-Assisted Audio Generation

Add code
Bookmark button
Alert button
Oct 22, 2023
Hejing Zhang, Qiaoxi Zhu, Jian Guan, Haohe Liu, Feiyang Xiao, Jiantong Tian, Xinhao Mei, Xubo Liu, Wenwu Wang

Viaarxiv icon

Transformer-based Autoencoder with ID Constraint for Unsupervised Anomalous Sound Detection

Add code
Bookmark button
Alert button
Oct 13, 2023
Jian Guan, Youde Liu, Qiuqiang Kong, Feiyang Xiao, Qiaoxi Zhu, Jiantong Tian, Wenwu Wang

Viaarxiv icon