Picture for Kaitao Song

Kaitao Song

Can Graph Learning Improve Task Planning?

Add code
May 29, 2024
Viaarxiv icon

NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models

Add code
Mar 05, 2024
Figure 1 for NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
Figure 2 for NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
Figure 3 for NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
Figure 4 for NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
Viaarxiv icon

EASYTOOL: Enhancing LLM-based Agents with Concise Tool Instruction

Add code
Jan 11, 2024
Viaarxiv icon

EEGFormer: Towards Transferable and Interpretable Large-Scale EEG Foundation Model

Add code
Jan 11, 2024
Viaarxiv icon

TaskBench: Benchmarking Large Language Models for Task Automation

Add code
Nov 30, 2023
Viaarxiv icon

MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models

Add code
Oct 25, 2023
Figure 1 for MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models
Figure 2 for MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models
Figure 3 for MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models
Figure 4 for MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models
Viaarxiv icon

Learning To Teach Large Language Models Logical Reasoning

Add code
Oct 13, 2023
Viaarxiv icon

Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers

Add code
Sep 15, 2023
Figure 1 for Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
Figure 2 for Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
Figure 3 for Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
Figure 4 for Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
Viaarxiv icon

PromptTTS 2: Describing and Generating Voices with Text Prompt

Add code
Sep 05, 2023
Figure 1 for PromptTTS 2: Describing and Generating Voices with Text Prompt
Figure 2 for PromptTTS 2: Describing and Generating Voices with Text Prompt
Figure 3 for PromptTTS 2: Describing and Generating Voices with Text Prompt
Figure 4 for PromptTTS 2: Describing and Generating Voices with Text Prompt
Viaarxiv icon

End-to-End Word-Level Pronunciation Assessment with MASK Pre-training

Add code
Jun 05, 2023
Figure 1 for End-to-End Word-Level Pronunciation Assessment with MASK Pre-training
Figure 2 for End-to-End Word-Level Pronunciation Assessment with MASK Pre-training
Figure 3 for End-to-End Word-Level Pronunciation Assessment with MASK Pre-training
Figure 4 for End-to-End Word-Level Pronunciation Assessment with MASK Pre-training
Viaarxiv icon