Picture for Leda Sari

Leda Sari

Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of A Multilingual ASR Model

Add code
Sep 22, 2023
Figure 1 for Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of A Multilingual ASR Model
Figure 2 for Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of A Multilingual ASR Model
Figure 3 for Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of A Multilingual ASR Model
Figure 4 for Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of A Multilingual ASR Model
Viaarxiv icon

Augmenting text for spoken language understanding with Large Language Models

Add code
Sep 17, 2023
Figure 1 for Augmenting text for spoken language understanding with Large Language Models
Figure 2 for Augmenting text for spoken language understanding with Large Language Models
Figure 3 for Augmenting text for spoken language understanding with Large Language Models
Figure 4 for Augmenting text for spoken language understanding with Large Language Models
Viaarxiv icon

Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale

Add code
Jun 23, 2023
Figure 1 for Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
Figure 2 for Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
Figure 3 for Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
Figure 4 for Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
Viaarxiv icon

Self-Supervised Representations for Singing Voice Conversion

Add code
Mar 21, 2023
Figure 1 for Self-Supervised Representations for Singing Voice Conversion
Figure 2 for Self-Supervised Representations for Singing Voice Conversion
Figure 3 for Self-Supervised Representations for Singing Voice Conversion
Figure 4 for Self-Supervised Representations for Singing Voice Conversion
Viaarxiv icon

Biased Self-supervised learning for ASR

Add code
Nov 04, 2022
Figure 1 for Biased Self-supervised learning for ASR
Figure 2 for Biased Self-supervised learning for ASR
Figure 3 for Biased Self-supervised learning for ASR
Viaarxiv icon

Ego4D: Around the World in 3,000 Hours of Egocentric Video

Add code
Oct 13, 2021
Figure 1 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 2 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 3 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 4 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Viaarxiv icon

Identify Speakers in Cocktail Parties with End-to-End Attention

Add code
May 22, 2020
Figure 1 for Identify Speakers in Cocktail Parties with End-to-End Attention
Figure 2 for Identify Speakers in Cocktail Parties with End-to-End Attention
Figure 3 for Identify Speakers in Cocktail Parties with End-to-End Attention
Figure 4 for Identify Speakers in Cocktail Parties with End-to-End Attention
Viaarxiv icon