Alert button
Picture for Toshiyuki Kumakura

Toshiyuki Kumakura

Alert button

SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization

Add code
Bookmark button
Alert button
May 16, 2022
Yuhta Takida, Takashi Shibuya, WeiHsiang Liao, Chieh-Hsin Lai, Junki Ohmura, Toshimitsu Uesaka, Naoki Murata, Shusuke Takahashi, Toshiyuki Kumakura, Yuki Mitsufuji

Figure 1 for SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization
Figure 2 for SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization
Figure 3 for SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization
Figure 4 for SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization
Viaarxiv icon

Polyphone disambiguation and accent prediction using pre-trained language models in Japanese TTS front-end

Add code
Bookmark button
Alert button
Jan 24, 2022
Rem Hida, Masaki Hamada, Chie Kamada, Emiru Tsunoo, Toshiyuki Sekiya, Toshiyuki Kumakura

Figure 1 for Polyphone disambiguation and accent prediction using pre-trained language models in Japanese TTS front-end
Figure 2 for Polyphone disambiguation and accent prediction using pre-trained language models in Japanese TTS front-end
Figure 3 for Polyphone disambiguation and accent prediction using pre-trained language models in Japanese TTS front-end
Figure 4 for Polyphone disambiguation and accent prediction using pre-trained language models in Japanese TTS front-end
Viaarxiv icon

Towards Online End-to-end Transformer Automatic Speech Recognition

Add code
Bookmark button
Alert button
Oct 25, 2019
Emiru Tsunoo, Yosuke Kashiwagi, Toshiyuki Kumakura, Shinji Watanabe

Figure 1 for Towards Online End-to-end Transformer Automatic Speech Recognition
Figure 2 for Towards Online End-to-end Transformer Automatic Speech Recognition
Figure 3 for Towards Online End-to-end Transformer Automatic Speech Recognition
Figure 4 for Towards Online End-to-end Transformer Automatic Speech Recognition
Viaarxiv icon

Transformer ASR with Contextual Block Processing

Add code
Bookmark button
Alert button
Oct 16, 2019
Emiru Tsunoo, Yosuke Kashiwagi, Toshiyuki Kumakura, Shinji Watanabe

Figure 1 for Transformer ASR with Contextual Block Processing
Figure 2 for Transformer ASR with Contextual Block Processing
Figure 3 for Transformer ASR with Contextual Block Processing
Figure 4 for Transformer ASR with Contextual Block Processing
Viaarxiv icon

End-to-end Adaptation with Backpropagation through WFST for On-device Speech Recognition System

Add code
Bookmark button
Alert button
May 17, 2019
Emiru Tsunoo, Yosuke Kashiwagi, Satoshi Asakawa, Toshiyuki Kumakura

Figure 1 for End-to-end Adaptation with Backpropagation through WFST for On-device Speech Recognition System
Figure 2 for End-to-end Adaptation with Backpropagation through WFST for On-device Speech Recognition System
Figure 3 for End-to-end Adaptation with Backpropagation through WFST for On-device Speech Recognition System
Figure 4 for End-to-end Adaptation with Backpropagation through WFST for On-device Speech Recognition System
Viaarxiv icon