Alert button
Picture for Ju-ho Kim

Ju-ho Kim

Alert button

HM-Conformer: A Conformer-based audio deepfake detection system with hierarchical pooling and multi-level classification token aggregation methods

Add code
Bookmark button
Alert button
Sep 15, 2023
Hyun-seo Shin, Jungwoo Heo, Ju-ho Kim, Chan-yeong Lim, Wonbin Kim, Ha-Jin Yu

Figure 1 for HM-Conformer: A Conformer-based audio deepfake detection system with hierarchical pooling and multi-level classification token aggregation methods
Figure 2 for HM-Conformer: A Conformer-based audio deepfake detection system with hierarchical pooling and multi-level classification token aggregation methods
Figure 3 for HM-Conformer: A Conformer-based audio deepfake detection system with hierarchical pooling and multi-level classification token aggregation methods
Figure 4 for HM-Conformer: A Conformer-based audio deepfake detection system with hierarchical pooling and multi-level classification token aggregation methods
Viaarxiv icon

Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Probabilistic Models

Add code
Bookmark button
Alert button
Sep 14, 2023
Ju-ho Kim, Jungwoo Heo, Hyun-seo Shin, Chan-yeong Lim, Ha-Jin Yu

Figure 1 for Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Probabilistic Models
Figure 2 for Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Probabilistic Models
Figure 3 for Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Probabilistic Models
Figure 4 for Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Probabilistic Models
Viaarxiv icon

PAS: Partial Additive Speech Data Augmentation Method for Noise Robust Speaker Verification

Add code
Bookmark button
Alert button
Jul 20, 2023
Wonbin Kim, Hyun-seo Shin, Ju-ho Kim, Jungwoo Heo, Chan-yeong Lim, Ha-Jin Yu

Figure 1 for PAS: Partial Additive Speech Data Augmentation Method for Noise Robust Speaker Verification
Figure 2 for PAS: Partial Additive Speech Data Augmentation Method for Noise Robust Speaker Verification
Figure 3 for PAS: Partial Additive Speech Data Augmentation Method for Noise Robust Speaker Verification
Viaarxiv icon

One-Step Knowledge Distillation and Fine-Tuning in Using Large Pre-Trained Self-Supervised Learning Models for Speaker Verification

Add code
Bookmark button
Alert button
Jun 08, 2023
Jungwoo Heo, Chan-yeong Lim, Ju-ho Kim, Hyun-seo Shin, Ha-Jin Yu

Figure 1 for One-Step Knowledge Distillation and Fine-Tuning in Using Large Pre-Trained Self-Supervised Learning Models for Speaker Verification
Figure 2 for One-Step Knowledge Distillation and Fine-Tuning in Using Large Pre-Trained Self-Supervised Learning Models for Speaker Verification
Figure 3 for One-Step Knowledge Distillation and Fine-Tuning in Using Large Pre-Trained Self-Supervised Learning Models for Speaker Verification
Figure 4 for One-Step Knowledge Distillation and Fine-Tuning in Using Large Pre-Trained Self-Supervised Learning Models for Speaker Verification
Viaarxiv icon

Integrated Parameter-Efficient Tuning for General-Purpose Audio Models

Add code
Bookmark button
Alert button
Nov 04, 2022
Ju-ho Kim, Jungwoo Heo, Hyun-seo Shin, Chan-yeong Lim, Ha-Jin Yu

Figure 1 for Integrated Parameter-Efficient Tuning for General-Purpose Audio Models
Figure 2 for Integrated Parameter-Efficient Tuning for General-Purpose Audio Models
Figure 3 for Integrated Parameter-Efficient Tuning for General-Purpose Audio Models
Figure 4 for Integrated Parameter-Efficient Tuning for General-Purpose Audio Models
Viaarxiv icon

Convolution channel separation and frequency sub-bands aggregation for music genre classification

Add code
Bookmark button
Alert button
Nov 03, 2022
Jungwoo Heo, Hyun-seo Shin, Ju-ho Kim, Chan-yeong Lim, Ha-Jin Yu

Figure 1 for Convolution channel separation and frequency sub-bands aggregation for music genre classification
Figure 2 for Convolution channel separation and frequency sub-bands aggregation for music genre classification
Figure 3 for Convolution channel separation and frequency sub-bands aggregation for music genre classification
Figure 4 for Convolution channel separation and frequency sub-bands aggregation for music genre classification
Viaarxiv icon

Two Methods for Spoofing-Aware Speaker Verification: Multi-Layer Perceptron Score Fusion Model and Integrated Embedding Projector

Add code
Bookmark button
Alert button
Jun 28, 2022
Jungwoo Heo, Ju-ho Kim, Hyun-seo Shin

Figure 1 for Two Methods for Spoofing-Aware Speaker Verification: Multi-Layer Perceptron Score Fusion Model and Integrated Embedding Projector
Figure 2 for Two Methods for Spoofing-Aware Speaker Verification: Multi-Layer Perceptron Score Fusion Model and Integrated Embedding Projector
Figure 3 for Two Methods for Spoofing-Aware Speaker Verification: Multi-Layer Perceptron Score Fusion Model and Integrated Embedding Projector
Figure 4 for Two Methods for Spoofing-Aware Speaker Verification: Multi-Layer Perceptron Score Fusion Model and Integrated Embedding Projector
Viaarxiv icon

Extended U-Net for Speaker Verification in Noisy Environments

Add code
Bookmark button
Alert button
Jun 27, 2022
Ju-ho Kim, Jungwoo Heo, Hye-jin Shim, Ha-Jin Yu

Figure 1 for Extended U-Net for Speaker Verification in Noisy Environments
Figure 2 for Extended U-Net for Speaker Verification in Noisy Environments
Figure 3 for Extended U-Net for Speaker Verification in Noisy Environments
Figure 4 for Extended U-Net for Speaker Verification in Noisy Environments
Viaarxiv icon

RawNeXt: Speaker verification system for variable-duration utterances with deep layer aggregation and extended dynamic scaling policies

Add code
Bookmark button
Alert button
Dec 15, 2021
Ju-ho Kim, Hye-jin Shim, Jungwoo Heo, Ha-Jin Yu

Figure 1 for RawNeXt: Speaker verification system for variable-duration utterances with deep layer aggregation and extended dynamic scaling policies
Figure 2 for RawNeXt: Speaker verification system for variable-duration utterances with deep layer aggregation and extended dynamic scaling policies
Figure 3 for RawNeXt: Speaker verification system for variable-duration utterances with deep layer aggregation and extended dynamic scaling policies
Figure 4 for RawNeXt: Speaker verification system for variable-duration utterances with deep layer aggregation and extended dynamic scaling policies
Viaarxiv icon

Attentive Max Feature Map for Acoustic Scene Classification with Joint Learning considering the Abstraction of Classes

Add code
Bookmark button
Alert button
Apr 15, 2021
Hye-jin Shim, Ju-ho Kim, Jee-weon Jung, Ha-Jin Yu

Figure 1 for Attentive Max Feature Map for Acoustic Scene Classification with Joint Learning considering the Abstraction of Classes
Figure 2 for Attentive Max Feature Map for Acoustic Scene Classification with Joint Learning considering the Abstraction of Classes
Figure 3 for Attentive Max Feature Map for Acoustic Scene Classification with Joint Learning considering the Abstraction of Classes
Figure 4 for Attentive Max Feature Map for Acoustic Scene Classification with Joint Learning considering the Abstraction of Classes
Viaarxiv icon