Haven Kim

A Computational Analysis of Lyric Similarity Perception

Apr 02, 2024
Haven Kim, Taketo Akama

In musical compositions that include vocals, lyrics significantly contribute to artistic expression. Consequently, previous studies have introduced the concept of a recommendation system that suggests lyrics similar to a user's favorites or personalized preferences, aiding in the discovery of lyrics among millions of tracks. However, many of these systems do not fully consider human perceptions of lyric similarity, primarily due to limited research in this area. To bridge this gap, we conducted a comparative analysis of computational methods for modeling lyric similarity against human perception. Results indicated that computational models based on similarities between embeddings from pre-trained BERT-based models, the audio from which the lyrics are derived, and phonetic components are indicative of perceptual lyric similarity. This finding underscores the importance of semantic, stylistic, and phonetic similarities in human perception of lyric similarity. We anticipate that our findings will enhance the development of similarity-based lyric recommendation systems by offering pseudo-labels for neural network development and introducing objective evaluation metrics.
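
As a rough illustration of the embedding-based similarity the abstract mentions, the sketch below computes cosine similarity between two lyric embedding vectors. The function name and the toy three-dimensional vectors are hypothetical; the abstract does not state which similarity function or which BERT-based model the authors used, so this is only one plausible reading.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical toy embeddings; real sentence embeddings have hundreds of dimensions.
lyric_a = [0.2, 0.7, 0.1]
lyric_b = [0.25, 0.65, 0.05]
lyric_c = [0.9, 0.05, 0.4]

# Under this toy data, lyric_a is closer to lyric_b than to lyric_c.
print(cosine_similarity(lyric_a, lyric_b) > cosine_similarity(lyric_a, lyric_c))
```

In practice the vectors would come from a pre-trained text encoder, and the resulting scores could be compared against human similarity ratings.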

K-pop Lyric Translation: Dataset, Analysis, and Neural-Modelling

Sep 20, 2023
Haven Kim, Jongmin Jung, Dasaem Jeong, Juhan Nam

Lyric translation, a field studied for over a century, is now attracting computational linguistics researchers. We identified two limitations in previous studies. First, lyric translation studies have predominantly focused on Western genres and languages, with no previous study centering on K-pop despite its popularity. Second, the field of lyric translation suffers from a lack of publicly available datasets; to the best of our knowledge, no such dataset exists. To broaden the scope of genres and languages in lyric translation studies, we introduce a novel singable lyric translation dataset, approximately 89% of which consists of K-pop song lyrics. This dataset aligns Korean and English lyrics line-by-line and section-by-section. We leveraged this dataset to unveil unique characteristics of K-pop lyric translation, distinguishing it from other extensively studied genres, and to construct a neural lyric translation model, thereby underscoring the importance of a dedicated dataset for singable lyric translations.

The Biased Journey of MSD_AUDIO.ZIP

Sep 02, 2023
Haven Kim, Keunwoo Choi, Mateusz Modrzejewski, Cynthia C. S. Liem

The equitable distribution of academic data is crucial for ensuring equal research opportunities, and ultimately further progress. Yet, due to the complexity of using the API for audio data that corresponds to the Million Song Dataset along with its misreporting (before 2016) and the discontinuation of this API (after 2016), access to this data has become restricted to those within certain affiliations that are connected peer-to-peer. In this paper, we delve into this issue, drawing insights from the experiences of 22 individuals who either attempted to access the data or played a role in its creation. With this, we hope to initiate more critical dialogue and more thoughtful consideration with regard to access privilege in the MIR community.

A Computational Evaluation Framework for Singable Lyric Translation

Aug 26, 2023
Haven Kim, Kento Watanabe, Masataka Goto, Juhan Nam

Lyric translation plays a pivotal role in amplifying the global resonance of music, bridging cultural divides, and fostering universal connections. Translating lyrics, unlike conventional translation tasks, requires a delicate balance between singability and semantics. In this paper, we present a computational framework for the quantitative evaluation of singable lyric translation, which seamlessly integrates musical, linguistic, and cultural dimensions of lyrics. Our comprehensive framework consists of four metrics that measure syllable count distance, phoneme repetition similarity, musical structure distance, and semantic similarity. To substantiate the efficacy of our framework, we collected a singable lyrics dataset, which precisely aligns English, Japanese, and Korean lyrics on a line-by-line and section-by-section basis, and conducted a comparative analysis between singable and non-singable lyrics. Our multidisciplinary approach provides insights into the key components that underlie the art of lyric translation and establishes a solid groundwork for the future of computational lyric translation assessment.

* ISMIR 2023
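
To make the first of the four metrics more concrete, here is a minimal sketch of a line-level syllable count distance. The formula used (mean absolute difference in syllable counts, normalized by the longer line) is an assumption for illustration and is not taken from the paper; the function name is hypothetical, and the syllable counts are supplied directly rather than computed from text.

```python
def syllable_count_distance(source_counts, target_counts):
    """Mean normalized difference between per-line syllable counts.

    Illustrative definition only (not the paper's formula):
    average over aligned lines of |s - t| / max(s, t).
    """
    if len(source_counts) != len(target_counts):
        raise ValueError("lyrics must be aligned line by line")
    if not source_counts:
        return 0.0
    total = 0.0
    for s, t in zip(source_counts, target_counts):
        longest = max(s, t)
        if longest > 0:  # two empty lines contribute zero distance
            total += abs(s - t) / longest
    return total / len(source_counts)


# Hypothetical syllable counts for two aligned two-line lyrics.
original = [8, 6]
translation = [4, 6]
print(syllable_count_distance(original, translation))
```

A value near zero would suggest the translation preserves the syllable structure of the original, which is one ingredient of singability under this simplified view.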

Music Playlist Title Generation Using Artist Information

Jan 14, 2023
Haven Kim, SeungHeon Doh, Junwon Lee, Juhan Nam

Automatically generating or captioning music playlist titles given a set of tracks is of significant interest in music streaming services, as customized playlists are widely used in personalized music recommendation, and well-composed text titles attract users and help them discover music. We present an encoder-decoder model that generates a playlist title from a sequence of music tracks. While previous work takes track IDs as tokenized input for playlist title generation, we use artist IDs corresponding to the tracks to mitigate the long-tail distribution of tracks in the playlist dataset. Also, we introduce a chronological data split method to deal with newly released tracks in real-world scenarios. Comparing track IDs and artist IDs as input sequences, we show that the artist-based approach significantly enhances performance in terms of word overlap, semantic relevance, and diversity.

* AAAI-23 Workshop on Creative AI Across Modalities
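
The chronological split mentioned in the abstract can be sketched as follows. This is a minimal illustration that assumes each playlist carries a creation date; the field names, the 80/20 ratio, and the function name are hypothetical, and the paper's actual split procedure is not described in the abstract.

```python
from datetime import date


def chronological_split(playlists, train_fraction=0.8):
    """Sort playlists by creation date and put the oldest share in the training set."""
    ordered = sorted(playlists, key=lambda p: p["created"])
    cut = int(len(ordered) * train_fraction)
    return ordered[:cut], ordered[cut:]


# Hypothetical playlists; "artist_ids" stands in for the model's input sequence.
playlists = [
    {"title": "late night drive", "artist_ids": [3, 7], "created": date(2021, 5, 1)},
    {"title": "morning coffee", "artist_ids": [1, 2], "created": date(2019, 1, 10)},
    {"title": "gym hits", "artist_ids": [7, 9], "created": date(2022, 3, 15)},
    {"title": "rainy sunday", "artist_ids": [2, 4], "created": date(2020, 8, 20)},
    {"title": "road trip", "artist_ids": [5, 6], "created": date(2018, 11, 2)},
]

train, test = chronological_split(playlists)
print(len(train), len(test))
```

Unlike a random split, every test playlist here is newer than every training playlist, so the test set can contain tracks released after training data was collected.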
