As part of the research into content-based music information retrieval (MIR), this paper presents a preliminary attempt to automatically identify the language sung in popular music recordings. It is assumed that each language has its own set of constraints that specify the sequence of basic linguistic events when lyrics are sung. Thus, the acoustic structure of individual languages may be characterized by statistically modeling those constraints. To achieve this, the proposed method employs vector clustering to convert a singing signal from its spectrum-based feature representation into a sequence of smaller basic phonological units. The dynamic characteristics of the sequence are then analyzed using bigram language models. As vector cluste...
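The pipeline described above (vector clustering into a token sequence, then bigram modeling of that sequence) can be illustrated with a minimal sketch. It is not the paper's implementation: the fixed `codebook`, the squared Euclidean distance, the add-one smoothing, the toy two-dimensional feature frames, and all function names are assumptions made for illustration only.

```python
import math
from collections import Counter


def quantize(frames, codebook):
    """Map each feature vector to the index of its nearest codeword.

    Assumption: squared Euclidean distance to a precomputed codebook.
    """
    def nearest(frame):
        return min(
            range(len(codebook)),
            key=lambda k: sum((a - b) ** 2 for a, b in zip(frame, codebook[k])),
        )
    return [nearest(f) for f in frames]


def train_bigram(sequences, vocab_size):
    """Return a table of add-one-smoothed log P(next | prev)."""
    counts = Counter()
    for seq in sequences:
        counts.update(zip(seq, seq[1:]))
    table = []
    for prev in range(vocab_size):
        total = sum(counts[(prev, nxt)] for nxt in range(vocab_size)) + vocab_size
        table.append(
            [math.log((counts[(prev, nxt)] + 1) / total) for nxt in range(vocab_size)]
        )
    return table


def log_likelihood(tokens, table):
    """Sum the bigram log-probabilities along a token sequence."""
    return sum(table[p][n] for p, n in zip(tokens, tokens[1:]))


def identify(frames, codebook, models):
    """Pick the language whose bigram model scores the token sequence highest."""
    tokens = quantize(frames, codebook)
    return max(models, key=lambda lang: log_likelihood(tokens, models[lang]))


# Toy data (hypothetical): two codewords and two "languages" whose token
# sequences differ only in their transition patterns.
codebook = [(0.0, 0.0), (1.0, 1.0)]
models = {
    "A": train_bigram([[0, 1, 0, 1, 0, 1, 0, 1]], vocab_size=2),  # alternating tokens
    "B": train_bigram([[0, 0, 0, 0, 1, 1, 1, 1]], vocab_size=2),  # repeated tokens
}

alternating = [(0.1, 0.0), (0.9, 1.1), (0.2, 0.1), (1.0, 0.8)]
repeating = [(0.1, 0.0), (0.0, 0.2), (0.9, 1.0), (1.1, 0.9)]
print(identify(alternating, codebook, models))
print(identify(repeating, codebook, models))
```

In this sketch the two toy languages use the same two tokens and are told apart only by their transition statistics, which is the kind of sequential constraint a bigram model is meant to capture. In practice the codebook would be learned from training data rather than fixed by hand.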
In the past decades, many successful approaches for language identification have been published. How...
The purpose of this study was to investigate how metadata from Spotify could be used to identify the...
We propose a multimodal singing language classification model that uses both audio content and textu...
Sung language recognition relies on both effective feature extraction and acoustic modeling. In this...
Automatic language identification for singing is a topic that has not received much attention for th...
This paper studies the influence of n-gram language models in the recognition of sung phonemes and w...
Automatic singing detection and singing phoneme recognition are two MIR research topics that have ga...
This dissertation is concerned with the problem of describing the singing voice within the audio sig...
This paper investigates the problem of retrieving Karaoke music by singing. The Karaoke mu...
This paper presents an effective technique for automatically clustering undocumented music recording...
We propose a statistical learning approach for the automatic detection of vocal regions in a polypho...
In this paper, we focus on singing techniques within the scope of music information retrieval resear...
With the high increase in the availability of digital music, it has become of interest to automatica...
In the field of sound and music computing, only a handful of studies are concerned with the pursuit ...
This thesis proposes signal processing methods for analysis of singing voice audio signals, with the...