This paper targets on a generalized vocal mode classifier (speech/singing) that works on audio data from an arbitrary data source. However, previous studies on sound classification are commonly based on cross-validation using a single dataset, without considering the cases that training and testing data are recorded in mismatched condition. Experiments revealed a big difference between homogeneous recognition scenario and heterogeneous recognition scenario, using a new dataset TUT-vocal-2016. In the homogeneous recognition scenario, the classification accuracy using cross-validation on TUT-vocal-2016 was 95.5%. In heterogeneous recognition scenario, seven existing datasets were used as training material and TUT-vocal-2016 was used for testi...
In this work, normalization techniques in the acoustic feature space are studied which improve the r...
which permits unrestricted use, distribution, and reproduction in any medium, provided the original ...
Abstract—Song and music discrimination play a significant role in multimedia applications such as ge...
In this paper we describe our recent efforts to improve acoustic-phonetic modeling by developing set...
This thesis focuses on presenting a technique on improving current vocal detection methods. One of t...
Speech recognition in singing is a task that has not been widely researched so far. Singing possesse...
The characteristics of vocal segments in music are an important cue for automatic, content-based mus...
Automatic singing detection and singing phoneme recognition are two MIR research topics that have ga...
Automatic language identification for singing is a topic that has not received much attention for th...
In this thesis the automatic recognition of groups in singing recordings is presented. The classific...
Recognizing and controlling vocal registers during singing is a difficult task for beginner vocalist...
International audienceEnhancing specific parts of a polyphonic music signal is believed to be a prom...
International audienceEnhancing specific parts of a polyphonic music signal is believed to be a prom...
Several approaches have previously been taken to the problem of discriminating between speech and mu...
Abstract — Singing voice detection is essential for content-based applications such as those involvi...
In this work, normalization techniques in the acoustic feature space are studied which improve the r...
which permits unrestricted use, distribution, and reproduction in any medium, provided the original ...
Abstract—Song and music discrimination play a significant role in multimedia applications such as ge...
In this paper we describe our recent efforts to improve acoustic-phonetic modeling by developing set...
This thesis focuses on presenting a technique on improving current vocal detection methods. One of t...
Speech recognition in singing is a task that has not been widely researched so far. Singing possesse...
The characteristics of vocal segments in music are an important cue for automatic, content-based mus...
Automatic singing detection and singing phoneme recognition are two MIR research topics that have ga...
Automatic language identification for singing is a topic that has not received much attention for th...
In this thesis the automatic recognition of groups in singing recordings is presented. The classific...
Recognizing and controlling vocal registers during singing is a difficult task for beginner vocalist...
International audienceEnhancing specific parts of a polyphonic music signal is believed to be a prom...
International audienceEnhancing specific parts of a polyphonic music signal is believed to be a prom...
Several approaches have previously been taken to the problem of discriminating between speech and mu...
Abstract — Singing voice detection is essential for content-based applications such as those involvi...
In this work, normalization techniques in the acoustic feature space are studied which improve the r...
which permits unrestricted use, distribution, and reproduction in any medium, provided the original ...
Abstract—Song and music discrimination play a significant role in multimedia applications such as ge...