The majority of state-of-the-art methods for music information retrieval (MIR) tasks now utilise deep learning methods reliant on minimisation of loss functions such as cross entropy. For tasks that include framewise binary classification (e.g., onset detection, music transcription) classes are derived from output activation functions by identifying points of local maxima, or peaks. However, the operating principles behind peak picking are different to that of the cross entropy loss function, which minimises the absolute difference between the output and target values for a single frame. To generate activation functions more suited to peak-picking, we propose two versions of a new loss function that incorporates information from multiple ti...
The extraction of the beat from musical audio signals represents a foundational task in the field o...
Data cleansing is a well studied strategy for cleaning erroneous labels in datasets, which has not y...
In this paper, an abstract model to predict the genre of a music audio file is proposed (specificall...
The majority of state-of-the-art methods for music infor-mation retrieval (MIR) tasks now utili...
Despite that L1 and L2 loss functions do not represent any perceptually-related information besides ...
Automatic music transcription is considered to be one of the hardest problems in music information r...
This thesis is concerned with the problem of automatic extraction of harmonic and rhythmic informati...
The goal of music information retrieval (MIR) is to develop novel strategies and techniques for orga...
Automatic transcription of polyphonic music remains a challenging task in the field of Music Informa...
Short-term spectral features – and most notably Mel-Frequency Cepstral Coefficients (MFCCs) – are th...
Current state-of-art results in Music Information Retrieval are largely dominated by deep learning a...
Many music information retrieval tasks involve the comparison of a symbolic score representation wit...
Music source separation is a core task in music information retrieval which has seen a dramatic impr...
The Sørensen--Dice Coefficient has recently seen rising popularity as a loss function (also known as...
In music analysis, one of the most fundamental tasks is note onset detection - detecting the beginni...
The extraction of the beat from musical audio signals represents a foundational task in the field o...
Data cleansing is a well studied strategy for cleaning erroneous labels in datasets, which has not y...
In this paper, an abstract model to predict the genre of a music audio file is proposed (specificall...
The majority of state-of-the-art methods for music infor-mation retrieval (MIR) tasks now utili...
Despite that L1 and L2 loss functions do not represent any perceptually-related information besides ...
Automatic music transcription is considered to be one of the hardest problems in music information r...
This thesis is concerned with the problem of automatic extraction of harmonic and rhythmic informati...
The goal of music information retrieval (MIR) is to develop novel strategies and techniques for orga...
Automatic transcription of polyphonic music remains a challenging task in the field of Music Informa...
Short-term spectral features – and most notably Mel-Frequency Cepstral Coefficients (MFCCs) – are th...
Current state-of-art results in Music Information Retrieval are largely dominated by deep learning a...
Many music information retrieval tasks involve the comparison of a symbolic score representation wit...
Music source separation is a core task in music information retrieval which has seen a dramatic impr...
The Sørensen--Dice Coefficient has recently seen rising popularity as a loss function (also known as...
In music analysis, one of the most fundamental tasks is note onset detection - detecting the beginni...
The extraction of the beat from musical audio signals represents a foundational task in the field o...
Data cleansing is a well studied strategy for cleaning erroneous labels in datasets, which has not y...
In this paper, an abstract model to predict the genre of a music audio file is proposed (specificall...