Neural networks, and especially long short-term memory networks (LSTM), have become increasingly popular for sequence modelling, be it in text, speech, or music. In this paper, we investigate the predictive power of simple LSTM networks for polyphonic MIDI sequences, using an empirical approach. Such systems can then be used as a music language model which, combined with an acoustic model, can improve automatic music transcription (AMT) performance. As a first step, we experiment with synthetic MIDI data, and we compare the results obtained in various settings, throughout the training process. In particular, we compare the use of a fixed sample rate against a musically-relevant sample rate. We test this system both on synthetic and real MID...
In music there are a set of rules a melody must follow in order to sound pleasant to the listener. I...
Automatic transcription of polyphonic music remains a challenging task in the field of Music Informa...
A folk-rnn model is a long short-term memory network (LSTM) that generates music transcriptions. We ...
In this paper, we investigate the use of Music Language Models (MLMs) for improving Automatic Music ...
We investigate the problem of incorporating higher-level symbolic score-like information into Automa...
We present a supervised neural network model for polyphonic piano music transcription. The architect...
In this paper, we introduce a method for converting an input probabilistic piano roll (the output of...
We present a neural network model for polyphonic music transcription. The architecture of the propos...
In this paper, we introduce new methods and discuss results of text-based LSTM (Long Short-Term Memo...
Humans are able to learn and compose complex, yet beautiful, pieces of music as seen in e.g. the hig...
In this paper we take a connectionist machine learning approach to the problem of metre perception a...
This paper proposes a note-based music language model (MLM) for improving note-level polyphonic pian...
In this paper, we introduce a method for converting an input probabilistic piano roll (the output of...
This paper explores sequential modelling of polyphonic music with deep neural networks. While recent...
The automatic composition of music with long-term structure is a central problem in music generation...
In music there are a set of rules a melody must follow in order to sound pleasant to the listener. I...
Automatic transcription of polyphonic music remains a challenging task in the field of Music Informa...
A folk-rnn model is a long short-term memory network (LSTM) that generates music transcriptions. We ...
In this paper, we investigate the use of Music Language Models (MLMs) for improving Automatic Music ...
We investigate the problem of incorporating higher-level symbolic score-like information into Automa...
We present a supervised neural network model for polyphonic piano music transcription. The architect...
In this paper, we introduce a method for converting an input probabilistic piano roll (the output of...
We present a neural network model for polyphonic music transcription. The architecture of the propos...
In this paper, we introduce new methods and discuss results of text-based LSTM (Long Short-Term Memo...
Humans are able to learn and compose complex, yet beautiful, pieces of music as seen in e.g. the hig...
In this paper we take a connectionist machine learning approach to the problem of metre perception a...
This paper proposes a note-based music language model (MLM) for improving note-level polyphonic pian...
In this paper, we introduce a method for converting an input probabilistic piano roll (the output of...
This paper explores sequential modelling of polyphonic music with deep neural networks. While recent...
The automatic composition of music with long-term structure is a central problem in music generation...
In music there are a set of rules a melody must follow in order to sound pleasant to the listener. I...
Automatic transcription of polyphonic music remains a challenging task in the field of Music Informa...
A folk-rnn model is a long short-term memory network (LSTM) that generates music transcriptions. We ...