Predominant instrument recognition in polyphonic music is addressed using the score-level fusion of two visual representations, namely, Mel-spectrogram and modgdgram. Modgdgram, a visual representation is obtained by stacking modified group delay functions of consecutive frames successively. Convolutional neural networks (CNN) with an attention mechanism, learn the distinctive local characteristics and classify the instrument to the group where it belongs. The proposed system is systematically evaluated using the IRMAS dataset with eleven classes. We train the network using fixed-length singlelabeled audio excerpts and estimate the predominant instruments from variable-length audio recordings. A wave generative adversarial network (WaveGAN)...
This paper presents a convolutional neural network (CNN) that uses input from a polyphonic pitch est...
Comunicació presentada a: 18th International Society for Music Information Retrieval Conference (ISM...
This paper presents a convolutional neural network (CNN) that uses input from a polyphonic pitch est...
Although instrument recognition has been thoroughly research, recognition in polyphonic music still ...
While the automatic recognition of musical instruments has seen significant progress, the task is st...
Identifying musical instruments in a polyphonic music recording is a difficult yet crucial problem i...
Comunicació presentada a la International Conference on Multimedia Retrieval celebrada del 6 al 9 de...
This paper presents a method for recognising musical instruments in user-generated videos. Musical i...
Comunicació presentada a la International Conference on Multimedia Retrieval celebrada del 6 al 9 de...
Automatic musical instrument recognition is an important aspect of machine listening. In this projec...
This paper addresses musical sounds recognition produced by different instrument and focus on classi...
This work aims at investigating cross-modal connections between audio and video sources in the task ...
This paper addresses musical sounds recognition produced by different instrument and focus on classi...
International audienceNowadays, deep learning is more and more used for Music Genre Classification: ...
This work aims at investigating cross-modal connections between audio and video sources in the task ...
This paper presents a convolutional neural network (CNN) that uses input from a polyphonic pitch est...
Comunicació presentada a: 18th International Society for Music Information Retrieval Conference (ISM...
This paper presents a convolutional neural network (CNN) that uses input from a polyphonic pitch est...
Although instrument recognition has been thoroughly research, recognition in polyphonic music still ...
While the automatic recognition of musical instruments has seen significant progress, the task is st...
Identifying musical instruments in a polyphonic music recording is a difficult yet crucial problem i...
Comunicació presentada a la International Conference on Multimedia Retrieval celebrada del 6 al 9 de...
This paper presents a method for recognising musical instruments in user-generated videos. Musical i...
Comunicació presentada a la International Conference on Multimedia Retrieval celebrada del 6 al 9 de...
Automatic musical instrument recognition is an important aspect of machine listening. In this projec...
This paper addresses musical sounds recognition produced by different instrument and focus on classi...
This work aims at investigating cross-modal connections between audio and video sources in the task ...
This paper addresses musical sounds recognition produced by different instrument and focus on classi...
International audienceNowadays, deep learning is more and more used for Music Genre Classification: ...
This work aims at investigating cross-modal connections between audio and video sources in the task ...
This paper presents a convolutional neural network (CNN) that uses input from a polyphonic pitch est...
Comunicació presentada a: 18th International Society for Music Information Retrieval Conference (ISM...
This paper presents a convolutional neural network (CNN) that uses input from a polyphonic pitch est...