Latent feature augmentation for chorus detection

Xingjian Du
Huidong Liang
Yuan Wan
Yuheng Lin
Ke Chen
Bilei Zhu
Zejun Ma

Publication date

December 2022

DOI

10.5281/zenodo.7316640

Publisher

ISMIR

Abstract

In this paper, we introduce LA-Chorus, a chorus detection model based on latent feature augmentation and a ResNet FPN architecture. Our contributions in LA-Chorus are three-fold. First, we propose a method for implicitly augmenting chorus data in the latent space during the training stage. Compared with augmentations on the audio surface, such as time stretching and pitch shifting, latent augmentations represent changes at a higher level in the original audio, thereby increasing the diversity and sufficiency in training. Second, we apply a Feature Pyramid Network (FPN) to generate additional embeddings from low dimension to high dimension, thereby achieving a multi-scale training paradigm. Lastly, we release Di-Chorus, a new open-source dataset o...
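To make the contrast with audio-surface augmentation concrete, the following is a minimal sketch of perturbing a feature vector in latent space rather than altering the waveform. The abstract does not specify how LA-Chorus performs its augmentation, so the additive Gaussian noise used here is an assumption chosen only for illustration, and the function name `augment_latent` and its parameters `scale` and `seed` are hypothetical.

```python
import random


def augment_latent(features, scale=0.1, seed=None):
    """Return a perturbed copy of a latent feature vector.

    Assumption: additive Gaussian noise stands in for the paper's
    latent augmentation, whose details the abstract does not give.
    """
    rng = random.Random(seed)
    # Each latent dimension is shifted by an independent noise sample;
    # the input list is left unmodified.
    return [x + rng.gauss(0.0, scale) for x in features]


# Hypothetical latent vector, e.g. the output of an encoder for one frame.
latent = [0.5, -1.0, 2.0]
augmented = augment_latent(latent, scale=0.1, seed=0)
```

In a sketch like this, the perturbed vector keeps the original chorus or non-chorus label, so each training example can yield several latent variants without re-processing the audio.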
