Heterogeneous Target Speech Separation

Tzinis, Efthymios
Wichern, Gordon
Subramanian, Aswin
Smaragdis, Paris
Roux, Jonathan Le

Publication date

April 2022

Abstract

We introduce a new paradigm for single-channel target source separation where the sources of interest can be distinguished using non-mutually exclusive concepts (e.g., loudness, gender, language, spatial location, etc). Our proposed heterogeneous separation framework can seamlessly leverage datasets with large distribution shifts and learn cross-domain representations under a variety of concepts used as conditioning. Our experiments show that training separation models with heterogeneous conditions facilitates the generalization to new concepts with unseen out-of-domain data while also performing substantially higher than single-domain specialist models. Notably, such training leads to more robust learning of new harder source separation di...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Heterogeneous Target Speech Separation

Abstract

Extracted data

Heterogeneous Target Speech Separation

Abstract

Extracted data

Related items

Related items