In this thesis, a low-latency variant of the speaker-independent deep clustering method is proposed for speaker separation. Compared to the offline deep clustering separation system, bidirectional long short-term memory networks (BLSTMs) are replaced with unidirectional long short-term memory networks (LSTMs), because a BLSTM must process the data in both the forward and backward directions, so its outputs depend on future context, which makes online processing impossible. In addition, the 32 ms synthesis window is replaced with an 8 ms window to suit low-latency applications such as hearing aids, since the algorithmic latency depends on the length of the synthesis window. Furthermore, the beginning of the audio mixture,...
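The latency claim in the abstract above can be made concrete with a minimal sketch (the function name and the 16 kHz sample rate are illustrative assumptions, not from the source): in short-time synthesis via overlap-add, the algorithmic latency is bounded below by the synthesis window length, so shrinking the window from 32 ms to 8 ms shrinks that bound accordingly.

```python
def algorithmic_latency_ms(window_samples: int, sample_rate_hz: int) -> float:
    """Latency contributed by an overlap-add synthesis stage: a full
    synthesis window must be buffered before any output sample can be
    produced, so the window length sets the algorithmic latency floor."""
    return 1000.0 * window_samples / sample_rate_hz

# At an assumed 16 kHz sample rate:
# a 512-sample window corresponds to 32 ms, a 128-sample window to 8 ms.
print(algorithmic_latency_ms(512, 16000))  # 32.0
print(algorithmic_latency_ms(128, 16000))  # 8.0
```

Note that this counts only the synthesis-window contribution; network look-ahead (absent for a causal LSTM, unbounded for a BLSTM) would add on top of it.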
In this paper, we carry out an analysis on the use of speech separation guided diarization (SSGD) in...
Speaker diarisation, the task of answering "who spoke when?", is often considered to consist of thre...
Many speech technology applications expect speech input from a single speaker and usually fail when ...
Deep clustering technique is a state-of-the-art deep learning-based method for multi-talker speaker-...
Neural network (NN) and clustering are the two commonly used methods for speech separatio...
Time-frequency masking or spectrum prediction computed via short symmetric windows is commonly used...
© 2018 International Speech Communication Association. All rights reserved. With deep learning appro...
Speech source separation aims to estimate one or more individual sources from mixtures of multiple s...
This paper proposes an autoregressive approach to harness the power of deep learning for multi-speak...
This paper introduces an online speaker diarization system that can handle long-time audio with low ...
Despite the recent progress of automatic speech recognition (ASR) driven by deep learning, conversat...
The current monaural state-of-the-art tools for speech separation rely on supervised learning. Thi...
Speech separation is the task of separating the target speech from the interference in the backgroun...