This paper presents a new speaker change detection system based on Long Short-Term Memory (LSTM) neural networks using acoustic data and linguistic content. Language modelling is combined with two different Joint Factor Analysis (JFA) acoustic approaches: i-vectors and speaker factors. Both of them are compared with a baseline algorithm that uses cosine distance to detect speaker turn changes. LSTM neural networks with both linguistic and acoustic features have been able to produce a robust speaker segmentation. The experimental results show that our proposal clearly outperforms the baseline system.Peer Reviewe
Abstract. Speaker change detection is important for automatic segmentation of multispeaker speech da...
© 2018 International Speech Communication Association. All rights reserved. With deep learning appro...
The detection of overlapping speech segments is of key importance in speech applications involving a...
This paper presents a new speaker change detection system based on Long Short-Term Memory (L...
Speaker adaptation of deep neural networks (DNNs) based acoustic models is still a challenging area ...
Speaker adaptation of deep neural networks (DNNs) based acoustic models is still a challenging area ...
Long Short-Term Memory (LSTM) is a recurrent neural net-work (RNN) architecture specializing in mode...
In this paper we propose a method for speaker change detection using features of excitation source o...
In this paper a novel speaker recognition system is introduced. Automated speaker recognition has be...
This work explores the use of Long Short-Term Memory (LSTM) recurrent neural networks (RNNs) for aut...
We investigate classification of non-linguistic vocalisations with a novel audiovisual approach and ...
© 2016 IEEE. In recent years, research on language modeling for speech recognition has increasingly ...
Automatic speech processing is an active field of research since the 1950s. Within this field the ma...
ABSTRAKSI: Segmentasi sinyal suara adalah upaya untuk melakukan pengklasifikasian sinyal suara berda...
International audienceThis paper investigates speaker adaptation techniques for bidirectional long ...
Abstract. Speaker change detection is important for automatic segmentation of multispeaker speech da...
© 2018 International Speech Communication Association. All rights reserved. With deep learning appro...
The detection of overlapping speech segments is of key importance in speech applications involving a...
This paper presents a new speaker change detection system based on Long Short-Term Memory (L...
Speaker adaptation of deep neural networks (DNNs) based acoustic models is still a challenging area ...
Speaker adaptation of deep neural networks (DNNs) based acoustic models is still a challenging area ...
Long Short-Term Memory (LSTM) is a recurrent neural net-work (RNN) architecture specializing in mode...
In this paper we propose a method for speaker change detection using features of excitation source o...
In this paper a novel speaker recognition system is introduced. Automated speaker recognition has be...
This work explores the use of Long Short-Term Memory (LSTM) recurrent neural networks (RNNs) for aut...
We investigate classification of non-linguistic vocalisations with a novel audiovisual approach and ...
© 2016 IEEE. In recent years, research on language modeling for speech recognition has increasingly ...
Automatic speech processing is an active field of research since the 1950s. Within this field the ma...
ABSTRAKSI: Segmentasi sinyal suara adalah upaya untuk melakukan pengklasifikasian sinyal suara berda...
International audienceThis paper investigates speaker adaptation techniques for bidirectional long ...
Abstract. Speaker change detection is important for automatic segmentation of multispeaker speech da...
© 2018 International Speech Communication Association. All rights reserved. With deep learning appro...
The detection of overlapping speech segments is of key importance in speech applications involving a...