Abstract—This paper investigates robust privacy-sensitive au-dio features for speaker diarization in multiparty conversations: ie., a set of audio features having low linguistic information for speaker diarization in a single and multiple distant microphone scenarios. We systematically investigate Linear Prediction (LP) residual. Issues such as prediction order and choice of represen-tation of LP residual are studied. Additionally, we explore the combination of LP residual with subband information from 2.5 kHz to 3.5 kHz and spectral slope. Next, we propose a supervised framework using deep neural architecture for deriving privacy-sensitive audio features. We benchmark these approaches against the traditional Mel Frequency Cepstral Coeffici...
Sharing real-world speech utterances is key to the training and deployment of voice-based services. ...
Speaker diarisation addresses the question of 'who speaks when' in audio recordings, and has been st...
The objective of this work is to study state-of-the-art deep neural networks based speaker verificat...
Abstract—This paper investigates robust privacy-sensitive au-dio features for speaker diarization in...
We present a comprehensive study of linear prediction residual for speaker diarization on single and...
The goal of this paper is to investigate features for speech/nonspeech detection (SND) having low li...
Privacy is a fundamental aspect of human interactions and with the growing popularity of tracking an...
Speaker diarization finds contiguous speaker segments in an audio recording and clusters them by spe...
In this paper we investigate a set of privacy-sensitive audio features for speaker change detection ...
Privacy preservation has long been a concern in smart acoustic monitoring systems, where speech can ...
With widespread use of advanced technology for the recording, storing and sharing of social interact...
Personal audio logs are often recorded in multiple environments. This poses challenges for robust fr...
International audienceSharing real-world speech utterances is key to the training and deployment of ...
The goal in Speaker Diarization (SD) is to answer the question "Who spoke when?" for a given audio w...
We present privacy-sensitive methods for (1) automatically finding multi-person conversations in spo...
Sharing real-world speech utterances is key to the training and deployment of voice-based services. ...
Speaker diarisation addresses the question of 'who speaks when' in audio recordings, and has been st...
The objective of this work is to study state-of-the-art deep neural networks based speaker verificat...
Abstract—This paper investigates robust privacy-sensitive au-dio features for speaker diarization in...
We present a comprehensive study of linear prediction residual for speaker diarization on single and...
The goal of this paper is to investigate features for speech/nonspeech detection (SND) having low li...
Privacy is a fundamental aspect of human interactions and with the growing popularity of tracking an...
Speaker diarization finds contiguous speaker segments in an audio recording and clusters them by spe...
In this paper we investigate a set of privacy-sensitive audio features for speaker change detection ...
Privacy preservation has long been a concern in smart acoustic monitoring systems, where speech can ...
With widespread use of advanced technology for the recording, storing and sharing of social interact...
Personal audio logs are often recorded in multiple environments. This poses challenges for robust fr...
International audienceSharing real-world speech utterances is key to the training and deployment of ...
The goal in Speaker Diarization (SD) is to answer the question "Who spoke when?" for a given audio w...
We present privacy-sensitive methods for (1) automatically finding multi-person conversations in spo...
Sharing real-world speech utterances is key to the training and deployment of voice-based services. ...
Speaker diarisation addresses the question of 'who speaks when' in audio recordings, and has been st...
The objective of this work is to study state-of-the-art deep neural networks based speaker verificat...