This paper investigates robust privacy-sensitive audio features for speaker diarization in multiparty conversations, i.e., a set of audio features having low linguistic information for speaker diarization in single and multiple distant microphone scenarios. We systematically investigate the Linear Prediction (LP) residual. Issues such as prediction order and choice of representation of the LP residual are studied. Additionally, we explore the combination of the LP residual with subband information from 2.5 kHz to 3.5 kHz and spectral slope. Next, we propose a supervised framework using a deep neural architecture for deriving privacy-sensitive audio features. We benchmark these approaches against the traditional Mel Frequency Cepstral Coefficients (MFCC)...
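To make the role of the prediction order concrete, the following is a minimal sketch of how an LP residual can be computed for one frame. It assumes the textbook autocorrelation method solved with the Levinson-Durbin recursion, which may differ from the configuration used in the paper. The function names (`autocorrelation`, `levinson_durbin`, `lp_residual`) and the synthetic test frame are illustrative assumptions, not taken from the source.

```python
def autocorrelation(frame, max_lag):
    """Autocorrelation r[0..max_lag] of one frame (zero outside the frame)."""
    n = len(frame)
    return [sum(frame[i] * frame[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the LP normal equations; returns a[0..order] with a[0] = 1."""
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:  # degenerate (e.g. all-zero) frame: stop early
            break
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err  # reflection coefficient for this order
        prev = a[:]
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err *= (1.0 - k * k)
    return a


def lp_residual(frame, order):
    """Residual e[n] = s[n] + sum_{k=1..order} a[k] * s[n-k].

    Samples before the start of the frame are treated as zero.
    """
    a = levinson_durbin(autocorrelation(frame, order), order)
    return [sum(a[k] * frame[n - k] for k in range(min(order, n) + 1))
            for n in range(len(frame))]


# Synthetic frame (an assumption for illustration, not real speech):
# the impulse response of a stable two-pole resonator.
frame = []
for n in range(200):
    x = 1.0 if n == 0 else 0.0
    s1 = frame[n - 1] if n >= 1 else 0.0
    s2 = frame[n - 2] if n >= 2 else 0.0
    frame.append(x + 1.8 * s1 - 0.9 * s2)

coeffs = levinson_durbin(autocorrelation(frame, 2), 2)
residual = lp_residual(frame, order=2)
```

In this sketch the inverse filter removes the resonance described by the LP coefficients, so the residual carries far less energy than the frame itself. On real speech, a higher prediction order removes more of the spectral envelope, which is the trade-off that a study of prediction order would examine.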
Speaker recognition is applied in smart home devices, interactive voice response systems, call cente...
Automatic Speaker Diarization (ASD) is an enabling technology with numerous applications, which deal...
With widespread use of advanced technology for the recording, storing and sharing of social interact...
We present a comprehensive study of linear prediction residual for speaker diarization on single and...
The goal of this paper is to investigate features for speech/nonspeech detection (SND) having low li...
In this paper we investigate a set of privacy-sensitive audio features for speaker change detection ...
Privacy preservation has long been a concern in smart acoustic monitoring systems, where speech can ...
Personal audio logs are often recorded in multiple environments. This poses challenges for robust fr...
Voice assistive technologies have given rise to far-reaching privacy and security concerns. In this ...
Privacy is a fundamental aspect of human interactions and with the growing popularity of tracking an...
Sharing real-world speech utterances is key to the training and deployment of voice-based services. ...
We present privacy-sensitive methods for (1) automatically finding multi-person conversations in spo...
Voice user interfaces and digital assistants are rapidly entering our lives and becoming singular to...