We present a comprehensive study of linear prediction residual for speaker diarization on single and multiple distant microphone conditions in privacy-sensitive settings, a requirement to analyze a wide range of spontaneous conversations. Two representations of the residual are compared, namely real-cepstrum and MFCC, with the latter performing better. Experiments on RT06eval show that residual with subband information from 2.5 kHz to 3.5 kHz and spectral slope yields a performance close to traditional MFCC features. As a way to objectively evaluate privacy in terms of linguistic information, we perform phoneme recognition. Residual features yield low phoneme accuracies compared to traditional MFCC features
In some situations, a user would like to communicate without detection. It has been shown that it is...
While public speech resources become increasingly available, there is a growing interest to preserve...
Voice user interfaces and digital assistants are rapidly entering our lives and becoming singular to...
This paper investigates robust privacy-sensitive audio features for speaker diarization in multipart...
The goal of this paper is to investigate features for speech/nonspeech detection (SND) having low li...
In this paper we investigate a set of privacy-sensitive audio features for speaker change detection ...
Personal audio logs are often recorded in multiple environments. This poses challenges for robust fr...
Privacy preservation has long been a concern in smart acoustic monitoring systems, where speech can ...
Voice assistive technologies have given rise to far-reaching privacy and security concerns. In this ...
Speaker recognition is applied in smart home devices, interactive voice response systems, call cente...
International audienceSharing real-world speech utterances is key to the training and deployment of ...
Sharing real-world speech utterances is key to the training and deployment of voice-based services. ...
Leakage of personal information in online conversations raises serious privacy concerns. For example...
We present privacy-sensitive methods for (1) automatically finding multi-person conversations in spo...
Automatic Speaker Diarization (ASD) is an enabling technology with numerous applications, which deal...
In some situations, a user would like to communicate without detection. It has been shown that it is...
While public speech resources become increasingly available, there is a growing interest to preserve...
Voice user interfaces and digital assistants are rapidly entering our lives and becoming singular to...
This paper investigates robust privacy-sensitive audio features for speaker diarization in multipart...
The goal of this paper is to investigate features for speech/nonspeech detection (SND) having low li...
In this paper we investigate a set of privacy-sensitive audio features for speaker change detection ...
Personal audio logs are often recorded in multiple environments. This poses challenges for robust fr...
Privacy preservation has long been a concern in smart acoustic monitoring systems, where speech can ...
Voice assistive technologies have given rise to far-reaching privacy and security concerns. In this ...
Speaker recognition is applied in smart home devices, interactive voice response systems, call cente...
International audienceSharing real-world speech utterances is key to the training and deployment of ...
Sharing real-world speech utterances is key to the training and deployment of voice-based services. ...
Leakage of personal information in online conversations raises serious privacy concerns. For example...
We present privacy-sensitive methods for (1) automatically finding multi-person conversations in spo...
Automatic Speaker Diarization (ASD) is an enabling technology with numerous applications, which deal...
In some situations, a user would like to communicate without detection. It has been shown that it is...
While public speech resources become increasingly available, there is a growing interest to preserve...
Voice user interfaces and digital assistants are rapidly entering our lives and becoming singular to...