In this paper we investigate a set of privacy-sensitive audio features for speaker change detection (SCD) in multiparty conversations. These features are based on three different principles: characterizing the excitation source information using linear prediction residual, characterizing subband spectral information shown to contain speaker information, and characterizing the general shape of the spectrum. Experiments show that the performance of the privacy-sensitive features is comparable or better than that of the state-of-the-art full-band spectral-based features, namely, mel frequency cepstral coefficients, which suggests that socially acceptable ways of recording conversations in real-life is feasible
Voice interfaces continue to grow in popularity, with standalone systems being deployed in our homes...
Audio-based sensing enables fine-grained human activity detection, such as sensing hand gestures and...
A smart speaker's onboarding process is extremely important because it is the user's first touchpoin...
The goal of this paper is to investigate features for speech/nonspeech detection (SND) having low li...
This paper investigates robust privacy-sensitive audio features for speaker diarization in multipart...
We present a comprehensive study of linear prediction residual for speaker diarization on single and...
Personal audio logs are often recorded in multiple environments. This poses challenges for robust fr...
We present privacy-sensitive methods for (1) automatically finding multi-person conversations in spo...
Automatic Speaker Diarization (ASD) is an enabling technology with numerous applications, which deal...
In this paper we propose a method for speaker change detection using features of excitation source o...
AbstractIn this paper, we describe about some privacy protection techniques for speech signals captu...
Privacy preservation has long been a concern in smart acoustic monitoring systems, where speech can ...
Speech is our main method of communication that allows us to intuitively communicate complex ideas a...
With the growing popularity of social networks, cloud services and online applications, people are b...
In this paper we propose an unsupervised speaker change detection (SCD) system developed for mobile ...
Voice interfaces continue to grow in popularity, with standalone systems being deployed in our homes...
Audio-based sensing enables fine-grained human activity detection, such as sensing hand gestures and...
A smart speaker's onboarding process is extremely important because it is the user's first touchpoin...
The goal of this paper is to investigate features for speech/nonspeech detection (SND) having low li...
This paper investigates robust privacy-sensitive audio features for speaker diarization in multipart...
We present a comprehensive study of linear prediction residual for speaker diarization on single and...
Personal audio logs are often recorded in multiple environments. This poses challenges for robust fr...
We present privacy-sensitive methods for (1) automatically finding multi-person conversations in spo...
Automatic Speaker Diarization (ASD) is an enabling technology with numerous applications, which deal...
In this paper we propose a method for speaker change detection using features of excitation source o...
AbstractIn this paper, we describe about some privacy protection techniques for speech signals captu...
Privacy preservation has long been a concern in smart acoustic monitoring systems, where speech can ...
Speech is our main method of communication that allows us to intuitively communicate complex ideas a...
With the growing popularity of social networks, cloud services and online applications, people are b...
In this paper we propose an unsupervised speaker change detection (SCD) system developed for mobile ...
Voice interfaces continue to grow in popularity, with standalone systems being deployed in our homes...
Audio-based sensing enables fine-grained human activity detection, such as sensing hand gestures and...
A smart speaker's onboarding process is extremely important because it is the user's first touchpoin...