In this work we address the speaker verification task in domestic environments where multiple rooms are monitored by a set of distributed microphones. In particular, we focus on the mismatch among three stages: the training of the total variability feature extraction hyper-parameters, the enrolment stage, which occurs at a fixed position in the home, and the test phase, which could happen in any location of the apartment. Building on previous work, where a position-independent multi-channel verification system was introduced, we investigate different i-vector combination strategies to attenuate the effects of the above-mentioned mismatch sources. The proposed methods implicitly select the microphones in the room where the speaker is, without any kno...
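As a rough illustration of what combining i-vectors across distributed microphones can look like, the sketch below contrasts two generic strategies: averaging the length-normalized per-channel i-vectors before scoring, and taking the maximum per-channel score. Both are assumptions chosen for illustration and are not necessarily the strategies studied in the abstract above; the function names, the cosine scoring back-end, and the toy two-dimensional vectors are likewise hypothetical.

```python
import math


def length_normalize(v):
    """Scale a vector to unit Euclidean norm."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in v]


def cosine_score(a, b):
    """Cosine similarity between two i-vectors (assumed scoring back-end)."""
    a, b = length_normalize(a), length_normalize(b)
    return sum(x * y for x, y in zip(a, b))


def score_mean_ivector(enrol, test_channels):
    """Average the normalized per-microphone i-vectors, then score once."""
    normalized = [length_normalize(v) for v in test_channels]
    dim = len(enrol)
    mean = [sum(v[i] for v in normalized) / len(normalized) for i in range(dim)]
    return cosine_score(enrol, mean)


def score_max_channel(enrol, test_channels):
    """Score every microphone separately and keep the best match.

    Taking the maximum picks the best-matching channel without using
    any information about where the speaker is.
    """
    return max(cosine_score(enrol, v) for v in test_channels)


# Toy data: one enrolment i-vector and two test-time microphone i-vectors.
enrol = [1.0, 0.0]
test_channels = [[2.0, 0.0], [0.0, 3.0]]

mean_score = score_mean_ivector(enrol, test_channels)
max_score = score_max_channel(enrol, test_channels)
print(f"mean-i-vector score: {mean_score:.4f}")
print(f"max-channel score:   {max_score:.4f}")
```

In this toy case the max-channel strategy is unaffected by the mismatched second microphone, while the averaged i-vector is pulled away from the enrolment vector. Real systems would operate on i-vectors of several hundred dimensions and typically apply channel compensation before scoring.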
Human–computer interaction (HCI) using speech communication is becoming increasingly import...
This thesis describes the development of a robust automatic speaker verification system (ASV) with s...
When several acoustic sources are simultaneously active in a meeting room scenario, and both the pos...
In this work we address the speaker verification task in domestic environments, monitored by multipl...
While considerable work has been done to characterize the detrimental effects of channel variability...
It is known that channel variability compromises automatic speaker recognition accuracy. Ho...
Domestic environments are particularly challenging for distant speech recognition: reverberation, b...
In this work we focus on speaker verification on channels of varying quality, namely Skype and high ...
In this paper, we examine the challenging problem of detecting acoustic events and voice activity in...
Automatic speech recognition in a room with distant microphones is strongly affected by noise and re...
This paper addresses the problem of speaker verification under reverberant conditions, using only th...
We explore how intrinsic variations (those associated with the speaker rather than the recording env...
Today's smart devices using speaker verification are getting equipped with mul...
Conventional microphone array implementations aim to lock onto a source with given location and if r...
In this work, we deal with the analysis and comparison of information combinations of multi-channel ...