Voice based personal assistants are part of our daily lives. Their performance suffers in the presence of signal distortions, such as noise, reverberation, and competing speakers. This thesis addresses the problem of extracting the signal of interest in such challenging conditions by first localizing the target speaker and using the location to extract the target speech. In a first stage, a common situation is considered when the target speaker utters a known word or sentence such as the wake-up word of a distant-microphone voice command system. A method that exploits this text information in order to improve the speaker localization performance in the presence of competing speakers is proposed. The proposed solution uses a speech recogniti...
More and more devices we use in our daily life are embedded with one or more microphones so that the...
More and more devices we use in our daily life are embedded with one or more microphones so that the...
Some of the experiments presented in this manuscript were performed on Grid5000, a server supported ...
Les assistants vocaux font partie de notre vie quotidienne. Leurs performances sont mises à l'épreuv...
This work was conducted in the fast-growing context of hands-free voice command. In domestic environ...
This work was conducted in the fast-growing context of hands-free voice command. In domestic environ...
Cette thèse s'inscrit dans le contexte de l'essor des assistants vocaux mains libres. Dans un enviro...
Sound source localization (SSL) is a subtask of audio scene analysis that has challenged researchers...
Sound source localization (SSL) is a subtask of audio scene analysis that has challenged researchers...
Sound source localization (SSL) is a subtask of audio scene analysis that has challenged researchers...
Submitted to ICASSP 2020We investigate the effect of speaker localization on the performance of spee...
La localisation de sources sonores est une sous-tâche de l'analyse de scènes sonores qui a défié les...
International audienceSpeaker localization is a hard task, especially in adverse environmental condi...
International audienceSpeaker localization is a hard task, especially in adverse environmental condi...
This PhD falls within the development of hands-free telecommunication systems, more specifically sma...
More and more devices we use in our daily life are embedded with one or more microphones so that the...
More and more devices we use in our daily life are embedded with one or more microphones so that the...
Some of the experiments presented in this manuscript were performed on Grid5000, a server supported ...
Les assistants vocaux font partie de notre vie quotidienne. Leurs performances sont mises à l'épreuv...
This work was conducted in the fast-growing context of hands-free voice command. In domestic environ...
This work was conducted in the fast-growing context of hands-free voice command. In domestic environ...
Cette thèse s'inscrit dans le contexte de l'essor des assistants vocaux mains libres. Dans un enviro...
Sound source localization (SSL) is a subtask of audio scene analysis that has challenged researchers...
Sound source localization (SSL) is a subtask of audio scene analysis that has challenged researchers...
Sound source localization (SSL) is a subtask of audio scene analysis that has challenged researchers...
Submitted to ICASSP 2020We investigate the effect of speaker localization on the performance of spee...
La localisation de sources sonores est une sous-tâche de l'analyse de scènes sonores qui a défié les...
International audienceSpeaker localization is a hard task, especially in adverse environmental condi...
International audienceSpeaker localization is a hard task, especially in adverse environmental condi...
This PhD falls within the development of hands-free telecommunication systems, more specifically sma...
More and more devices we use in our daily life are embedded with one or more microphones so that the...
More and more devices we use in our daily life are embedded with one or more microphones so that the...
Some of the experiments presented in this manuscript were performed on Grid5000, a server supported ...