International audienceMeetings are a common activity in professional contexts, and it remains challenging to endow vocal assistants with advanced functionalities to facilitate meeting management. In this context, a task like active speaker detection can provide useful insights to model interaction between meeting participants. Detection of the active speaker can be performed using only video based on the movements of the participants of a meeting. Depending on the assistant design and each participant position regarding the device, active speaker detection can benefit from information coming from visual and audio modalities. Motivated by our application context related to advanced meeting assistant, we want to combine audio and visual infor...
This paper presents a self-supervised method for visual detection of the active speaker in a multi-p...
International audienceIt is now well established from a variety of studies that there is a significa...
© 2015 ACM. Active speakers have traditionally been identified in video by detecting their moving li...
International audienceMeetings are a common activity in professional contexts, and it remains challe...
International audienceMeetings are a common activity in professional contexts, and it remains challe...
Human can extract speech signals that they need to understand from a mixture of background noise, in...
—Active speaker detection (ASD) is a multi-modal task that aims to identify who, if anyone, is speak...
Human beings have developed fantastic abilities to integrate information from various sensory sourc...
In this paper we address the problem of estimating who is speak-ing from automatically extracted low...
Many previous audio-visual voice-related works focus on speech, ignoring the singing voice in the gr...
© 2016 ACM. In this work, we show how to co-Train a classifier for active speaker detection using au...
Chakravarty P., Zegers J., Tuytelaars T., Van hamme H., ''Active speaker detection with audio-visual...
This paper presents a self-supervised method for visual detection of the active speaker in a multi-p...
This paper presents a self-supervised method for visual detection of the active speaker in a multi-p...
This paper presents a self-supervised method for visual detection of the active speaker in a multi-p...
This paper presents a self-supervised method for visual detection of the active speaker in a multi-p...
International audienceIt is now well established from a variety of studies that there is a significa...
© 2015 ACM. Active speakers have traditionally been identified in video by detecting their moving li...
International audienceMeetings are a common activity in professional contexts, and it remains challe...
International audienceMeetings are a common activity in professional contexts, and it remains challe...
Human can extract speech signals that they need to understand from a mixture of background noise, in...
—Active speaker detection (ASD) is a multi-modal task that aims to identify who, if anyone, is speak...
Human beings have developed fantastic abilities to integrate information from various sensory sourc...
In this paper we address the problem of estimating who is speak-ing from automatically extracted low...
Many previous audio-visual voice-related works focus on speech, ignoring the singing voice in the gr...
© 2016 ACM. In this work, we show how to co-Train a classifier for active speaker detection using au...
Chakravarty P., Zegers J., Tuytelaars T., Van hamme H., ''Active speaker detection with audio-visual...
This paper presents a self-supervised method for visual detection of the active speaker in a multi-p...
This paper presents a self-supervised method for visual detection of the active speaker in a multi-p...
This paper presents a self-supervised method for visual detection of the active speaker in a multi-p...
This paper presents a self-supervised method for visual detection of the active speaker in a multi-p...
International audienceIt is now well established from a variety of studies that there is a significa...
© 2015 ACM. Active speakers have traditionally been identified in video by detecting their moving li...