Audio-visual source localization is a challenging task that aims to predict the location of visual sound sources in a video. Since collecting ground-truth annotations of sounding objects can be costly, a plethora of weakly-supervised localization methods that can learn from datasets with no bounding-box annotations have been proposed in recent years, by leveraging the natural co-occurrence of audio and visual signals. Despite significant interest, popular evaluation protocols have two major flaws. First, they allow for the use of a fully annotated dataset to perform early stopping, thus significantly increasing the annotation effort required for training. Second, current evaluation metrics assume the presence of sound sources at all times. ...
Abstract—This paper addresses the problem of localizing audio sources using binaural measurements. W...
—Training a robust tracker of objects (such as vehicles and people) using audio and visual informati...
Psychophysical and physiological evidence shows that sound local-ization of acoustic signals is stro...
Learning to localize the sound source in videos without explicit annotations is a novel area of audi...
International audienceHumans can easily recognize where and how the sound is produced via watching a...
Visual events are usually accompanied by sounds in our daily lives. However, can the machines learn ...
We propose to explore a new problem called audio-visual segmentation (AVS), in which the goal is to ...
We present a method for simultaneously localizing multiple sound sources within a visual scene. This...
In this paper, we perform audio-visual sound source separation, i.e. to separate component audios fr...
In this paper, we investigate techniques to localize the sound source in video made using one microp...
Early computational approaches for sound source localization, originating in robotics, were modeled ...
This paper focuses on the weakly-supervised audio-visual video parsing task, which aims to recognize...
The ability to localize visual objects that are associated with an audio source and at the same time...
Deep learning has fueled an explosion of applications, yet training deep neural networks usually req...
Low-frequency sound source localization generates considerable amount of disagreement between audio/...
Abstract—This paper addresses the problem of localizing audio sources using binaural measurements. W...
—Training a robust tracker of objects (such as vehicles and people) using audio and visual informati...
Psychophysical and physiological evidence shows that sound local-ization of acoustic signals is stro...
Learning to localize the sound source in videos without explicit annotations is a novel area of audi...
International audienceHumans can easily recognize where and how the sound is produced via watching a...
Visual events are usually accompanied by sounds in our daily lives. However, can the machines learn ...
We propose to explore a new problem called audio-visual segmentation (AVS), in which the goal is to ...
We present a method for simultaneously localizing multiple sound sources within a visual scene. This...
In this paper, we perform audio-visual sound source separation, i.e. to separate component audios fr...
In this paper, we investigate techniques to localize the sound source in video made using one microp...
Early computational approaches for sound source localization, originating in robotics, were modeled ...
This paper focuses on the weakly-supervised audio-visual video parsing task, which aims to recognize...
The ability to localize visual objects that are associated with an audio source and at the same time...
Deep learning has fueled an explosion of applications, yet training deep neural networks usually req...
Low-frequency sound source localization generates considerable amount of disagreement between audio/...
Abstract—This paper addresses the problem of localizing audio sources using binaural measurements. W...
—Training a robust tracker of objects (such as vehicles and people) using audio and visual informati...
Psychophysical and physiological evidence shows that sound local-ization of acoustic signals is stro...