This paper proposes a method for recovering the audio-visual synchronization of multimedia content. It exploits the correlation between the acoustic and visual signals to estimate the audio-visual drift present in the content. The drift is estimated by shifting the audio signal relative to the visual signal and searching for the shift that produces the maximal audio-visual correlation. We consider two correlation measures, namely mutual information and canonical correlation, and compare their performance. Experimental results demonstrate that the method using canonical correlation is effective in recovering audio-visual synchronization for both speech and non-speech sequences. Index Terms: Audio-visual syn...
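The shift search described in the abstract above can be illustrated with a minimal sketch. This is not the paper's implementation: it assumes each modality has already been reduced to a single scalar feature per video frame (for example an audio energy value and a mouth-region motion value, both hypothetical choices), and it scores each candidate shift with the absolute Pearson correlation. For scalar features this coincides with the canonical correlation; the multi-dimensional case and the mutual-information measure are not covered. The names `estimate_drift`, `audio_feat`, `video_feat`, and `max_shift` are introduced here for illustration only.

```python
import numpy as np

def estimate_drift(audio_feat, video_feat, max_shift):
    """Return (shift, score): the audio shift in frames that maximises
    the absolute Pearson correlation with the video feature.

    Assumption: both inputs are 1-D per-frame features at the same rate.
    A positive shift means the audio lags the video by that many frames.
    """
    a = np.asarray(audio_feat, dtype=float)
    v = np.asarray(video_feat, dtype=float)
    n = min(len(a), len(v))
    best_shift, best_score = 0, -np.inf
    for s in range(-max_shift, max_shift + 1):
        # Pair audio sample a[t + s] with video sample v[t].
        if s >= 0:
            a_seg, v_seg = a[s:n], v[:n - s]
        else:
            a_seg, v_seg = a[:n + s], v[-s:n]
        # Skip shifts with too little overlap or a constant segment.
        if len(a_seg) < 2 or a_seg.std() == 0 or v_seg.std() == 0:
            continue
        score = abs(np.corrcoef(a_seg, v_seg)[0, 1])
        if score > best_score:
            best_shift, best_score = s, score
    return best_shift, best_score

# Synthetic check: the "audio" feature is the "video" feature delayed by 7 frames.
rng = np.random.default_rng(0)
video = rng.standard_normal(500)
audio = np.roll(video, 7)
shift, score = estimate_drift(audio, video, max_shift=20)
print(shift, round(score, 3))
```

The exhaustive search is linear in the number of candidate shifts, which is adequate for drifts of a few seconds at video frame rates; the overlap shrinks as the shift grows, so very large `max_shift` values relative to the sequence length make the scores less reliable.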
The objective of this paper is audio-visual synchronisation of general videos 'in the wild'. For suc...
This thesis presents a computational framework to jointly analyze auditory and visual information. T...
We describe a novel approach for determining the audio-visual synchrony of a monologue video sequenc...
In this paper, we address the problem of automatic discovery of speech patterns using audio-visual i...
The world is a fundamentally multisensory place, with many physical events generating signals that c...
In our approach, we aim at an objective measurement of synchrony in multimodal behavior. The use of ...
Music videos are good examples of multimedia documents in which the structures of the audio and vide...
Traditional synchronization schemes of multimedia applications are based on temporal relationships b...
Audio and video synchronization plays an important role in speech recognition and multimedia com...
Performances of synchronous imagery and music have existed for almost 300 years, but they have becom...
Given the proliferation of consumer media recording devices, events often give rise to a large numb...
We propose a framework based on signatures extracted from audio and video streams for auto...