We interpret the sound reaching our ears as the combined effect of independent, sound-producing entities in the external world; hearing would have limited usefulness if were defeated by overlapping sounds. Computer systems that are to interpret real-world sounds -- for speech recognition or for multimedia indexing -- must similarly interpret complex mixtures. However, existing functional models of audition employ only data-driven processing incapable of making context-dependent inferences in the face of interference. We propose a prediction-driven approach to this problem, raising numerous issues including the need to represent any kind of sound, and to handle multiple competing hypotheses. Results from an implementation of this approach il...
Tutorial on auditory scene analysis and source separation in humans and machines
The high levels goals of this thesis are to: understand the neural representation of sound, produce ...
This is the final version. Available on open access from Elsevier via the DOI in this recordData ava...
Computational auditory scene analysis — modeling the human ability to organize sound mixtures accord...
The sound of a busy environment, such as a city street, gives rise to a perception of numerous disti...
The field of computational auditory scene analysis (CASA) strives to build computer models of the hu...
I propose a structure for the first stage of a computer system capable of performing complex auditor...
Interprets speech recognition as a problem in Computational Auditory Scene Analysis, and discusses t...
Focuses on several different approaches to handling sound mixtures: computational auditory scene ana...
Introduction to auditory scene analysis and its computational modeling, to speech recognition in noi...
Computational Auditory Scene Analysis (CASA) is challenging problem for which many different approac...
An overview of the work of the Laboratory for Recognition and Organization of Speech and Audio, Depa...
Compared to traditional speech, music, or sound processing, the computational analysis of general au...
A major challenge for the auditory system is to disentangle signals emitted by two or more sound sou...
The acoustic environment surrounding us is extremely dynamic and unstructured in nature. Humans exhi...
Tutorial on auditory scene analysis and source separation in humans and machines
The high levels goals of this thesis are to: understand the neural representation of sound, produce ...
This is the final version. Available on open access from Elsevier via the DOI in this recordData ava...
Computational auditory scene analysis — modeling the human ability to organize sound mixtures accord...
The sound of a busy environment, such as a city street, gives rise to a perception of numerous disti...
The field of computational auditory scene analysis (CASA) strives to build computer models of the hu...
I propose a structure for the first stage of a computer system capable of performing complex auditor...
Interprets speech recognition as a problem in Computational Auditory Scene Analysis, and discusses t...
Focuses on several different approaches to handling sound mixtures: computational auditory scene ana...
Introduction to auditory scene analysis and its computational modeling, to speech recognition in noi...
Computational Auditory Scene Analysis (CASA) is challenging problem for which many different approac...
An overview of the work of the Laboratory for Recognition and Organization of Speech and Audio, Depa...
Compared to traditional speech, music, or sound processing, the computational analysis of general au...
A major challenge for the auditory system is to disentangle signals emitted by two or more sound sou...
The acoustic environment surrounding us is extremely dynamic and unstructured in nature. Humans exhi...
Tutorial on auditory scene analysis and source separation in humans and machines
The high levels goals of this thesis are to: understand the neural representation of sound, produce ...
This is the final version. Available on open access from Elsevier via the DOI in this recordData ava...