Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2009.Cataloged from PDF version of thesis.Includes bibliographical references (p. 103-106).The problem of keyword spotting in audio data has been explored for many years. Typically researchers use supervised methods to train statistical models to detect keyword instances. However, such supervised methods require large quantities of annotated data that is unlikely to be available for the majority of languages in the world. This thesis addresses this lack-of-annotation problem and presents two completely unsupervised spoken keyword spotting systems that do not require any transcribed data. In the first system, a Gaussian Mixture Model i...
Current supervised speech technology relies heavily on tran-scribed speech and pronunciation diction...
This paper investigates detection of English keywords in a conver-sational scenario using a combinat...
State of the art technologies for speech recognition are very accurate for heavily studied languages...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
Tesis doctoral inédita. Universidad Autónoma de Madrid, Escuela Politécnica Superior, marzo de 200
AbstractKeyword spotting refers to the detection of a limited number of given keywords in speech utt...
This paper presents a system for keyword detection in spontaneous speech. Keywords are predefined th...
This study presents a novel zero-shot user-defined keyword spotting model that utilizes the audio-ph...
This paper investigates detection of English keywords in a conversational scenario using a combinati...
In this paper, we further investigate the large vocabulary continuous speech recognition approach to...
Abstract. This paper describes several approaches to keyword spotting (KWS) for informal continuous ...
In this paper, we propose a novel end-to-end user-defined keyword spotting method that utilizes ling...
We address the problem of retrieving spoken information from noisy and heterogeneous audio archives ...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
This thesis presents work in research topics of audio detection. It first describes a system for lar...
Current supervised speech technology relies heavily on tran-scribed speech and pronunciation diction...
This paper investigates detection of English keywords in a conver-sational scenario using a combinat...
State of the art technologies for speech recognition are very accurate for heavily studied languages...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
Tesis doctoral inédita. Universidad Autónoma de Madrid, Escuela Politécnica Superior, marzo de 200
AbstractKeyword spotting refers to the detection of a limited number of given keywords in speech utt...
This paper presents a system for keyword detection in spontaneous speech. Keywords are predefined th...
This study presents a novel zero-shot user-defined keyword spotting model that utilizes the audio-ph...
This paper investigates detection of English keywords in a conversational scenario using a combinati...
In this paper, we further investigate the large vocabulary continuous speech recognition approach to...
Abstract. This paper describes several approaches to keyword spotting (KWS) for informal continuous ...
In this paper, we propose a novel end-to-end user-defined keyword spotting method that utilizes ling...
We address the problem of retrieving spoken information from noisy and heterogeneous audio archives ...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
This thesis presents work in research topics of audio detection. It first describes a system for lar...
Current supervised speech technology relies heavily on tran-scribed speech and pronunciation diction...
This paper investigates detection of English keywords in a conver-sational scenario using a combinat...
State of the art technologies for speech recognition are very accurate for heavily studied languages...