Indexing the spoken content of audio recordings requires automatic speech recognition, which is still not reliable. Unlike text indexing, a speech recognizer cannot tell us with certainty whether a word is present at a given point in the audio; it can only give a probability for it. Making correct use of these probabilities can significantly improve spoken-document search accuracy. First, we describe how to improve accuracy for “web-search style” (AND/phrase) queries into audio by using speech recognition alternates (competing word hypotheses during recognition) and word posterior probabilities (confidence scores), based on word lattices. Then, we present an end-to-end approach to doing so using stan...
This paper presents some developments in query expansion and document representation of our spoken d...
The support for typically out-of-vocabulary query terms such as names, acronyms, and foreign words i...
• Ever-increasing volumes of audio-visual content have been accumulated on the Internet and in the e...
Indexing the spoken content of audio recordings requires automatic speech recognition, which is as ...
The paper presents the Position Specific Posterior Lattice (PSPL), a novel lossy repr...
Abstract. This paper describes a designed and implemented system for efficient storage, indexing and...
This thesis describes a designed and implemented system for efficient storage, indexing and search i...
The paper presents the Position Specific Posterior Lattice, a novel lossy representation of automati...
Searching for relevant material in documents containing transcription errors presents new challenges...
This paper describes a designed and implemented system for efficient storage, indexing and search i...
In this paper, we investigate a number of robust indexing and retrieval methods in an effort to imp...
Searching for keywords in a collection of spoken documents is a challenging task. The use of Automat...
Abstract. This paper presents a series of analyses and experiments on spoken document retrieval syst...