Spoken Document Retrieval (SDR) is usually implemented by running an Information Retrieval (IR) engine on speech transcripts produced by an Automatic Speech Recognition (ASR) system. These transcripts generally contain a substantial number of transcription errors (noise) and are mostly unstructured. This thesis addresses two challenges that arise when doing IR on this type of source material: (i) segmentation of speech transcripts into suitable retrieval units, and (ii) evaluation of the impact of transcript noise on the results of an IR task. It is shown that intrinsic evaluation leads to different conclusions about the quality of automatic story boundaries than extrinsic evaluation with Mean Average Precision (MAP). This...
This paper describes the experiments performed as part of the TREC-97 Spoken Document Retrieval Trac...
In many cases, textual information can be associated with speech signals such ...
This paper describes the spoken document retrieval system that we have been developing and assesses ...
This thesis introduces a novel framework for the evaluation of Automatic Speech Recognition (ASR) tr...
Speech recognition transcripts are being used in various fields of research and practical applicatio...
The dramatic increase in the creation of multimedia content is leading to the development of large a...
Abstract. This paper presents a series of analyses and experiments on spoken document retrieval syst...
This paper presents a series of analyses and experiments on spoken document retrieval systems: sear...
Advances in automatic speech recognition allow us to search large speech collections using tradition...
Information Retrieval systems determine relevance by comparing information needs with the content of...
Accessing information in multimedia databases encompasses a wide range of applications in which spo...
Spoken document retrieval is defined as information retrieval from transcribed spoken audio, and the...
This paper presents some developments in query expansion and document representation of our spoken d...
Within the context of international benchmarks and collection specific projects, much work on spoken...
Speech information retrieval seeks to facilitate retrieving and accessing spoken content. Speech ret...