International audienceRecognition of Proper Names (PNs) in speech is important for content based indexing and browsing of audio-video data.However, many PNs are Out-Of-Vocabulary (OOV) words nfor LVCSR systems used in these applications due to the diachronicnature of data. By exploiting semantic context of the audio, relevant OOV PNs can be retrieved and then the target PNs can be recovered. To retrieve OOV PNs, we propose to represent their context with document level semantic vectors; and show that this approach is able to handle less frequent OOV PNs in the training data. We study different representations, including Random Projections, LSA, LDA, Skip-gram, CBOW and GloVe. A further evaluation of recovery of target OOV PNs using a phonet...
International audienceDeveloping high-quality transcription systems for very large vocabulary corpor...
International audienceOne important issue of speech recognition systems is Out-of Vocabulary words (...
International audienceOut-of-vocabulary (OOV) words can pose a particular problem for automatic spee...
International audienceThe diachronic nature of broadcast news data leads to the problem of Out-Of-Vo...
International audienceMany Proper Names (PNs) are Out-Of-Vocabulary (OOV) words for speech recogniti...
International audienceRetrieving Proper Names (PNs) relevant to an audio documentcan improve speech ...
International audienceThis paper deals with the problem of high-quality transcription systems for ve...
International audienceDespite recent progress in developing Large Vocabulary Continuous Speech Recog...
The diachronic nature of broadcast news causes frequent variations in the linguisticcontent and voca...
International audienceProper name recognition is a challenging task in information retrieval from la...
International audienceDeveloping high-quality transcription systems for very large vocabulary corpor...
International audienceProper name recognition is a challenging task in information retrieval in larg...
International audienceProper names are usually key to understanding the information contained in a d...
International audienceThe problem of out-of-vocabulary words, more precisely proper names retrieval ...
International audienceOne important issue of speech recognition systems is Out-of Vocabulary words (...
International audienceDeveloping high-quality transcription systems for very large vocabulary corpor...
International audienceOne important issue of speech recognition systems is Out-of Vocabulary words (...
International audienceOut-of-vocabulary (OOV) words can pose a particular problem for automatic spee...
International audienceThe diachronic nature of broadcast news data leads to the problem of Out-Of-Vo...
International audienceMany Proper Names (PNs) are Out-Of-Vocabulary (OOV) words for speech recogniti...
International audienceRetrieving Proper Names (PNs) relevant to an audio documentcan improve speech ...
International audienceThis paper deals with the problem of high-quality transcription systems for ve...
International audienceDespite recent progress in developing Large Vocabulary Continuous Speech Recog...
The diachronic nature of broadcast news causes frequent variations in the linguisticcontent and voca...
International audienceProper name recognition is a challenging task in information retrieval from la...
International audienceDeveloping high-quality transcription systems for very large vocabulary corpor...
International audienceProper name recognition is a challenging task in information retrieval in larg...
International audienceProper names are usually key to understanding the information contained in a d...
International audienceThe problem of out-of-vocabulary words, more precisely proper names retrieval ...
International audienceOne important issue of speech recognition systems is Out-of Vocabulary words (...
International audienceDeveloping high-quality transcription systems for very large vocabulary corpor...
International audienceOne important issue of speech recognition systems is Out-of Vocabulary words (...
International audienceOut-of-vocabulary (OOV) words can pose a particular problem for automatic spee...