This paper tackles the problem of out-of-vocabulary (OOV) words in automatic speech transcription (AST) applications for a compounding language (Dutch). A seemingly attractive way to reduce the number of OOV words in compounding languages is to extend the AST system with a compound (de-)composition module. So far, however, successful implementations of this approach have been scarce. We developed a novel data-driven compound (de-)composition module and tested it in two different AST experiments. For equal lexicon sizes, our compound processor lowers the OOV rate. Moreover, we are able to translate that gain in OOV rate into a reduction of the word error rate (WER) of the transcription system. Using our approach we built a system with an 84K lexicon t...
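To make the OOV argument concrete, the following is a minimal sketch of how decomposing compounds can lower the OOV rate for a fixed lexicon. It is not the data-driven module described in the abstract: the splitter is a naive dictionary lookup, and the function names (`oov_rate`, `split_compound`), the `min_len` parameter, the toy lexicon, and the sample sentence are all assumptions made for illustration.

```python
def oov_rate(tokens, lexicon):
    """Fraction of tokens that are not in the lexicon."""
    if not tokens:
        return 0.0
    return sum(1 for t in tokens if t not in lexicon) / len(tokens)


def split_compound(word, lexicon, min_len=3):
    """Naive two-part splitter (illustrative assumption, not the paper's method).

    Returns [head, tail] if the word can be cut into two lexicon entries,
    each at least min_len characters long; otherwise returns [word].
    """
    if word in lexicon:
        return [word]
    for i in range(min_len, len(word) - min_len + 1):
        head, tail = word[:i], word[i:]
        if head in lexicon and tail in lexicon:
            return [head, tail]
    return [word]


# Toy lexicon and sentence, invented for this example.
lexicon = {"het", "is", "nieuw", "fiets", "pad", "school", "plein"}
tokens = ["het", "fietspad", "bij", "het", "schoolplein", "is", "nieuw"]

baseline = oov_rate(tokens, lexicon)

# Decompose every token before measuring the OOV rate again.
decomposed = [part for t in tokens for part in split_compound(t, lexicon)]
with_splitting = oov_rate(decomposed, lexicon)

print(f"OOV rate without decomposition: {baseline:.3f}")
print(f"OOV rate with decomposition:    {with_splitting:.3f}")
```

In this toy case the two compounds are covered by their parts, so only the genuinely unknown word remains out of vocabulary. A real system would also have to recompose the recognized parts into compounds afterwards, which this sketch does not attempt.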
In Technical Report No. 75 I proposed a method for describing compound words in Finnish. The aim in ...
Four experiments investigated the role of frequency information in compound production by independen...
This paper addresses compound splitting for Dutch in the context of broadcast news transcription. La...
Pelemans J., Demuynck K., Van hamme H., Wambacq P., ''Coping with language data sparsity: semantic h...
Compounds pose a problem for applications that rely on precise word alignments such as bilingual ter...
Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA-2007. Editors: Jo...
© 2015 IEEE. Compounding is one of the most productive word formation processes in many languages an...
In this article we investigate statistical machine translation (SMT) into Germanic languages, with a...
This paper compares two approaches to lexical compound word reconstruction from a speech recognizer ...
Compounding, the process of combining several simplex words into a complex whole, is a productive p...