International audienceThis papers aims at improving spoken language modeling (LM) using very large amount of automatically transcribed speech. We leverage the INA (French National Audiovisual Institute 1) collection and obtain 19GB of text after applying ASR on 350,000 hours of diverse TV shows. From this, spoken language models are trained either by fine-tuning an existing LM (FlauBERT 2) or through training a LM from scratch. The new models (FlauBERT-Oral) are shared with the community 3 and are evaluated not only in terms of word prediction accuracy but also for two downstream tasks: classification of TV shows and syntactic parsing of speech. Experimental results show that FlauBERT-Oral is better than its initial FlauBERT version demonst...
Automatic speech recognition (ASR) requires a strong language model to guide the acoustic model and ...
International audienceIntroduction & Motivation Language Models (LMs) represent a crucial component ...
I hereby declare that I am the sole author of this thesis. This is a true copy of the thesis, includ...
International audienceThis papers aims at improving spoken language modeling (LM) using very large a...
International audienceWe aim at improving spoken language modeling (LM) using very large amount of a...
We aim at improving spoken language modeling (LM) using very large amount of automatically transcrib...
This paper reports on the participation of FBK at the IWSLT 2011 Evaluation: namely in the English A...
In the spoken language translation pipeline, machine translation systems that are trained solely on ...
ition (ASR): precisely on both English and German ASR track. Only primary submissions have been sent...
This paper introduces a new corpus of read English speech, suitable for training and evaluating spee...
Language Models (LMs) represent a crucial component in the architecture of Automatic Speech Recognit...
Spoken language translation (SLT) exists within one of the most challenging intersections of speech ...
Recently, there has been a rapidly increasing interest in using ASR for children’s language learning...
Some practical uses of ASR have been implemented, including the transcription of meetings and the us...
International audienceLuxembourgish is embedded in a multilingual context on the divide between Roma...
Automatic speech recognition (ASR) requires a strong language model to guide the acoustic model and ...
International audienceIntroduction & Motivation Language Models (LMs) represent a crucial component ...
I hereby declare that I am the sole author of this thesis. This is a true copy of the thesis, includ...
International audienceThis papers aims at improving spoken language modeling (LM) using very large a...
International audienceWe aim at improving spoken language modeling (LM) using very large amount of a...
We aim at improving spoken language modeling (LM) using very large amount of automatically transcrib...
This paper reports on the participation of FBK at the IWSLT 2011 Evaluation: namely in the English A...
In the spoken language translation pipeline, machine translation systems that are trained solely on ...
ition (ASR): precisely on both English and German ASR track. Only primary submissions have been sent...
This paper introduces a new corpus of read English speech, suitable for training and evaluating spee...
Language Models (LMs) represent a crucial component in the architecture of Automatic Speech Recognit...
Spoken language translation (SLT) exists within one of the most challenging intersections of speech ...
Recently, there has been a rapidly increasing interest in using ASR for children’s language learning...
Some practical uses of ASR have been implemented, including the transcription of meetings and the us...
International audienceLuxembourgish is embedded in a multilingual context on the divide between Roma...
Automatic speech recognition (ASR) requires a strong language model to guide the acoustic model and ...
International audienceIntroduction & Motivation Language Models (LMs) represent a crucial component ...
I hereby declare that I am the sole author of this thesis. This is a true copy of the thesis, includ...