We aim at improving spoken language modeling (LM) using a very large amount of automatically transcribed speech. We leverage the INA (French National Audiovisual Institute) collection and obtain 19GB of text after applying ASR to 350,000 hours of diverse TV shows. From this, spoken language models are trained either by fine-tuning an existing LM (FlauBERT) or by training an LM from scratch. The new models (FlauBERT-Oral) are shared with the community and evaluated on three downstream tasks: spoken language understanding, classification of TV shows, and speech syntactic parsing. Results show that FlauBERT-Oral can be beneficial compared to its initial FlauBERT version, demonstrating that, despite its inherent noisy nature,...
Unsupervised acoustic modeling can offer a cost and time effective way of creating a solid acoustic ...
Language models have become a key step to achieve state-of-the-art results in ...
Spoken language translation (SLT) exists within one of the most challenging intersections of speech ...
We aim at improving spoken language modeling (LM) using very large amount of a...
This paper aims at improving spoken language modeling (LM) using very large a...
We aim at improving spoken language modeling (LM) using very large amount of automatically transcrib...
This paper reports on the participation of FBK at the IWSLT 2011 Evaluation: namely in the English A...
In the spoken language translation pipeline, machine translation systems that are trained solely on ...
ition (ASR): precisely on both English and German ASR track. Only primary submissions have been sent...
This paper introduces a new corpus of read English speech, suitable for training and evaluating spee...
Language Models (LMs) represent a crucial component in the architecture of Automatic Speech Recognit...
Luxembourgish is embedded in a multilingual context on the divide between Roma...
Recently, there has been a rapidly increasing interest in using ASR for children’s language learning...
Introduction & Motivation Language Models (LMs) represent a crucial component ...
Some practical uses of ASR have been implemented, including the transcription of meetings and the us...