Language models have become a key step to achieve state-of-the-art results in many NLP tasks. Leveraging the huge amount of unlabeled text available, they provide an efficient way to pretrain continuous word representations that can be fine-tuned for downstream tasks, along with their contextualization at the sentence level. This has been widely demonstrated for English. In this paper, we introduce and share FlauBERT, a model learned on a very large and heterogeneous French corpus. We train models of different sizes using the new CNRS Jean Zay supercomputer. We apply our French language models to several NLP tasks (text classification, paraphrasing, natural language inference, parsing, word sense disambiguation) and show that they often out...
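The abstract above describes the usual pretrain-then-fine-tune workflow for FlauBERT. As a purely illustrative sketch (not part of the paper), the snippet below loads a pretrained FlauBERT checkpoint through the HuggingFace transformers library and extracts contextual token representations; the checkpoint id flaubert/flaubert_base_cased and the library calls are assumptions on our side, not details taken from the abstract.

```python
# Illustrative sketch only: extracting contextual representations from a
# pretrained FlauBERT checkpoint with HuggingFace transformers.
# The checkpoint id "flaubert/flaubert_base_cased" is an assumption.
import torch
from transformers import FlaubertModel, FlaubertTokenizer

model_name = "flaubert/flaubert_base_cased"
tokenizer = FlaubertTokenizer.from_pretrained(model_name)
model = FlaubertModel.from_pretrained(model_name)
model.eval()

sentence = "Le modèle de langue apprend des représentations contextuelles."
inputs = tokenizer(sentence, return_tensors="pt")

with torch.no_grad():
    outputs = model(**inputs)

# One contextual vector per sub-word token: (batch, sequence_length, hidden_size).
token_embeddings = outputs.last_hidden_state
print(token_embeddings.shape)
```

For the downstream tasks listed in the abstract, the same checkpoint would typically be wrapped in a task-specific head (e.g. sequence classification) and fine-tuned end to end.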
This paper aims at improving spoken language modeling (LM) using very large a...
Pre-trained Language Models such as BERT have become ubiquitous in NLP, where they have ...
Distributed word representations are widely used in many tasks in natural l...
Web site: https://camembert-model.fr. Pretrained language models are now ubiquitous in Natural Languag...
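For illustration only, a pretrained French model such as CamemBERT can be queried directly through the HuggingFace fill-mask pipeline; the checkpoint id camembert-base and the example sentence below are assumptions, not content from the abstract.

```python
# Illustrative sketch only: masked-word prediction with a pretrained CamemBERT
# checkpoint via the HuggingFace fill-mask pipeline ("camembert-base" is assumed).
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="camembert-base")

# CamemBERT uses "<mask>" as its mask token.
for prediction in fill_mask("Le camembert est un fromage <mask>."):
    print(prediction["token_str"], round(prediction["score"], 3))
```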
Recent advances in NLP have significantly improved the performance of language models on a ...
In recent years, neural methods for Natural Language Processing (NLP) have consistently and repeated...
The successes of contextual word embeddings learned by training large-scale la...
Access to large pre-trained models of varied architectures, in many different languages, is central ...
Each language is made up of its own words. In most cases, these are polysemous: they have several mea...
We aim at improving spoken language modeling (LM) using very large amounts of automatically transcrib...
The purpose of language models is, in general, to capture and model the regularities of language, there...
Deep learning models like BERT, a stack of attention layers with an unsupervis...
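To make the "unsupervised objective" mentioned above concrete, here is a minimal sketch of masked-token prediction with a generic BERT checkpoint from HuggingFace transformers; the checkpoint id bert-base-multilingual-cased and the example sentence are assumptions, not details from the abstract.

```python
# Illustrative sketch only: the masked language modeling objective behind BERT,
# shown at inference time with a generic checkpoint ("bert-base-multilingual-cased"
# is an assumed choice; any BERT-style model works the same way).
import torch
from transformers import BertForMaskedLM, BertTokenizer

name = "bert-base-multilingual-cased"
tokenizer = BertTokenizer.from_pretrained(name)
model = BertForMaskedLM.from_pretrained(name)
model.eval()

inputs = tokenizer("Paris is the [MASK] of France.", return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits  # (batch, sequence_length, vocab_size)

# Locate the masked position and read off the most probable vocabulary item;
# during pretraining, the cross-entropy at such positions is the training signal.
mask_positions = (inputs["input_ids"] == tokenizer.mask_token_id).nonzero()
seq_index = mask_positions[0, 1]
predicted_id = int(logits[0, seq_index].argmax(-1))
print(tokenizer.decode([predicted_id]))
```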
Old French parsing: Which language properties have the greatest influence on ...