Language Modeling (LM) is a complex task that has been preferably addressed with word level RNNs and, recently, attention-based models. LM at character level has shown inferior performance as it entails more complex training, but it also brings along several desirable conceptual modelling advantages such as no closed-vocabulary limitation and better exploitation of words' similarities. In this work we identify the main causes of the performance gap between word and character level LM and we propose solutions to address them. Our experiments make up considerable ground to word level models; we prove that the input can be processed at character level without losing anything in performance and that processing the output at character level res...
LSTMs and other RNN variants have shown strong performance on character-level language modeling. The...
We present a literature and empirical survey that critically assesses the state of the art in charac...
In recent years neural language models (LMs) have set state-of-the-art performance for several bench...
Neural architectures are prominent in the construction of language models (LMs). However, word-leve...
© 2017 Association for Computational Linguistics. We present a Character-Word Long Short- Term Memor...
We describe a simple neural language model that relies only on character-level inputs. Predictions a...
15 page preprintWhat are the units of text that we want to model? From bytes to multi-word expressio...
Language modeling is a vast sub-field of natural language processing and this work focuses on solvin...
Within a situation where Semi-Supervised Learning (SSL) is available to exploit unlabeled data, this...
Modern language models mostly take sub-words as input, a design that balances the trade-off between ...
We describe a simple neural language model that re-lies only on character-level inputs. Predictions ...
International audienceRecent impressive improvements in NLP, largely based on the success of context...
In recent years neural language models (LMs) have set state-of-the-art performance for several bench...
Language models are crucial for many tasks in NLP and N-grams are the best way to build them. Huge e...
Abstract—In this paper we investigate different n-gram language models that are defined over an open...
LSTMs and other RNN variants have shown strong performance on character-level language modeling. The...
We present a literature and empirical survey that critically assesses the state of the art in charac...
In recent years neural language models (LMs) have set state-of-the-art performance for several bench...
Neural architectures are prominent in the construction of language models (LMs). However, word-leve...
© 2017 Association for Computational Linguistics. We present a Character-Word Long Short- Term Memor...
We describe a simple neural language model that relies only on character-level inputs. Predictions a...
15 page preprintWhat are the units of text that we want to model? From bytes to multi-word expressio...
Language modeling is a vast sub-field of natural language processing and this work focuses on solvin...
Within a situation where Semi-Supervised Learning (SSL) is available to exploit unlabeled data, this...
Modern language models mostly take sub-words as input, a design that balances the trade-off between ...
We describe a simple neural language model that re-lies only on character-level inputs. Predictions ...
International audienceRecent impressive improvements in NLP, largely based on the success of context...
In recent years neural language models (LMs) have set state-of-the-art performance for several bench...
Language models are crucial for many tasks in NLP and N-grams are the best way to build them. Huge e...
Abstract—In this paper we investigate different n-gram language models that are defined over an open...
LSTMs and other RNN variants have shown strong performance on character-level language modeling. The...
We present a literature and empirical survey that critically assesses the state of the art in charac...
In recent years neural language models (LMs) have set state-of-the-art performance for several bench...