Language models (LMs) are an essential element in statistical approaches to natural language processing for tasks such as speech recognition and machine translation (MT). The advent of big data has made massive amounts of data available for building LMs; in fact, for the most prominent languages, it is not feasible with current techniques and hardware to train LMs on all the data now available. At the same time, it has been shown that the more data is used for an LM, the better the performance, e.g. for MT, with no indication yet of reaching a plateau. This paper presents CloudLM, an open-source cloud-based LM intended for MT, which allows distributed LMs to be queried. CloudLM relies on Apache Solr and provides the function...
Text corpus size is an important issue when building a language model (LM). This is a particularly i...
2019-02-14 We provide new tools and techniques for improving machine translation for low-resource lan...
This paper describes the team (“Tamalli”)’s submission to AmericasNLP2021 shared task on Open Machin...
N-gram language models are an essential component in statistical natural language processing systems...
Research in speech recognition and machine translation is boosting the use of large scale n-gram lan...
In this paper we share findings from our effort to build practical machine translation (MT) systems ...
In this paper, we present the architecture of a distributed resource repository developed for collec...
Natural language processing of Low-Resource Languages (LRL) is often challenged by the lack of data....
This paper reports on the benefits of large-scale statistical language modeling in machine translatio...
Statistical machine translation, as well as other areas of human language processing, have recentl...
Generative Large Language Models (LLMs) have achieved remarkable advancements in various NLP tasks. ...
Machine translation is the discipline concerned with developing automated tools for translating fro...
Large language models (LLMs) implicitly learn to perform a range of language tasks, including machin...