The field of low-density NLP is often approached from an engineering perspective, and evaluations are typically haphazard - considering different architectures, given different languages, and different available resources - without a systematic comparison. The resulting architectures are then tested on the unique corpus and language for which this approach has been designed. This makes it difficult to truly evaluate which approach is truly the best, or which approaches are best for a given language. In this dissertation, several state-of-the-art architectures and approaches to low-density language Part-Of-Speech Tagging are reimplemented; all of these techniques exploit a relationship between a high-density (HD) language and a low-density...
Pretrained multilingual language models have become a common tool in transferring NLP capabilities t...
Language resources can be divided into structural resources treating phonology, morphosyntax, semant...
The languages that are most commonly subject to linguistic annotation on a large scale tend to be th...
The field of low-density NLP is often approached from an engineering perspective, and evaluations ar...
© Dr Long DuongNatural language processing (NLP) aims, broadly speaking, to teach computers to under...
International audienceWe propose a novel approach to cross-lingual part-of-speech tagging and depend...
Cross-lingual word embeddings are an increasingly important reseource in cross-lingual methods for N...
Jiawei ZhaoCurrent machine translation techniques were developed using predominantly rich resource l...
State-of-the-art approaches to most Natural Language Processing (NLP) tasks have achieved near huma...
International audienceWe present a survey covering the state of the art in low-resource machine tran...
We provide a systematic review of past studies that use multilingual data for text-to-speech (TTS) o...
Thesis (Ph.D.)--University of Washington, 2022Modern NLP systems have been highly successful at a wi...
As the Internet and World Wide Web have continued to gain widespread adoption, the linguistic divers...
Thesis (Master's)--University of Washington, 2014Dependency parsing is an important natural language...
Low density languages are typically viewed as those for which few language resources are available. ...
Pretrained multilingual language models have become a common tool in transferring NLP capabilities t...
Language resources can be divided into structural resources treating phonology, morphosyntax, semant...
The languages that are most commonly subject to linguistic annotation on a large scale tend to be th...
The field of low-density NLP is often approached from an engineering perspective, and evaluations ar...
© Dr Long DuongNatural language processing (NLP) aims, broadly speaking, to teach computers to under...
International audienceWe propose a novel approach to cross-lingual part-of-speech tagging and depend...
Cross-lingual word embeddings are an increasingly important reseource in cross-lingual methods for N...
Jiawei ZhaoCurrent machine translation techniques were developed using predominantly rich resource l...
State-of-the-art approaches to most Natural Language Processing (NLP) tasks have achieved near huma...
International audienceWe present a survey covering the state of the art in low-resource machine tran...
We provide a systematic review of past studies that use multilingual data for text-to-speech (TTS) o...
Thesis (Ph.D.)--University of Washington, 2022Modern NLP systems have been highly successful at a wi...
As the Internet and World Wide Web have continued to gain widespread adoption, the linguistic divers...
Thesis (Master's)--University of Washington, 2014Dependency parsing is an important natural language...
Low density languages are typically viewed as those for which few language resources are available. ...
Pretrained multilingual language models have become a common tool in transferring NLP capabilities t...
Language resources can be divided into structural resources treating phonology, morphosyntax, semant...
The languages that are most commonly subject to linguistic annotation on a large scale tend to be th...