Substantial progress has been made in the field of natural language processing (NLP) due to the advent of large language models (LLMs)—deep neural networks with millions or billions of parameters pre-trained on large amounts of unlabeled data. However, these models have common weaknesses, including degenerate performance in data-scarce scenarios, and substantial computational resource requirements. This thesis aims to develop methods to address these limitations for improved applicability and performance of LLMs in resource-constrained settings with limited data and/or computational resources. To address the need for labeled data in data-scarce scenarios, I present two methods, in Chapter 2 and Chapter 3, respectively. The first method leve...
Supervised deep learning-based approaches have been applied to task-oriented dialog and have proven ...
Neural machine translation (NMT), where neural networks are used to generate translations, has revol...
In the natural language processing (NLP) literature, neural networks are becoming increasingly deepe...
Large-scale annotated datasets are an indispensable ingredient of modern Natural Language Processing...
Title: Exploring Benefits of Transfer Learning in Neural Machine Translation Author: Tom Kocmi Depar...
The current generation of neural network-based natural language processing models excels at learning...
Transfer learning, where a model is first pre-trained on a data-rich task before being fine-tuned on...
Machine learning models cannot easily adapt to new domains and applications. This drawback becomes d...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Deep learning has fundamentally changed the landscape of natural language processing (NLP). The suc...
© Dr Long DuongNatural language processing (NLP) aims, broadly speaking, to teach computers to under...
NLP technologies are uneven for the world's languages as the state-of-the-art models are only availa...
Natural language processing (NLP) has come of age. For example, semantic role labeling (SRL), which ...
With the advent of deep neural networks in recent years, Neural Machine Translation (NMT) systems ha...
This report documents the program and the outcomes of Dagstuhl Seminar 22232 “Efficient and Equitabl...
Supervised deep learning-based approaches have been applied to task-oriented dialog and have proven ...
Neural machine translation (NMT), where neural networks are used to generate translations, has revol...
In the natural language processing (NLP) literature, neural networks are becoming increasingly deepe...
Large-scale annotated datasets are an indispensable ingredient of modern Natural Language Processing...
Title: Exploring Benefits of Transfer Learning in Neural Machine Translation Author: Tom Kocmi Depar...
The current generation of neural network-based natural language processing models excels at learning...
Transfer learning, where a model is first pre-trained on a data-rich task before being fine-tuned on...
Machine learning models cannot easily adapt to new domains and applications. This drawback becomes d...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Deep learning has fundamentally changed the landscape of natural language processing (NLP). The suc...
© Dr Long DuongNatural language processing (NLP) aims, broadly speaking, to teach computers to under...
NLP technologies are uneven for the world's languages as the state-of-the-art models are only availa...
Natural language processing (NLP) has come of age. For example, semantic role labeling (SRL), which ...
With the advent of deep neural networks in recent years, Neural Machine Translation (NMT) systems ha...
This report documents the program and the outcomes of Dagstuhl Seminar 22232 “Efficient and Equitabl...
Supervised deep learning-based approaches have been applied to task-oriented dialog and have proven ...
Neural machine translation (NMT), where neural networks are used to generate translations, has revol...
In the natural language processing (NLP) literature, neural networks are becoming increasingly deepe...