The goal of this thesis is to improve the feasibility of building applied NLP systems for more diverse and niche real-world use-cases of extracting structured information from text. A core factor in determining this feasibility is the cost of manually annotating enough unbiased labeled data to achieve a desired level of system accuracy, and our goal is to reduce this cost. We focus on reducing this cost by making contributions in two directions: (1) easing the annotation burden by leveraging high-level expert knowledge in addition to labeled examples, thus making approaches more annotation-efficient; and (2) mitigating known biases in cheaper, imperfectly labeled real-world datasets so that we may use them to our advantage. A central theme ...
Many NLP applications require manual text annotations for a variety of tasks, notably to train class...
In recent decades, the society depends more and more on computers for a large number of tasks. The f...
Many NLP applications require manual text annotations for a variety of tasks, notably to train class...
Natural Language Processing (NLP) is a sub-field of Artificial Intelligence and Linguistics, with th...
Crowdsourcing platforms are often used to collect datasets for training machine learning models, des...
The best performing NLP models to date are learned from large volumes of manually-annotated data. Fo...
Deep Learning, a growing sub-field of machine learning, has been applied with tremendous success in ...
Deep Learning, a growing sub-field of machine learning, has been applied with tremendous success in ...
Labeled data is crucial for the success of machine learning-based artificial intelligence. However, ...
In Natural Language Processing (NLP), applications trained on downstream tasks for text classificati...
Deep learning has achieved state-of-the-art performance on a wide range of tasks, including natural ...
© Dr Long DuongNatural language processing (NLP) aims, broadly speaking, to teach computers to under...
The best performing NLP models to date are learned from large volumes of manually-annotated data. Fo...
The best performing NLP models to date are learned from large volumes of manually-annotated data. Fo...
Reference texts such as encyclopedias and news articles can manifest biased language when objective ...
Many NLP applications require manual text annotations for a variety of tasks, notably to train class...
In recent decades, the society depends more and more on computers for a large number of tasks. The f...
Many NLP applications require manual text annotations for a variety of tasks, notably to train class...
Natural Language Processing (NLP) is a sub-field of Artificial Intelligence and Linguistics, with th...
Crowdsourcing platforms are often used to collect datasets for training machine learning models, des...
The best performing NLP models to date are learned from large volumes of manually-annotated data. Fo...
Deep Learning, a growing sub-field of machine learning, has been applied with tremendous success in ...
Deep Learning, a growing sub-field of machine learning, has been applied with tremendous success in ...
Labeled data is crucial for the success of machine learning-based artificial intelligence. However, ...
In Natural Language Processing (NLP), applications trained on downstream tasks for text classificati...
Deep learning has achieved state-of-the-art performance on a wide range of tasks, including natural ...
© Dr Long DuongNatural language processing (NLP) aims, broadly speaking, to teach computers to under...
The best performing NLP models to date are learned from large volumes of manually-annotated data. Fo...
The best performing NLP models to date are learned from large volumes of manually-annotated data. Fo...
Reference texts such as encyclopedias and news articles can manifest biased language when objective ...
Many NLP applications require manual text annotations for a variety of tasks, notably to train class...
In recent decades, the society depends more and more on computers for a large number of tasks. The f...
Many NLP applications require manual text annotations for a variety of tasks, notably to train class...