Denoising is the essential step for distant supervision based named entity recognition. Previous denoising methods are mostly based on instance-level confidence statistics, which ignore the variety of the underlying noise distribution on different datasets and entity types. This makes them difficult to be adapted to high noise rate settings. In this paper, we propose Hypergeometric Learning (HGL), a denoising algorithm for distantly supervised NER that takes both noise distribution and instance-level confidence into consideration. Specifically, during neural network training, we naturally model the noise samples in each batch following a hypergeometric distribution parameterized by the noise-rate. Then each instance in the batch is regarded...
This thesis studies the effect of adding a term usually neglected during the training phase of ener...
A small portion of mislabeled data can easily limit the performance of deep neural networks (DNNs) d...
We devise a new regularization for denoising with self-supervised learning. The regularization uses ...
Named entity recognition plays an important role in extracting valuable information from digital lib...
Existing named entity recognition (NER) systems rely on large amounts of human-labeled data for supe...
Noise is inherent in real world datasets and modeling noise is critical during training, as it is ef...
Distantly-Supervised Named Entity Recognition (DS-NER) effectively alleviates the data scarcity prob...
Recent approaches based on artificial neural networks (ANNs) have shown promising results for named-...
Distantly-Supervised Named Entity Recognition (DS-NER) effectively alleviates the data scarcity prob...
The remarkable success of deep learning is largely attributed to the collection of large datasets wi...
Deep learning attempts medical image denoising either by directly learning the noise present or via ...
The drastic increase of data quantity often brings the severe decrease of data quality, such as inco...
Many application domains, spanning from low-level computer vision to medical imaging, require high-f...
Deep Belief Networks which are hierarchical generative models are effective tools for feature repres...
Deep neural networks are susceptible to label noise. Existing methods to improve robustness, such as...
This thesis studies the effect of adding a term usually neglected during the training phase of ener...
A small portion of mislabeled data can easily limit the performance of deep neural networks (DNNs) d...
We devise a new regularization for denoising with self-supervised learning. The regularization uses ...
Named entity recognition plays an important role in extracting valuable information from digital lib...
Existing named entity recognition (NER) systems rely on large amounts of human-labeled data for supe...
Noise is inherent in real world datasets and modeling noise is critical during training, as it is ef...
Distantly-Supervised Named Entity Recognition (DS-NER) effectively alleviates the data scarcity prob...
Recent approaches based on artificial neural networks (ANNs) have shown promising results for named-...
Distantly-Supervised Named Entity Recognition (DS-NER) effectively alleviates the data scarcity prob...
The remarkable success of deep learning is largely attributed to the collection of large datasets wi...
Deep learning attempts medical image denoising either by directly learning the noise present or via ...
The drastic increase of data quantity often brings the severe decrease of data quality, such as inco...
Many application domains, spanning from low-level computer vision to medical imaging, require high-f...
Deep Belief Networks which are hierarchical generative models are effective tools for feature repres...
Deep neural networks are susceptible to label noise. Existing methods to improve robustness, such as...
This thesis studies the effect of adding a term usually neglected during the training phase of ener...
A small portion of mislabeled data can easily limit the performance of deep neural networks (DNNs) d...
We devise a new regularization for denoising with self-supervised learning. The regularization uses ...