The health and life science domains are well known for their wealth of named entities found in large free text corpora, such as scientific literature and electronic health records. To unlock the value of such corpora, named entity recognition (NER) methods are proposed. Inspired by the success of transformer-based pretrained models for NER, we assess how individual and ensemble of deep masked language models perform across corpora of different health and life science domains—biology, chemistry, and medicine—available in different languages—English and French. Individual deep masked language models, pretrained on external corpora, are fined-tuned on task-specific domain and language corpora and ensembled using classical majority voting strat...
International audienceIn sensitive domains, the sharing of corpora is restricted due to confidential...
Recent improvements in machine-reading technologies attracted much attention to automation problems ...
Recent improvements in machine-reading technologies attracted much attention to automation problems ...
The health and life science domains are well known for their wealth of named entities found in large...
The health and life science domains are well known for their wealth of named entities found in large...
The health and life science domains are well known for their wealth of named entities found in large...
Named Entities (NEs) in biomedical text refer to objects that are of interest to biomedical research...
As vast amounts of unstructured data are becoming available digitally, computer-based methods to ext...
As vast amounts of unstructured data are becoming available digitally, computer-based methods to ext...
International audienceIn sensitive domains, the sharing of corpora is restricted due to confidential...
Abstract. We present a named-entity recognition (NER) system for parallel multilingual text. Our sys...
International audienceIn sensitive domains, the sharing of corpora is restricted due to confidential...
International audienceIn sensitive domains, the sharing of corpora is restricted due to confidential...
International audienceIn sensitive domains, the sharing of corpora is restricted due to confidential...
International audienceIn sensitive domains, the sharing of corpora is restricted due to confidential...
International audienceIn sensitive domains, the sharing of corpora is restricted due to confidential...
Recent improvements in machine-reading technologies attracted much attention to automation problems ...
Recent improvements in machine-reading technologies attracted much attention to automation problems ...
The health and life science domains are well known for their wealth of named entities found in large...
The health and life science domains are well known for their wealth of named entities found in large...
The health and life science domains are well known for their wealth of named entities found in large...
Named Entities (NEs) in biomedical text refer to objects that are of interest to biomedical research...
As vast amounts of unstructured data are becoming available digitally, computer-based methods to ext...
As vast amounts of unstructured data are becoming available digitally, computer-based methods to ext...
International audienceIn sensitive domains, the sharing of corpora is restricted due to confidential...
Abstract. We present a named-entity recognition (NER) system for parallel multilingual text. Our sys...
International audienceIn sensitive domains, the sharing of corpora is restricted due to confidential...
International audienceIn sensitive domains, the sharing of corpora is restricted due to confidential...
International audienceIn sensitive domains, the sharing of corpora is restricted due to confidential...
International audienceIn sensitive domains, the sharing of corpora is restricted due to confidential...
International audienceIn sensitive domains, the sharing of corpora is restricted due to confidential...
Recent improvements in machine-reading technologies attracted much attention to automation problems ...
Recent improvements in machine-reading technologies attracted much attention to automation problems ...