We studied contrast and variability in a corpus of gene names to identify potential heuristics for use in performing entity identification in the molecular biology domain. Based on our findings, we developed heuristics for mapping weakly matching gene names to their official gene names. We then tested these heuristics against a large body of Medline abstracts, and found that using these heuristics can increase recall, with varying levels of precision. Our findings also underscored the importance of good information retrieval and of the ability to disambiguate between genes, proteins, RNA, and a variety of other referents for performing entity identification with high precision.
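As an illustration of how weakly matching names might be mapped to official gene names, the following is a minimal sketch and not the heuristics developed in this work. It assumes that differences in case, whitespace, hyphenation, and parentheses do not distinguish one gene from another. The function names and the three-entry lexicon are hypothetical.

```python
import re


def normalize_gene_name(name: str) -> str:
    """Collapse surface variation assumed (for this sketch) not to be contrastive."""
    key = name.lower()                      # ignore case differences
    key = re.sub(r"[()\[\]]", "", key)      # drop parentheses and brackets
    key = re.sub(r"[\s\-_/]+", "", key)     # drop spaces, hyphens, underscores, slashes
    return key


def build_index(official_names):
    """Map each normalized key to the set of official names that share it."""
    index = {}
    for official in official_names:
        index.setdefault(normalize_gene_name(official), set()).add(official)
    return index


def map_to_official(mention, index):
    """Return all official names that weakly match a mention (possibly several)."""
    return sorted(index.get(normalize_gene_name(mention), set()))


# Hypothetical lexicon of official names, for illustration only.
official = ["IL-2", "NF-kappa B", "BRCA1"]
index = build_index(official)

print(map_to_official("il 2", index))
print(map_to_official("NF kappaB", index))
print(map_to_official("p53", index))
```

Returning a list rather than a single name makes the trade-off visible: looser normalization lets more mentions match (higher recall), but it can also collapse distinct official names onto one key, which is one way precision may drop.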
Named entity recognition of gene terms plays a big role in the increasing challenge of extracting g...
Researchers tend to use their own or favourite gene names in scientific literature, even though ther...
This paper presents work on a method to detect names of proteins in running text. The detection and ...
The recognition and normalization of gene mentions in biomedical literature are crucial steps in bio...
The task of recognising biomedical named entities in natural language documents called biomedical Na...
Abstract: Named entity (NE) recognition is a fundamental task in biological relationship mining. This ...
Background Retrieving pertinent information from biological scientific literature requires cutting-...
Background: Identification of gene and protein names in biomedical text is a challenging task as the...
Linking gene and protein names mentioned in the literature to unique identifiers in referent genomic...
Background: Good automatic information extraction tools offer hope for automatic processing of the e...
In this paper we discuss the performance of a text-based classification approach by comparing differ...