This paper describes a methodology for discovering and resolving protein names abbreviations from the full-text versions of scientific articles, implemented in the PRAISED framework with the ultimate purpose of building up a publicly available abbreviation repository. Three processing steps lie at the core of the framework: i) an abbreviation identification phase, carried out via domain-independent metrics, whose purpose is to identify all possible abbreviations within a scientific text; ii) an abbreviation resolution phase, which takes into account a number of syntactical and semantic criteria in order to match an abbreviation with its potential explanation; and iii) a dictionary-based protein name identification, which is meant to select ...
Background: Significant parts of biological knowledge are available only as unstructured text in art...
Motivation: Acronyms result from a highly productive type of term variation and trigger the need for...
Background: Identification of gene and protein names in biomedical text is a challenging task as the...
This paper describes a methodology for discovering and resolving protein names abbreviations from th...
Abstract Background The exploding growth of the biomedical literature presents many challenges for b...
A prerequisite for all higher level information extraction tasks is the identification of unknown na...
Motivation: Biological literature contains many abbreviati-ons with one particular sense in each doc...
The explosion of biomedical literature and with it the-uncontrolled- creation of abbreviations prese...
Genes and proteins are often associated with multiple names, and more names are added as new functio...
Genes and proteins are often associated with multiple names, and more names are added as new functio...
AbstractMotivation. Natural language processing (NLP) techniques are used to extract information aut...
We present a system to identify abbreviation expansion pairs from scientific articles. We work with ...
With the increasing amount of biomedical literature, there is a need for automatic extraction of inf...
Automatically extracting information from biomedical text holds the promise of easily consolidating ...
Whereas many applications of natural language processing for molecular biology focus on protein name...
Background: Significant parts of biological knowledge are available only as unstructured text in art...
Motivation: Acronyms result from a highly productive type of term variation and trigger the need for...
Background: Identification of gene and protein names in biomedical text is a challenging task as the...
This paper describes a methodology for discovering and resolving protein names abbreviations from th...
Abstract Background The exploding growth of the biomedical literature presents many challenges for b...
A prerequisite for all higher level information extraction tasks is the identification of unknown na...
Motivation: Biological literature contains many abbreviati-ons with one particular sense in each doc...
The explosion of biomedical literature and with it the-uncontrolled- creation of abbreviations prese...
Genes and proteins are often associated with multiple names, and more names are added as new functio...
Genes and proteins are often associated with multiple names, and more names are added as new functio...
AbstractMotivation. Natural language processing (NLP) techniques are used to extract information aut...
We present a system to identify abbreviation expansion pairs from scientific articles. We work with ...
With the increasing amount of biomedical literature, there is a need for automatic extraction of inf...
Automatically extracting information from biomedical text holds the promise of easily consolidating ...
Whereas many applications of natural language processing for molecular biology focus on protein name...
Background: Significant parts of biological knowledge are available only as unstructured text in art...
Motivation: Acronyms result from a highly productive type of term variation and trigger the need for...
Background: Identification of gene and protein names in biomedical text is a challenging task as the...