In this paper we discuss five different corpora annotated for protein names. We present several within- and cross-dataset protein tagging experiments showing that different annotation schemes severely affect the portability of statistical protein taggers. By means of a detailed error analysis we identify crucial annotation issues that future annotation projects should take into careful consideration
Includes bibliographical references (l. 60-61).The number of newly discovered proteins has increased...
Proteins are macromolecules responsible for a wide range of activities in the structure and function...
Databases of protein sequences have grown rapidly in recent years as a result of genome sequencing p...
In this paper we discuss five different corpora annotated for protein names. We present several with...
The research described in this paper addresses the following question: How well do generic protein/g...
Motivation: Annotations are a key feature of many biological databases, used to convey our knowledge...
We explore the sources of incompatibility between the protein annotations made to two corpora: GENIA...
MOTIVATION: The identification of protein and gene names (PGNs) from the scientific literature requi...
atics.oxfordjournals.org/ D ow nloaded from-2- Krebs and Bourne Motivation: Assignment of putative p...
This chapter introduces the use of Text Mining in scientific literature for biological research, wit...
Two factors dominate current developments in structural bioinformatics, especially in protein inform...
Whereas many applications of natural language processing for molecular biology focus on protein name...
Experimentally-verified information on protein function lags far behind the rapid accumulation of pr...
Abstract Background A protein annotation database, such as the Universal Protein Resource knowledge ...
Background: A protein annotation database, such as the Universal Protein Resource (UniProtKB), is a ...
Includes bibliographical references (l. 60-61).The number of newly discovered proteins has increased...
Proteins are macromolecules responsible for a wide range of activities in the structure and function...
Databases of protein sequences have grown rapidly in recent years as a result of genome sequencing p...
In this paper we discuss five different corpora annotated for protein names. We present several with...
The research described in this paper addresses the following question: How well do generic protein/g...
Motivation: Annotations are a key feature of many biological databases, used to convey our knowledge...
We explore the sources of incompatibility between the protein annotations made to two corpora: GENIA...
MOTIVATION: The identification of protein and gene names (PGNs) from the scientific literature requi...
atics.oxfordjournals.org/ D ow nloaded from-2- Krebs and Bourne Motivation: Assignment of putative p...
This chapter introduces the use of Text Mining in scientific literature for biological research, wit...
Two factors dominate current developments in structural bioinformatics, especially in protein inform...
Whereas many applications of natural language processing for molecular biology focus on protein name...
Experimentally-verified information on protein function lags far behind the rapid accumulation of pr...
Abstract Background A protein annotation database, such as the Universal Protein Resource knowledge ...
Background: A protein annotation database, such as the Universal Protein Resource (UniProtKB), is a ...
Includes bibliographical references (l. 60-61).The number of newly discovered proteins has increased...
Proteins are macromolecules responsible for a wide range of activities in the structure and function...
Databases of protein sequences have grown rapidly in recent years as a result of genome sequencing p...