Abstract Background In the context of the BioCreative competition, where training data were very sparse, we investigated two complementary tasks: 1) given a Swiss-Prot triplet, containing a protein, a GO (Gene Ontology) term and a relevant article, extraction of a short passage that justifies the GO category assignement; 2) given a Swiss-Prot pair, containing a protein and a relevant article, automatic assignement of a set of categories. Methods Sentence is the basic retrieval unit. Our classifier computes a distance between each sentence and the GO category provided with the Swiss-Prot entry. The Text Categorizer computes a distance between each GO term and the text of the article. Evaluations are reported both based on annotator judgement...
Gene Ontology is used extensively in scientific knowledgebases and repositories to organize a wealth...
Much has been written recently about the need for effective tools and methods for mining the wealth ...
Background: Manual curation of experimental data from the biomedical literature is an expensive and ...
BACKGROUND: In the context of the BioCreative competition, where training data were very sparse, we ...
Abstract Background The Gene Ontology Annotation (GOA) database http://www.ebi.ac.uk/GOA aims to pro...
The available curated data lag behind current biological knowledge contained in the literature. Text...
This article describes our participation of the Gene Ontology Curation task (GO task) in BioCreative...
BACKGROUND: This paper describes and evaluates a sentence selection engine that extracts a GeneRiF (...
Gene Ontology (GO) annotation is a common task among model organism databases (MODs) for capturing g...
In this chapter, we explain how text mining can support the curation of molecular biology databases ...
Abstract Background We participated in the BioCreAtIvE Task 2, which addressed the annotation of pro...
Information extraction aims to derive from free text signicant information related to a given query ...
Motivation: Searching relevant publications for manual database annotation is a tedious task. In thi...
Automated protein annotation using the Gene Ontology (GO) plays an important role in the biosciences...
Background. Communalities between large sets of genes obtained from high-throughput...
Gene Ontology is used extensively in scientific knowledgebases and repositories to organize a wealth...
Much has been written recently about the need for effective tools and methods for mining the wealth ...
Background: Manual curation of experimental data from the biomedical literature is an expensive and ...
BACKGROUND: In the context of the BioCreative competition, where training data were very sparse, we ...
Abstract Background The Gene Ontology Annotation (GOA) database http://www.ebi.ac.uk/GOA aims to pro...
The available curated data lag behind current biological knowledge contained in the literature. Text...
This article describes our participation of the Gene Ontology Curation task (GO task) in BioCreative...
BACKGROUND: This paper describes and evaluates a sentence selection engine that extracts a GeneRiF (...
Gene Ontology (GO) annotation is a common task among model organism databases (MODs) for capturing g...
In this chapter, we explain how text mining can support the curation of molecular biology databases ...
Abstract Background We participated in the BioCreAtIvE Task 2, which addressed the annotation of pro...
Information extraction aims to derive from free text signicant information related to a given query ...
Motivation: Searching relevant publications for manual database annotation is a tedious task. In thi...
Automated protein annotation using the Gene Ontology (GO) plays an important role in the biosciences...
Background. Communalities between large sets of genes obtained from high-throughput...
Gene Ontology is used extensively in scientific knowledgebases and repositories to organize a wealth...
Much has been written recently about the need for effective tools and methods for mining the wealth ...
Background: Manual curation of experimental data from the biomedical literature is an expensive and ...