The co-occurrence of terms in a text corpus may indicate the presence of a relation between the referents of these terms. We expect co-occurrence-based methods to identify association relations that cannot be found using static patterns. We developed a new method to identify associations between ontological categories in text using the co-occurrence of terms that designate these categories. We use the taxonomic structure of the ontologies to cumulate the number of co-occurrences of terms designating categories. Based on these cumulated values, we designed a novel family of statistical tests to identify associated categories. These tests take both co-occurrence specificity and relevance into consideration. We applied our method to a 2.2 GB t...
The Gene Ontology (GO) has become the internationally accepted standard for representing function, p...
With more and more genomes being sequenced, a lot of effort is devoted to their annotation with term...
With more and more genomes being sequenced, a lot of effort is devoted to their annotation with term...
The co-occurrence of terms in a text corpus may indicate the presence of a relation between the refe...
The co-occurrence of terms in a text corpus may indicate the presence of a relation between the refe...
The co-occurrence of terms in a text corpus may indicate the presence of a relation between the refe...
Abstract It is a challenging task to match similar or related terms/expressions in NLP and Text Mini...
Abstract Background Gene Ontology (GO) has been widely used in biological databases, annotation proj...
Abstract Background The Gene Ontology (GO) is a community-based bioinformatics resource that employs...
We used exact term matching, stemming, and inclusion of synonyms, implemented via the Lucene informa...
The Gene Ontology (GO) is a controlled vocabulary widely used for the annotation of gene products. G...
Semantic relatedness is a measure that quantifies the strength of a semantic link between two concep...
Lexical co-occurrence is an important cue for detecting word associations. We present a theoretical ...
The Gene Ontology (GO) is a controlled vocabulary widely used for the annotation of gene products. G...
A method for computing associations between nodes in a topic taxonomy and content items in a corpus ...
The Gene Ontology (GO) has become the internationally accepted standard for representing function, p...
With more and more genomes being sequenced, a lot of effort is devoted to their annotation with term...
With more and more genomes being sequenced, a lot of effort is devoted to their annotation with term...
The co-occurrence of terms in a text corpus may indicate the presence of a relation between the refe...
The co-occurrence of terms in a text corpus may indicate the presence of a relation between the refe...
The co-occurrence of terms in a text corpus may indicate the presence of a relation between the refe...
Abstract It is a challenging task to match similar or related terms/expressions in NLP and Text Mini...
Abstract Background Gene Ontology (GO) has been widely used in biological databases, annotation proj...
Abstract Background The Gene Ontology (GO) is a community-based bioinformatics resource that employs...
We used exact term matching, stemming, and inclusion of synonyms, implemented via the Lucene informa...
The Gene Ontology (GO) is a controlled vocabulary widely used for the annotation of gene products. G...
Semantic relatedness is a measure that quantifies the strength of a semantic link between two concep...
Lexical co-occurrence is an important cue for detecting word associations. We present a theoretical ...
The Gene Ontology (GO) is a controlled vocabulary widely used for the annotation of gene products. G...
A method for computing associations between nodes in a topic taxonomy and content items in a corpus ...
The Gene Ontology (GO) has become the internationally accepted standard for representing function, p...
With more and more genomes being sequenced, a lot of effort is devoted to their annotation with term...
With more and more genomes being sequenced, a lot of effort is devoted to their annotation with term...