<p>The results have been sorted according to the term length (x = 1 to 89) and the frequencies are presented in logarithmic scale (y = 0 to 6.0). After sorting, the results for the terms have been grouped into bins where each bin represents terms of a given length +/−1. For GP7 the overall occurrence is given, for the other resources the numbers indicate how many occurrences of a GP7 term contain a term of the alternative resource, e.g. ChEBI. A large portion of GP7 terms do contain ChEBI terms, and - to a lower rate - a disease or a species term. It is obvious that longer terms are more likely to be composed of terms of a different semantic type. According to the annotation guidelines, species terms should not be part of the PGN.</p
Formulaic sequences in language use are often studied by means of the automatic identification of fr...
Corpus-level term statistics are valuable for numerous text analysis activities, such as term weight...
(A) Pangenome sizes as a function of the number of genomes analyzed for the BSI (912 strains) and co...
<p>The results have been sorted according to the term length and are presented in logarithmic scale ...
<p>The table shows the distribution of terms from LexEBI sorted according to the resource that deliv...
MOTIVATION: Biomedical entities, their identifiers and names, are essential in the representation of...
Motivation: Biomedical entities, their identifiers and names, are essential in the representation of...
Biomedical entities, their identifiers and names, are essential in the representation of biomedical ...
We present here an approach and algorithm for mining gen-eralized term associations. The problem is ...
<p>The abbreviations extracted from Medline have been attributed to a reference terminological resou...
<p>The content of the mentioned five resources, i.e. Enzymes, Interpro, Jochem, ChEBI and Species, a...
MOTIVATION: The identification of protein and gene names (PGNs) from the scientific literature requi...
<p>The terms from LexEBI have been cross-compared for the identification of nested terms. The figure...
Terminologies and other knowledge resources are widely used to aid entity recognition in specialist ...
<p>The reference data resource (“tagged term”) is either GP6 or GP7 and the alternative data resourc...
Formulaic sequences in language use are often studied by means of the automatic identification of fr...
Corpus-level term statistics are valuable for numerous text analysis activities, such as term weight...
(A) Pangenome sizes as a function of the number of genomes analyzed for the BSI (912 strains) and co...
<p>The results have been sorted according to the term length and are presented in logarithmic scale ...
<p>The table shows the distribution of terms from LexEBI sorted according to the resource that deliv...
MOTIVATION: Biomedical entities, their identifiers and names, are essential in the representation of...
Motivation: Biomedical entities, their identifiers and names, are essential in the representation of...
Biomedical entities, their identifiers and names, are essential in the representation of biomedical ...
We present here an approach and algorithm for mining gen-eralized term associations. The problem is ...
<p>The abbreviations extracted from Medline have been attributed to a reference terminological resou...
<p>The content of the mentioned five resources, i.e. Enzymes, Interpro, Jochem, ChEBI and Species, a...
MOTIVATION: The identification of protein and gene names (PGNs) from the scientific literature requi...
<p>The terms from LexEBI have been cross-compared for the identification of nested terms. The figure...
Terminologies and other knowledge resources are widely used to aid entity recognition in specialist ...
<p>The reference data resource (“tagged term”) is either GP6 or GP7 and the alternative data resourc...
Formulaic sequences in language use are often studied by means of the automatic identification of fr...
Corpus-level term statistics are valuable for numerous text analysis activities, such as term weight...
(A) Pangenome sizes as a function of the number of genomes analyzed for the BSI (912 strains) and co...