Abstract Background Sequence similarity networks are useful for classifying and characterizing biologically important proteins. Threshold-based approaches to similarity network construction using exact distance measures are prohibitively slow to compute and rely on the difficult task of selecting an appropriate threshold, while similarity networks based on approximate distance calculations compromise useful structural information. Results We present an alternative network representation for a set of sequence data that overcomes these drawbacks. In our model, called the Directed Weighted All Nearest Neighbors (DiWANN) network, each sequence is represented by a node and is connected via a directed edge to only the closest sequence, or sequenc...
MOTIVATION: Biological network comparison software largely relies on the concept of alignment where ...
Biologists regularly search databases of DNA or protein sequences for evolutionary or functional rel...
BACKGROUND: The availability of microarrays measuring thousands of genes simultaneously across hundr...
AbstractA set of proteins is a complex system whose elements are interrelated on the concept of sequ...
The deluge of genomic data raises various challenges for computational protein annotation. The defin...
Motivation: A global view of the protein space is essential for functional and evolutionary analysis...
The dramatic increase in heterogeneous types of biological data--in particular, the abundance of new...
The dramatic increase in heterogeneous types of biological data—in particular, the abundance of new ...
International audienceIn the post genomic era, large and complex molecular datasets from genome and ...
In this paper, we address the problem of identifying protein functionality using the information con...
Networks are powerful tools for the presentation and analysis of interactions in multi-component sys...
We address the problem of homology identification in complex multidomain families with varied domain...
In the genomic age, there is so much data that experimental validation on all of it is impractical. ...
BACKGROUND:The sequencing of the human genome has enabled us to access a comprehensive list of genes...
A relation exists between network proximity of molecular entities in interaction networks, functiona...
MOTIVATION: Biological network comparison software largely relies on the concept of alignment where ...
Biologists regularly search databases of DNA or protein sequences for evolutionary or functional rel...
BACKGROUND: The availability of microarrays measuring thousands of genes simultaneously across hundr...
AbstractA set of proteins is a complex system whose elements are interrelated on the concept of sequ...
The deluge of genomic data raises various challenges for computational protein annotation. The defin...
Motivation: A global view of the protein space is essential for functional and evolutionary analysis...
The dramatic increase in heterogeneous types of biological data--in particular, the abundance of new...
The dramatic increase in heterogeneous types of biological data—in particular, the abundance of new ...
International audienceIn the post genomic era, large and complex molecular datasets from genome and ...
In this paper, we address the problem of identifying protein functionality using the information con...
Networks are powerful tools for the presentation and analysis of interactions in multi-component sys...
We address the problem of homology identification in complex multidomain families with varied domain...
In the genomic age, there is so much data that experimental validation on all of it is impractical. ...
BACKGROUND:The sequencing of the human genome has enabled us to access a comprehensive list of genes...
A relation exists between network proximity of molecular entities in interaction networks, functiona...
MOTIVATION: Biological network comparison software largely relies on the concept of alignment where ...
Biologists regularly search databases of DNA or protein sequences for evolutionary or functional rel...
BACKGROUND: The availability of microarrays measuring thousands of genes simultaneously across hundr...