Clustering homologous sequences based on their similarity is a problem that appears in many bioinformatics applications. The fact that sequences cluster is ultimately the result of their phylogenetic relationships. Despite this observation and the natural ways in which a tree can define clusters, most applications of sequence clustering do not use a phylogenetic tree and instead operate on pairwise sequence distances. Due to advances in large-scale phylogenetic inference, we argue that tree-based clustering is under-utilized. We define a family of optimization problems that, given an arbitrary tree, return the minimum number of clusters such that all clusters adhere to constraints on their heterogeneity. We study three specific constraints,...
Phylogenetics is one of the dominant data engineering research disciplines based on biological infor...
Cluster analysis or clustering is an important data mining technique widely used for pattern recogni...
MotivationClustering is a fundamental task in the analysis of nucleotide sequences. Despite the expo...
Clustering homologous sequences based on their similarity is a problem that appears in many bioinfor...
Clustering homologous sequences based on their similarity is a problem that appears in many bioinfor...
This paper explores clustering algorithms to construct a phylogenetic tree, based on distance measur...
AbstractA phylogenetic tree or an evolutionary tree is a graph that shows the evolutionary relations...
Comparing two or more phylogenetic trees is a fundamental task in computational biology. The simples...
ABSTRACT. Inferential summaries of tree estimates are useful in the setting of evolutionary biology,...
In the context of phylogenetic tree reconstruction, divisive clustering methods can be used to infer...
Abstract Neighbor-joining is a well-established hierarchical clustering algorithm for inferring phyl...
Abstract Neighbor-joining is a well-established hierarchical clustering algorithm for inferring phyl...
Clustering and organizing molecular sequences is one of the central tasks in Bioinformatics. It is a...
Clustering and organizing molecular sequences is one of the central tasks in Bioinformatics. It is a...
Phylogenetics is one of the dominant data engineering research disciplines based on biological infor...
Phylogenetics is one of the dominant data engineering research disciplines based on biological infor...
Cluster analysis or clustering is an important data mining technique widely used for pattern recogni...
MotivationClustering is a fundamental task in the analysis of nucleotide sequences. Despite the expo...
Clustering homologous sequences based on their similarity is a problem that appears in many bioinfor...
Clustering homologous sequences based on their similarity is a problem that appears in many bioinfor...
This paper explores clustering algorithms to construct a phylogenetic tree, based on distance measur...
AbstractA phylogenetic tree or an evolutionary tree is a graph that shows the evolutionary relations...
Comparing two or more phylogenetic trees is a fundamental task in computational biology. The simples...
ABSTRACT. Inferential summaries of tree estimates are useful in the setting of evolutionary biology,...
In the context of phylogenetic tree reconstruction, divisive clustering methods can be used to infer...
Abstract Neighbor-joining is a well-established hierarchical clustering algorithm for inferring phyl...
Abstract Neighbor-joining is a well-established hierarchical clustering algorithm for inferring phyl...
Clustering and organizing molecular sequences is one of the central tasks in Bioinformatics. It is a...
Clustering and organizing molecular sequences is one of the central tasks in Bioinformatics. It is a...
Phylogenetics is one of the dominant data engineering research disciplines based on biological infor...
Phylogenetics is one of the dominant data engineering research disciplines based on biological infor...
Cluster analysis or clustering is an important data mining technique widely used for pattern recogni...
MotivationClustering is a fundamental task in the analysis of nucleotide sequences. Despite the expo...