In this paper, we explore the discriminating subsequence-based clustering problem. First, several effective optimiza-tion techniques are proposed to accelerate the sequence min-ing process and a new algorithm, CONTOUR, is developed to efficiently and directly mine a subset of discriminating frequent subsequences which can be used to cluster the in-put sequences. Second, an accurate hierarchical clustering algorithm, SSC, is constructed based on the result of CON-TOUR. The performance study evaluates the efficiency and scalability of CONTOUR, and the clustering quality of SSC
Data Clustering is defined as grouping together objects which share similar properties. These proper...
Sequence decomposition into a set of consecutive, distinct subsequences is crucial for symbolic sequ...
Data mining applications place special requirements on clus-tering algorithms including: the ability...
We present a fast algorithm for sequence clustering and searching which works with large sequence da...
In recent years, we have seen an enormous growth in the amount of available commercial and scientifi...
Clustering is the division of data into groups of similar objects. The main objective of this unsupe...
We study the problem of pattern-based subspace clustering. Unlike traditional clustering methods tha...
Clustering of subsequence time series remains an open issue in time series clustering. Subsequence t...
The problem of sequence clustering is one of the fundamental research topics. However, most algorith...
In this thesis, we study scalable and general purpose methods for mining frequent sequences that sat...
In this paper, an efficient hierarchical clustering algorithm, suitable for large data sets is propo...
<div><p>The rapid development of sequencing technology has led to an explosive accumulation of genom...
The rapid development of sequencing technology has led to an explosive accumulation of genomic seque...
Frequent sequence mining methods often make use of constraints to control which subsequences should ...
We study scalable algorithms for frequent sequence mining under flexible subsequence constraints. Su...
Data Clustering is defined as grouping together objects which share similar properties. These proper...
Sequence decomposition into a set of consecutive, distinct subsequences is crucial for symbolic sequ...
Data mining applications place special requirements on clus-tering algorithms including: the ability...
We present a fast algorithm for sequence clustering and searching which works with large sequence da...
In recent years, we have seen an enormous growth in the amount of available commercial and scientifi...
Clustering is the division of data into groups of similar objects. The main objective of this unsupe...
We study the problem of pattern-based subspace clustering. Unlike traditional clustering methods tha...
Clustering of subsequence time series remains an open issue in time series clustering. Subsequence t...
The problem of sequence clustering is one of the fundamental research topics. However, most algorith...
In this thesis, we study scalable and general purpose methods for mining frequent sequences that sat...
In this paper, an efficient hierarchical clustering algorithm, suitable for large data sets is propo...
<div><p>The rapid development of sequencing technology has led to an explosive accumulation of genom...
The rapid development of sequencing technology has led to an explosive accumulation of genomic seque...
Frequent sequence mining methods often make use of constraints to control which subsequences should ...
We study scalable algorithms for frequent sequence mining under flexible subsequence constraints. Su...
Data Clustering is defined as grouping together objects which share similar properties. These proper...
Sequence decomposition into a set of consecutive, distinct subsequences is crucial for symbolic sequ...
Data mining applications place special requirements on clus-tering algorithms including: the ability...