Mining time series motifs is a fundamental, yet expensive task in exploratory data analytics. In this paper, we therefore propose a fast method to find the top-k motifs with probabilistic guarantees. Our probabilistic approach is based on Locality Sensitive Hashing and allows to prune most of the distance computations, leading to huge speedups. We improve on a straightforward application of LSH to time series data by developing a self-tuning algorithm that adapts to the data distribution. Furthermore, we include several optimizations to the algorithm, reducing redundant computations and leveraging the structure of time series data to speed up LSH computations. We prove the correctness of the algorithm and provide bounds to the cost of the b...
Temporal networks are mathematical tools used to model complex systems which embed the temporal dime...
Copyright © 2013 ACM. Mining probabilistic frequent patterns from uncertain data has received a grea...
Data uncertainty is inherent in many real-world applications such as environmental surveillance and ...
Time series motifs are approximately repeated patterns found within the data. Such motifs have utili...
A motif is a pair of non-overlapping sequences with very similar shapes in a time series. We study t...
Primitives such as motifs, discords, shapelets, etc., are widely used in time series data mining. A ...
Given the ubiquity of time series data in scientific, medical and financial domains, data miners hav...
Data mining and knowledge discovery algorithms for time series data use primitives such as bursts, p...
Efficiently finding similar segments or motifs in time series data is a fundamental task that, due t...
Finding motifs in time-series is proposed to make clustering of time-series subsequences meaningful,...
Time series motifs are approximately repeated subsequences found within a longer time series. They h...
In recent years, time series motif discovery has emerged as perhaps the most important primitive for...
Subsequences-based time series classification algorithms provide interpretable and generally more ac...
In many time series data mining problems, the analysis can be reduced to frequent pattern mining. Sp...
Time series motifs have been in the literature for about fifteen years, but have only recently begun...
Temporal networks are mathematical tools used to model complex systems which embed the temporal dime...
Copyright © 2013 ACM. Mining probabilistic frequent patterns from uncertain data has received a grea...
Data uncertainty is inherent in many real-world applications such as environmental surveillance and ...
Time series motifs are approximately repeated patterns found within the data. Such motifs have utili...
A motif is a pair of non-overlapping sequences with very similar shapes in a time series. We study t...
Primitives such as motifs, discords, shapelets, etc., are widely used in time series data mining. A ...
Given the ubiquity of time series data in scientific, medical and financial domains, data miners hav...
Data mining and knowledge discovery algorithms for time series data use primitives such as bursts, p...
Efficiently finding similar segments or motifs in time series data is a fundamental task that, due t...
Finding motifs in time-series is proposed to make clustering of time-series subsequences meaningful,...
Time series motifs are approximately repeated subsequences found within a longer time series. They h...
In recent years, time series motif discovery has emerged as perhaps the most important primitive for...
Subsequences-based time series classification algorithms provide interpretable and generally more ac...
In many time series data mining problems, the analysis can be reduced to frequent pattern mining. Sp...
Time series motifs have been in the literature for about fifteen years, but have only recently begun...
Temporal networks are mathematical tools used to model complex systems which embed the temporal dime...
Copyright © 2013 ACM. Mining probabilistic frequent patterns from uncertain data has received a grea...
Data uncertainty is inherent in many real-world applications such as environmental surveillance and ...