Continuously identifying pre-defined patterns in a streaming time series has strong demand in various applications. While most existing works assume the patterns are in equal length and tolerance, this work focuses on the problem where the patterns have various lengths and tolerances, a common situation in the real world. The challenge of this problem roots on the strict space and time requirements of processing the arriving and expiring data in high-speed stream, combined with difficulty of coping with a large number of patterns with various lengths and tolerances. We introduce a novel concept of converging envelope which bounds the tolerance of a group of patterns in various tolerances and equal length and thus dramatically reduces the nu...
In many time series data mining problems, the analysis can be reduced to frequent pattern mining. Sp...
Many classical algorithms for string processing assume that the input can be accessed in full via co...
We introduce a method to discover optimal local patterns, which concisely describe the main trends i...
We investigate the problem of deterministic pattern matching in multiple streams. In this model, one...
We introduce and study the problem of computing the simi- larity self-join in a streaming context (s...
Frequent pattern discovery over data stream is a hard problem because a continuously generated natur...
Similarity-based time-series retrieval has been a subject of long-term study due to its wide usage i...
Abstract. Discovering frequent patterns over event sequences is an important data mining problem. Ex...
Nowadays online monitoring of data streams is essential in many real life applications, like sensor ...
Similarity-based time series retrieval has been a subject of long term study due to its wide usage i...
This thesis is concerned with the study of problems related to the measurement of disorder in the da...
International audienceIn recent years, emerging applications introduced new constraints for data min...
Although frequent-pattern mining has been widely studied and used, it is challenging to extend it to...
In this dissertation, we present algorithms that approximate properties in the data stream model, wh...
Traditional data mining techniques expect all data to be managed within some form of persistent data...
In many time series data mining problems, the analysis can be reduced to frequent pattern mining. Sp...
Many classical algorithms for string processing assume that the input can be accessed in full via co...
We introduce a method to discover optimal local patterns, which concisely describe the main trends i...
We investigate the problem of deterministic pattern matching in multiple streams. In this model, one...
We introduce and study the problem of computing the simi- larity self-join in a streaming context (s...
Frequent pattern discovery over data stream is a hard problem because a continuously generated natur...
Similarity-based time-series retrieval has been a subject of long-term study due to its wide usage i...
Abstract. Discovering frequent patterns over event sequences is an important data mining problem. Ex...
Nowadays online monitoring of data streams is essential in many real life applications, like sensor ...
Similarity-based time series retrieval has been a subject of long term study due to its wide usage i...
This thesis is concerned with the study of problems related to the measurement of disorder in the da...
International audienceIn recent years, emerging applications introduced new constraints for data min...
Although frequent-pattern mining has been widely studied and used, it is challenging to extend it to...
In this dissertation, we present algorithms that approximate properties in the data stream model, wh...
Traditional data mining techniques expect all data to be managed within some form of persistent data...
In many time series data mining problems, the analysis can be reduced to frequent pattern mining. Sp...
Many classical algorithms for string processing assume that the input can be accessed in full via co...
We introduce a method to discover optimal local patterns, which concisely describe the main trends i...