. Efficient data mining algorithms are crucial for effective knowledge discovery. We present the Multi-Stream Dependency Detection (msdd) data mining algorithm that performs a systematic search for structure in multivariate time series of categorical data. The systematicity of msdd's search makes implementation of both parallel and distributed versions straightforward. Distributing the search for structure over multiple processors or networked machines makes mining of large numbers of databases or very large databases feasible. We present results showing that msdd efficiently finds complex structure in multivariate time series, and that the distributed version finds the same structure in approximately 1=n of the time required by msdd...
Efficiency is crucial in KDD (Knowledge Discovery in Databases), due to the huge amount of data stor...
Advances in hardware and software technology enable us to collect, store and distribute large quanti...
The efficient mining of large, commercially credible, databases requires a solution to at least two ...
The Multi-Stream Dependency Detection algorithm finds rules that capture statistical dependencies be...
Finding structure in multiple streams of data is an important problem. Consider the streams of data ...
Learning complex dependencies from time series data is an important task; dependencies can be used t...
rithm finds rules that capture statistical depen-dencies between patterns in multivariate time serie...
Abstract Recent years have shown the need of an automated process to discover interesting and hidden...
Abstract. Sequential pattern mining is an active field in the domain of knowledge discovery and has ...
International audienceThe problem of extracting functional dependencies (FDs) from databases has a l...
Abstract—We present a new approach related to the discovery of correlated patterns based on the use ...
This book presents recent advances in Knowledge discovery in databases (KDD) with a focus on the are...
Abstract The data storage paradigm has changed in the last decade, from operational databases to dat...
Much of current data mining research is focused on discovering sets of attributes that discriminate ...
With the advances in information technology, voluminous data in such domains as the internet, market...
Efficiency is crucial in KDD (Knowledge Discovery in Databases), due to the huge amount of data stor...
Advances in hardware and software technology enable us to collect, store and distribute large quanti...
The efficient mining of large, commercially credible, databases requires a solution to at least two ...
The Multi-Stream Dependency Detection algorithm finds rules that capture statistical dependencies be...
Finding structure in multiple streams of data is an important problem. Consider the streams of data ...
Learning complex dependencies from time series data is an important task; dependencies can be used t...
rithm finds rules that capture statistical depen-dencies between patterns in multivariate time serie...
Abstract Recent years have shown the need of an automated process to discover interesting and hidden...
Abstract. Sequential pattern mining is an active field in the domain of knowledge discovery and has ...
International audienceThe problem of extracting functional dependencies (FDs) from databases has a l...
Abstract—We present a new approach related to the discovery of correlated patterns based on the use ...
This book presents recent advances in Knowledge discovery in databases (KDD) with a focus on the are...
Abstract The data storage paradigm has changed in the last decade, from operational databases to dat...
Much of current data mining research is focused on discovering sets of attributes that discriminate ...
With the advances in information technology, voluminous data in such domains as the internet, market...
Efficiency is crucial in KDD (Knowledge Discovery in Databases), due to the huge amount of data stor...
Advances in hardware and software technology enable us to collect, store and distribute large quanti...
The efficient mining of large, commercially credible, databases requires a solution to at least two ...