Üks huvitav uurimisprobleem andmete analüüsimisel on mustriotsing. Mustrid võivad näidata kuidas andmed on tekkinud ja kuidas ta ennast kordab. Andmete mahu kiire kasvamise tõttu on vajadus algoritmidele, mis skaleeruvad mitmele protsessile. Selles töös me uurime kuidas paralleliseerida olemasolevat algoritmi kasutades kolme ideed: üldistamine, liigendamine ja reifitseerimine. Me rakendame neid ideid SPEXS-il, mustriotsingu algoritm, ning tuletame paralleelse algoritmi SPEXS2, mille me ka implementeerime. Lisaks me uurime probleeme, mis tekkisid selle algoritmi implementeerimisel. Selles töös tutvustatud ideid saab kasutada teiste algoritmide üldistamisel ning paralleliseerimisel.An interesting research problem in dataset analysis is the di...
An important issue in data mining is scalability with respect to the size of the dataset being min...
This paper addresses the discovery of sequential patterns in very large databases. Most of the exist...
The identification of interesting patterns (or subsequences) in biosequences has an important role i...
Dissertação apresentada na Faculdade de Ciências e Tecnologia da Universidade Nova de Lisboa para ob...
The explosive growth of data and the rapid progress of technology have led to a huge amount of data ...
Dans le domaine de l'extraction de motifs, il existe un grand nombre d'algorithmes pour résoudre une...
This study proposes effective methods to learn meaningful representations for complex data such as s...
In this paper, a novel technique for parallelizing data-classification problems is applied to findin...
The thesis is mainly focused on the study and the application of pattern discovery algorithms that a...
This thesis explores detecting patterns in the most general interface ...
No longer the preserve of specialist hardware, parallel devices are now ubiquitous. Pattern-based ...
The traditional frequent pattern mining algorithms generate an exponentially large number of pattern...
制度:新 ; 報告番号:甲3613号 ; 学位の種類:博士(国際情報通信学) ; 授与年月日:2012/2/24 ; 早大学位記番号:新5967textthesi
During the past decade, the degree of parallelism available in hardware has grown quickly and decisi...
The goal of this thesis was to implement a sequential algorithm that would search for subsequences ...
An important issue in data mining is scalability with respect to the size of the dataset being min...
This paper addresses the discovery of sequential patterns in very large databases. Most of the exist...
The identification of interesting patterns (or subsequences) in biosequences has an important role i...
Dissertação apresentada na Faculdade de Ciências e Tecnologia da Universidade Nova de Lisboa para ob...
The explosive growth of data and the rapid progress of technology have led to a huge amount of data ...
Dans le domaine de l'extraction de motifs, il existe un grand nombre d'algorithmes pour résoudre une...
This study proposes effective methods to learn meaningful representations for complex data such as s...
In this paper, a novel technique for parallelizing data-classification problems is applied to findin...
The thesis is mainly focused on the study and the application of pattern discovery algorithms that a...
This thesis explores detecting patterns in the most general interface ...
No longer the preserve of specialist hardware, parallel devices are now ubiquitous. Pattern-based ...
The traditional frequent pattern mining algorithms generate an exponentially large number of pattern...
制度:新 ; 報告番号:甲3613号 ; 学位の種類:博士(国際情報通信学) ; 授与年月日:2012/2/24 ; 早大学位記番号:新5967textthesi
During the past decade, the degree of parallelism available in hardware has grown quickly and decisi...
The goal of this thesis was to implement a sequential algorithm that would search for subsequences ...
An important issue in data mining is scalability with respect to the size of the dataset being min...
This paper addresses the discovery of sequential patterns in very large databases. Most of the exist...
The identification of interesting patterns (or subsequences) in biosequences has an important role i...