The dissertation deals with clustering algorithms and transforming regression prob-lems into classification problems. The main contributions of the dissertation are twofold; first, to improve (speed up) the clustering algorithms and second, to develop a strict learn-ing environment for solving regression problems as classification tasks by using support vector machines (SVMs). An extension to the most popular unsupervised clustering meth-od, k-means algorithm, is proposed, dubbed k-means2 (k-means squared) algorithm, appli-cable to ultra large datasets. The main idea is based on using a small portion of the dataset in the first stage of the clustering. Thus, the centers of such a smaller dataset are computed much faster than if computing th...
Data mining is essentially the discovery of valuable information and patterns from huge chunks of av...
Abstract: Clustering is one of the fastest growing research areas because of availability of huge am...
The objective of this project is two-fold: the first one is to perform pre-processing on high-dim...
Working with huge amount of data and learning from it by extracting useful information is one of the...
Advances in technology have provided industry with an array of devices for collecting data. The freq...
Abstract Cluster analysis has become one of the main tools used in extracting knowledge from data, w...
Advances in recent techniques for scientific data collection in the era of big data allow for the sy...
An SVM-based clustering algorithm is introduced that clusters data with no a priori knowledge of in...
Data clustering techniques are valuable tools for researchers working with large databases of multiv...
Advances in technology have provided industry with an array of devices for collecting data. The freq...
The research discusses Computational Data Analysis Classification, a short summary of the classifica...
The data mining is the technique to analyze the complex data. The prediction analysis is the techniq...
Clustering is one of the most important research areas in the field of data mining. In simple words,...
Abstract — Data mining is the process used to analyze a large quantity of heterogeneous data to extr...
International audienceWe propose new parallel learning algorithms of local support vector machines (...
Data mining is essentially the discovery of valuable information and patterns from huge chunks of av...
Abstract: Clustering is one of the fastest growing research areas because of availability of huge am...
The objective of this project is two-fold: the first one is to perform pre-processing on high-dim...
Working with huge amount of data and learning from it by extracting useful information is one of the...
Advances in technology have provided industry with an array of devices for collecting data. The freq...
Abstract Cluster analysis has become one of the main tools used in extracting knowledge from data, w...
Advances in recent techniques for scientific data collection in the era of big data allow for the sy...
An SVM-based clustering algorithm is introduced that clusters data with no a priori knowledge of in...
Data clustering techniques are valuable tools for researchers working with large databases of multiv...
Advances in technology have provided industry with an array of devices for collecting data. The freq...
The research discusses Computational Data Analysis Classification, a short summary of the classifica...
The data mining is the technique to analyze the complex data. The prediction analysis is the techniq...
Clustering is one of the most important research areas in the field of data mining. In simple words,...
Abstract — Data mining is the process used to analyze a large quantity of heterogeneous data to extr...
International audienceWe propose new parallel learning algorithms of local support vector machines (...
Data mining is essentially the discovery of valuable information and patterns from huge chunks of av...
Abstract: Clustering is one of the fastest growing research areas because of availability of huge am...
The objective of this project is two-fold: the first one is to perform pre-processing on high-dim...