In order to solve the problem that principal component analysis (PCA) algorithm can??t deal with the reduction of clustering accuracy after high dimensional data reduction, a new attribute space concept is proposed. Based on the combination of attribute space and information entropy, the dimensionality reduction standard based on feature similarity is constructed. A new dimensionality reduction algorithm (entropy-PCA, EN-PCA) is proposed. Aiming at the problem that the post-dimension feature is a linear combination of original features, which leads to poor interpretability and inflexible input, a sparse principal component algorithm based on ridge regression (ESPCA) is proposed. The input of ESPCA algorithm is the PCA dimension reduction re...
We consider a clustering problem where we observe feature vectors Xi ∈ Rp, i = 1, 2,..., n, from K p...
Classification problem especially for high dimensional datasets have attracted many researchers in o...
Big databases are increasingly widespread and are therefore hard to understand, in exploratory biome...
Dimensionality reduction techniques are used to reduce the complexity for analysis of high dimension...
In line with the technological developments, the current data tends to be multidimensional and high ...
With the incredible growth of high dimensional data such as microarray gene expression data, the res...
k-means algorithm is a popular data clustering algorithm. k-means clustering aims to partition n obs...
Principal component analysis (PCA) is a commonly used method for feature extraction and dimensionali...
International audienceMining useful clusters from high dimensional data has received significant att...
When the data vectors are high-dimensional it is computationally infeasible to use data analysis or ...
Clustering as unsupervised learning method is the mission of dividing data objects into clusters wit...
The analysis of high-dimensional data often begins with the identification of lower dimensional subs...
Various dimensionality reduction (DR) schemes have been developed for projecting high-dimensional da...
Abstract- In this paper, a novel simple dimension reduction technique for classification is proposed...
Information explosion has occurred in most of the sciences and researches due to advances in data co...
We consider a clustering problem where we observe feature vectors Xi ∈ Rp, i = 1, 2,..., n, from K p...
Classification problem especially for high dimensional datasets have attracted many researchers in o...
Big databases are increasingly widespread and are therefore hard to understand, in exploratory biome...
Dimensionality reduction techniques are used to reduce the complexity for analysis of high dimension...
In line with the technological developments, the current data tends to be multidimensional and high ...
With the incredible growth of high dimensional data such as microarray gene expression data, the res...
k-means algorithm is a popular data clustering algorithm. k-means clustering aims to partition n obs...
Principal component analysis (PCA) is a commonly used method for feature extraction and dimensionali...
International audienceMining useful clusters from high dimensional data has received significant att...
When the data vectors are high-dimensional it is computationally infeasible to use data analysis or ...
Clustering as unsupervised learning method is the mission of dividing data objects into clusters wit...
The analysis of high-dimensional data often begins with the identification of lower dimensional subs...
Various dimensionality reduction (DR) schemes have been developed for projecting high-dimensional da...
Abstract- In this paper, a novel simple dimension reduction technique for classification is proposed...
Information explosion has occurred in most of the sciences and researches due to advances in data co...
We consider a clustering problem where we observe feature vectors Xi ∈ Rp, i = 1, 2,..., n, from K p...
Classification problem especially for high dimensional datasets have attracted many researchers in o...
Big databases are increasingly widespread and are therefore hard to understand, in exploratory biome...