This is an Open Access article published under a Creative Commons Attribution 3.0 Unported (CC BY 3.0) license https://creativecommons.org/licenses/by/3.0/Cluster analysis faces two problems in high dimensions: the "curse of dimensionality" that can lead to overfitting and poor generalization performance and the sheer time taken for conventional algorithms to process large amounts of high-dimensional data. We describe a solution to these problems, designed for the application of spike sorting for nextgeneration, high-channel-count neural probes. In this problem, only a small subset of features provides information about the cluster membership of any one data vector, but this informative feature subset is not the same for all data points, re...
Abstract — In machine learning, feature selection is preprocessing step and can be effectively reduc...
International audienceClustering in high-dimensional spaces is a recurrent problem in many domains, ...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
This is an Open Access article published under a Creative Commons Attribution 3.0 Unported (CC BY 3....
Cluster analysis faces two problems in high dimensions: first, the “curse of di-mensionality ” that ...
A critical problem faced in many scientific fields is the adequate separation of data derived from i...
Existing clustering algorithms have difficulty finding the correct locations of potential clusters i...
Data mining is one of the long known research topics, which is making a comeback especially with the...
A data set may contain of one or more 'clouds' of data objects. The task for cluster analysis is, to...
More and more data are produced every day. Some clustering techniques have been developed to automat...
We present a nonparametric method for selecting informative features in high-dimensional clustering ...
We propose a new Gaussian clustering method named EM-FDA for feature extraction in high dimensional ...
International audienceClustering in high-dimensional spaces is a difficult problem which is recurren...
Machine learning is used nowadays to build models for classification and regression tasks, among oth...
Background: High-dimensional biomedical data are frequently clustered to identify subgroup structure...
Abstract — In machine learning, feature selection is preprocessing step and can be effectively reduc...
International audienceClustering in high-dimensional spaces is a recurrent problem in many domains, ...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
This is an Open Access article published under a Creative Commons Attribution 3.0 Unported (CC BY 3....
Cluster analysis faces two problems in high dimensions: first, the “curse of di-mensionality ” that ...
A critical problem faced in many scientific fields is the adequate separation of data derived from i...
Existing clustering algorithms have difficulty finding the correct locations of potential clusters i...
Data mining is one of the long known research topics, which is making a comeback especially with the...
A data set may contain of one or more 'clouds' of data objects. The task for cluster analysis is, to...
More and more data are produced every day. Some clustering techniques have been developed to automat...
We present a nonparametric method for selecting informative features in high-dimensional clustering ...
We propose a new Gaussian clustering method named EM-FDA for feature extraction in high dimensional ...
International audienceClustering in high-dimensional spaces is a difficult problem which is recurren...
Machine learning is used nowadays to build models for classification and regression tasks, among oth...
Background: High-dimensional biomedical data are frequently clustered to identify subgroup structure...
Abstract — In machine learning, feature selection is preprocessing step and can be effectively reduc...
International audienceClustering in high-dimensional spaces is a recurrent problem in many domains, ...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...