Part 5: Classification - ClusteringInternational audienceIn many cases of high dimensional data analysis, data points may lie on manifolds of very complex shapes/geometries. Thus, the usual Euclidean distance may lead to suboptimal results when utilized in clustering or visualization operations. In this work, we introduce a new distance definition in multi-dimensional spaces that preserves the topology of the data point manifold. The parameters of the proposed distance are discussed and their physical meaning is explored through 2 and 3-dimensional synthetic datasets. A robust method for the parameterization of the algorithm is suggested. Finally, a modification of the well-known k-means clustering algorithm is introduced, to exploit the be...
One of the fundamental tasks of unsupervised learning is dataset clustering, to partition the input ...
Cluster analysis comprises of several unsupervised techniques aiming to identify a subgroup (cluster...
Many machine learning algorithms for clustering or dimensionality re-duction take as input a cloud o...
In real-world pattern recognition tasks, the data with multiple manifolds structure is ubiquitous an...
In order to address high dimensional problems, a new ‘direction-aware’ metric is introduced in this ...
Clustering partitions a collection of objects into groups called clusters, such that similar objects...
In this paper we address the problem of high-dimensionality for data that lies on complex manifolds....
There are many distance-based methods for classification and clustering, and for data with a high nu...
Several methods in data and shape analysis can be regarded as transformations between metric spaces....
The distribution of distances between points in a high-dimensional data set tends to look quite diff...
An important research topic of the recent years has been to understand and analyze data collections ...
An important research topic of the recent years has been to understand and analyze manifold-modeled ...
Data clustering algorithms represent mechanisms for partitioning huge arrays of multidimensional dat...
In this paper, the problem of clustering rotationally invariant shapes is studied and a solution usi...
Abstract. In recent years, the eect of the curse of high dimensionality has been studied in great de...
One of the fundamental tasks of unsupervised learning is dataset clustering, to partition the input ...
Cluster analysis comprises of several unsupervised techniques aiming to identify a subgroup (cluster...
Many machine learning algorithms for clustering or dimensionality re-duction take as input a cloud o...
In real-world pattern recognition tasks, the data with multiple manifolds structure is ubiquitous an...
In order to address high dimensional problems, a new ‘direction-aware’ metric is introduced in this ...
Clustering partitions a collection of objects into groups called clusters, such that similar objects...
In this paper we address the problem of high-dimensionality for data that lies on complex manifolds....
There are many distance-based methods for classification and clustering, and for data with a high nu...
Several methods in data and shape analysis can be regarded as transformations between metric spaces....
The distribution of distances between points in a high-dimensional data set tends to look quite diff...
An important research topic of the recent years has been to understand and analyze data collections ...
An important research topic of the recent years has been to understand and analyze manifold-modeled ...
Data clustering algorithms represent mechanisms for partitioning huge arrays of multidimensional dat...
In this paper, the problem of clustering rotationally invariant shapes is studied and a solution usi...
Abstract. In recent years, the eect of the curse of high dimensionality has been studied in great de...
One of the fundamental tasks of unsupervised learning is dataset clustering, to partition the input ...
Cluster analysis comprises of several unsupervised techniques aiming to identify a subgroup (cluster...
Many machine learning algorithms for clustering or dimensionality re-duction take as input a cloud o...