We discuss topological aspects of cluster analysis and show that inferring the topological structure of a dataset before clustering it can considerably enhance cluster detection: theoretical arguments and empirical evidence show that clustering embedding vectors, representing the structure of a data manifold instead of the observed feature vectors themselves, is highly beneficial. To demonstrate, we combine manifold learning method UMAP for inferring the topological structure with density-based clustering method DBSCAN. Synthetic and real data results show that this both simplifies and improves clustering in a diverse set of low- and high-dimensional problems including clusters of varying density and/or entangled shapes. Our approach simpli...
High-dimensional data is increasingly becoming common because of its rich information content that c...
Abstract. Subspace clustering (also called projected clustering) addresses the problem that differen...
Adaptive Resonance Theory (ART) is considered as an effective approach for realizing continual learn...
This work studies the application of topological analysis to non-linear manifold clustering. A novel...
International audienceThe exponential growth of data generates terabytes of very large databases. Th...
International audienceBackgroundThis paper exploits recent developments in topological data analysis...
One of the fundamental tasks of unsupervised learning is dataset clustering, to partition the input ...
Clustering is one of the most used data mining techniques, while computational topology is a very re...
In real-world pattern recognition tasks, the data with multiple manifolds structure is ubiquitous an...
An important research topic of the recent years has been to understand and analyze manifold-modeled ...
To classify objects based on their features and characteristics is one of the most important and pri...
This thesis is about visualizing a kind of data that is trivial to process by computers but difficul...
Abstract. Manifold clustering, which regards clusters as groups of points around compact manifolds, ...
Part 5: Classification - ClusteringInternational audienceIn many cases of high dimensional data anal...
The growing neural gas (GNG) is an unsupervised topology learning algorithm that models a data space...
High-dimensional data is increasingly becoming common because of its rich information content that c...
Abstract. Subspace clustering (also called projected clustering) addresses the problem that differen...
Adaptive Resonance Theory (ART) is considered as an effective approach for realizing continual learn...
This work studies the application of topological analysis to non-linear manifold clustering. A novel...
International audienceThe exponential growth of data generates terabytes of very large databases. Th...
International audienceBackgroundThis paper exploits recent developments in topological data analysis...
One of the fundamental tasks of unsupervised learning is dataset clustering, to partition the input ...
Clustering is one of the most used data mining techniques, while computational topology is a very re...
In real-world pattern recognition tasks, the data with multiple manifolds structure is ubiquitous an...
An important research topic of the recent years has been to understand and analyze manifold-modeled ...
To classify objects based on their features and characteristics is one of the most important and pri...
This thesis is about visualizing a kind of data that is trivial to process by computers but difficul...
Abstract. Manifold clustering, which regards clusters as groups of points around compact manifolds, ...
Part 5: Classification - ClusteringInternational audienceIn many cases of high dimensional data anal...
The growing neural gas (GNG) is an unsupervised topology learning algorithm that models a data space...
High-dimensional data is increasingly becoming common because of its rich information content that c...
Abstract. Subspace clustering (also called projected clustering) addresses the problem that differen...
Adaptive Resonance Theory (ART) is considered as an effective approach for realizing continual learn...