Given a set X of ``empirical" points, whose coordinates are perturbed by errors, we analyze whether it contains redundant information, that is whether some of its elements could be represented by a single equivalent point. If this is the case, the empirical information associated to X could be described by fewer points, chosen in a suitable way. We present two different methods to reduce the cardinality of X which compute a new set of points equivalent to the original one, that is representing the same empirical information. Though our algorithms use basic notions of Cluster Analysis they are specifically designed for ``thinning out" redundant data. We include some experimental results which illustrate the practical effectiveness of our m...
Since the vehicle program in the automotive industry gets more and more extensive, the costs related...
Problems in data analysis often require the unsupervised partitioning of a data set into clusters. M...
Robust methods are needed to fit regression lines when outliers are present. In a clustering framewo...
Real life datasets used in marketing studies contain a lot of redundant features which may prevent d...
A clustering is an implicit assignment of labels of points, based on proximity to other points. It i...
We derive a new clustering algorithm based on information theory and statistical mechanics, which is...
Cluster Analysis aims at finding subsets (clusters) of a given set of entities, which are homogeneou...
The general area of this research is data clustering, in which an unsupervised classification proces...
International audienceMultiblock component methods are applied to data sets for which several blocks...
We propose a novel method for clustering data which is grounded in information-theoretic prin-ciples...
There are many algorithms to cluster sample data points based on nearness or a similar-ity measure. ...
Since the vehicle program in the automotive industry gets more and more extensive, the costs related...
Clustering is part of data mining where data mining is a process in which it is used to analyze data...
Clustering seeks to group or to lump together objects or variables that share some observed qualitie...
The aim of this article is to propose a procedure to cluster functional observations in a subspace ...
Since the vehicle program in the automotive industry gets more and more extensive, the costs related...
Problems in data analysis often require the unsupervised partitioning of a data set into clusters. M...
Robust methods are needed to fit regression lines when outliers are present. In a clustering framewo...
Real life datasets used in marketing studies contain a lot of redundant features which may prevent d...
A clustering is an implicit assignment of labels of points, based on proximity to other points. It i...
We derive a new clustering algorithm based on information theory and statistical mechanics, which is...
Cluster Analysis aims at finding subsets (clusters) of a given set of entities, which are homogeneou...
The general area of this research is data clustering, in which an unsupervised classification proces...
International audienceMultiblock component methods are applied to data sets for which several blocks...
We propose a novel method for clustering data which is grounded in information-theoretic prin-ciples...
There are many algorithms to cluster sample data points based on nearness or a similar-ity measure. ...
Since the vehicle program in the automotive industry gets more and more extensive, the costs related...
Clustering is part of data mining where data mining is a process in which it is used to analyze data...
Clustering seeks to group or to lump together objects or variables that share some observed qualitie...
The aim of this article is to propose a procedure to cluster functional observations in a subspace ...
Since the vehicle program in the automotive industry gets more and more extensive, the costs related...
Problems in data analysis often require the unsupervised partitioning of a data set into clusters. M...
Robust methods are needed to fit regression lines when outliers are present. In a clustering framewo...