Data clustering is a well-known task in data mining and it often relies on distances or, in some cases, similarity measures. The latter is indeed the case for real world datasets that comprise categorical attributes. Several similarity measures have been proposed in the literature, however, their choice depends on the context and the dataset at hand. In this paper, we address the following question: given a set of measures, which one is best suited for clustering a particular dataset? We propose an approach to automate this choice, and we present an empirical study based on categorical datasets, on which we evaluate our proposed approach
Methods of data analysis and automatic processing are treated as knowledge discovery. In many cases ...
Abstract. Clustering data in Euclidean space has a long tradition and there has been considerable at...
Clustering is an unsupervised learning technique which aims at grouping a set of objects into cluste...
Data clustering is a well-known task in data mining and it often relies on distances or, in some cas...
International audienceData clustering is a well-known task in data mining and it often relies on dis...
Grouping objects that are described by attributes, or clustering is a central notion in data mining....
Abstract. The concept of similarity is fundamentally important in al-most every scientific field. Cl...
Clustering over categorical attributes is an important yet tough task. In this paper, we present a n...
This paper introduces a measure of similarity between two clusterings of the same dataset produced b...
Grouping objects that are described by attributes, or clustering is a central notion in data mining....
The data clustering, an unsupervised pattern recognition process is the task of assigning a set of o...
International audienceIn many domains, we face heterogeneous data with both numeric and categorical ...
Methods of data analysis and automatic processing are treated as knowledge discovery. In many cases ...
The development of analysis methods for categorical data begun in 90's decade, and it has been boomi...
Methods of data analysis and automatic processing are treated as knowledge discovery. In many cases ...
Methods of data analysis and automatic processing are treated as knowledge discovery. In many cases ...
Abstract. Clustering data in Euclidean space has a long tradition and there has been considerable at...
Clustering is an unsupervised learning technique which aims at grouping a set of objects into cluste...
Data clustering is a well-known task in data mining and it often relies on distances or, in some cas...
International audienceData clustering is a well-known task in data mining and it often relies on dis...
Grouping objects that are described by attributes, or clustering is a central notion in data mining....
Abstract. The concept of similarity is fundamentally important in al-most every scientific field. Cl...
Clustering over categorical attributes is an important yet tough task. In this paper, we present a n...
This paper introduces a measure of similarity between two clusterings of the same dataset produced b...
Grouping objects that are described by attributes, or clustering is a central notion in data mining....
The data clustering, an unsupervised pattern recognition process is the task of assigning a set of o...
International audienceIn many domains, we face heterogeneous data with both numeric and categorical ...
Methods of data analysis and automatic processing are treated as knowledge discovery. In many cases ...
The development of analysis methods for categorical data begun in 90's decade, and it has been boomi...
Methods of data analysis and automatic processing are treated as knowledge discovery. In many cases ...
Methods of data analysis and automatic processing are treated as knowledge discovery. In many cases ...
Abstract. Clustering data in Euclidean space has a long tradition and there has been considerable at...
Clustering is an unsupervised learning technique which aims at grouping a set of objects into cluste...