International audienceActive learning for semi-supervised clustering allows algorithms to solicit a domain expert to provide side information as instances constraints, for example a set of labeled instances called seeds. The problem consists in selecting the queries to the expert that are likely to improve either the relevance or the quality of the proposed clustering. However, these active methods suffer from several limitations: (i) they are generally tailored for only one specific clustering paradigm or cluster shape and size, (ii) they may be counter-productive if the seeds are not selected in an appropriate manner and, (iii) they have to work efficiently with minimal expert supervision. In this paper, we propose a new active seed selec...
This article proposes a constrained clustering algorithm with competitive performance and less compu...
Constrained clustering is intended to improve accuracy and personalization based on the constraints ...
Clustering is inherently ill-posed: there often exist multiple valid clusterings of a single dataset...
International audienceIn this paper we address the problem of active query selection for clustering ...
Due to strong demand for the ability to enforce top-down struc-ture on clustering results, semi-supe...
National audienceThe success of machine learning approaches to solving real-world problems motivated...
One of the key tools to gain knowledge from data is clustering: identifying groups of instances that...
Semi-supervised clustering methods incorporate a limited amount of supervision into the clustering p...
In many machine learning domains (e.g. text processing, bioinformatics), there is a large supply of ...
Clustering is the task of finding groups of elements that are highly similar. It is one of the key t...
International audienceClustering is an unsupervised process which aims to discover regularities and ...
Abstract—Semi-supervised clustering seeks to augment traditional clustering methods by incorporating...
Constraint-based clustering leverages user-provided constraints to produce a clustering that matches...
We consider the problem of clustering n items into K disjoint clusters using noisy answers from crow...
The paper presents the new approach to the semi-supervised fuzzy clustering, which is based on the ...
This article proposes a constrained clustering algorithm with competitive performance and less compu...
Constrained clustering is intended to improve accuracy and personalization based on the constraints ...
Clustering is inherently ill-posed: there often exist multiple valid clusterings of a single dataset...
International audienceIn this paper we address the problem of active query selection for clustering ...
Due to strong demand for the ability to enforce top-down struc-ture on clustering results, semi-supe...
National audienceThe success of machine learning approaches to solving real-world problems motivated...
One of the key tools to gain knowledge from data is clustering: identifying groups of instances that...
Semi-supervised clustering methods incorporate a limited amount of supervision into the clustering p...
In many machine learning domains (e.g. text processing, bioinformatics), there is a large supply of ...
Clustering is the task of finding groups of elements that are highly similar. It is one of the key t...
International audienceClustering is an unsupervised process which aims to discover regularities and ...
Abstract—Semi-supervised clustering seeks to augment traditional clustering methods by incorporating...
Constraint-based clustering leverages user-provided constraints to produce a clustering that matches...
We consider the problem of clustering n items into K disjoint clusters using noisy answers from crow...
The paper presents the new approach to the semi-supervised fuzzy clustering, which is based on the ...
This article proposes a constrained clustering algorithm with competitive performance and less compu...
Constrained clustering is intended to improve accuracy and personalization based on the constraints ...
Clustering is inherently ill-posed: there often exist multiple valid clusterings of a single dataset...