Several large scale data mining applications, such as text categorization and gene expression analysis, involve high-dimensional data that is also inherently directional in nature. Often such data is L2 normalized so that it lies on the surface of a unit hypersphere. Popular models such as (mixtures of) multi-variate Gaussians are inadequate for characterizing such data. This paper proposes a generative mixture-model approach to clustering directional data based on the von Mises-Fisher (vMF) distribution, which arises naturally for data distributed on the unit hypersphere. In particular, we derive and analyze two variants of the Expectation Maximization (EM) framework for estimating the mean and concentration parameters of this mixture. Num...
Mixtures of von Mises-Fisher distributions have been shown to be an effective model for clustering d...
Finite mixtures of von Mises-Fisher distributions allow to apply model-based clustering methods to d...
In contemporary life directional data are present in most areas, in several forms, aspects and large...
Statistical tools like the finite mixture models and model-based clustering have been used extensive...
Machine learning applications often involve data that can be analyzed as unit vectors on a d-dimensi...
<p>This paper proposes a suite of models for clustering high-dimensional data on a unit sphere based...
International audienceMixtures of von Mises-Fisher distributions can be used to cluster data on the ...
International audienceMixtures of von Mises-Fisher distributions can be used to cluster data on the ...
In this paper, we propose a complete method for clustering data, which are in the form of unit vecto...
International audienceIn this paper, we propose a complete method for clustering data, which are in ...
High-dimensional data is central to most data mining applications, and only recently has it been mod...
Cluster analysis or clustering, which aims to group together similar objects, is undoubtedly a very ...
Structural regularities in man-made environments reflect in the distribution of their surface normal...
Finite mixtures of von Mises-Fisher distributions allow to apply model-based clustering methods to d...
Abstract Finite mixture models are being commonly used in a wide range of ap-plications in practice ...
Mixtures of von Mises-Fisher distributions have been shown to be an effective model for clustering d...
Finite mixtures of von Mises-Fisher distributions allow to apply model-based clustering methods to d...
In contemporary life directional data are present in most areas, in several forms, aspects and large...
Statistical tools like the finite mixture models and model-based clustering have been used extensive...
Machine learning applications often involve data that can be analyzed as unit vectors on a d-dimensi...
<p>This paper proposes a suite of models for clustering high-dimensional data on a unit sphere based...
International audienceMixtures of von Mises-Fisher distributions can be used to cluster data on the ...
International audienceMixtures of von Mises-Fisher distributions can be used to cluster data on the ...
In this paper, we propose a complete method for clustering data, which are in the form of unit vecto...
International audienceIn this paper, we propose a complete method for clustering data, which are in ...
High-dimensional data is central to most data mining applications, and only recently has it been mod...
Cluster analysis or clustering, which aims to group together similar objects, is undoubtedly a very ...
Structural regularities in man-made environments reflect in the distribution of their surface normal...
Finite mixtures of von Mises-Fisher distributions allow to apply model-based clustering methods to d...
Abstract Finite mixture models are being commonly used in a wide range of ap-plications in practice ...
Mixtures of von Mises-Fisher distributions have been shown to be an effective model for clustering d...
Finite mixtures of von Mises-Fisher distributions allow to apply model-based clustering methods to d...
In contemporary life directional data are present in most areas, in several forms, aspects and large...