Social scientists spend considerable energy constructing typologies and discussing their roles in mea-surement. Less discussed is the role of typologies in evaluating and revising theoretical arguments. We argue that unsupervised machine learning tools can be profitably applied to the development and testing of theory-based typologies. We review recent advances in mixture models as applied to cluster analysis and argue that these tools are particularly important in the social sciences where it is common to claim that high-dimensional objects group together in meaningful clusters. Model-based clustering (MBC) grounds analysis in probability theory, permitting the evaluation of uncertainty and application of information-based model selection ...
Abstract. This talk is an attempt at structuring and systematising the develop-ment of clustering as...
Variable selection is an important problem for cluster analysis of high-dimensional data. It is also...
Clustering is a widely used statistical tool to determine subsets in a given data set. Frequently us...
. Data with mixed-type (metricordinalnominal) variables are typical for social stratification, i.e. ...
Discussion on "Data with mixed‐type (metric–ordinal–nominal) variables are typical for social strati...
This thesis applies model-based cluster analysis to data concerning types of democracies, creating a...
Clustering seeks to group or to lump together objects or variables that share some observed qualitie...
Variable selection for cluster analysis is a difficult problem. The difficulty originates not only f...
In clustering, one may be interested in the classification of similar objects into groups, and one m...
There are many algorithms to cluster sample data points based on nearness or a similar-ity measure. ...
Unsupervised learning is widely recognized as one of the most important challenges facing machine le...
Thesis (Ph.D.)--University of Rochester. School of Medicine & Dentistry. Dept. of Biostatistics & Co...
Cluster analysis seeks to identify homogeneous subgroups of cases in a population. This article prov...
The use of typologies for measurement has received considerable attention. Less discussed is their r...
Cluster analysis comprises a range of methods for classifying multivariate data into subgroups. By o...
Abstract. This talk is an attempt at structuring and systematising the develop-ment of clustering as...
Variable selection is an important problem for cluster analysis of high-dimensional data. It is also...
Clustering is a widely used statistical tool to determine subsets in a given data set. Frequently us...
. Data with mixed-type (metricordinalnominal) variables are typical for social stratification, i.e. ...
Discussion on "Data with mixed‐type (metric–ordinal–nominal) variables are typical for social strati...
This thesis applies model-based cluster analysis to data concerning types of democracies, creating a...
Clustering seeks to group or to lump together objects or variables that share some observed qualitie...
Variable selection for cluster analysis is a difficult problem. The difficulty originates not only f...
In clustering, one may be interested in the classification of similar objects into groups, and one m...
There are many algorithms to cluster sample data points based on nearness or a similar-ity measure. ...
Unsupervised learning is widely recognized as one of the most important challenges facing machine le...
Thesis (Ph.D.)--University of Rochester. School of Medicine & Dentistry. Dept. of Biostatistics & Co...
Cluster analysis seeks to identify homogeneous subgroups of cases in a population. This article prov...
The use of typologies for measurement has received considerable attention. Less discussed is their r...
Cluster analysis comprises a range of methods for classifying multivariate data into subgroups. By o...
Abstract. This talk is an attempt at structuring and systematising the develop-ment of clustering as...
Variable selection is an important problem for cluster analysis of high-dimensional data. It is also...
Clustering is a widely used statistical tool to determine subsets in a given data set. Frequently us...