This manuscript is concerned with relating two approaches that can be used to explore complex dependence structures between categorical variables, namely Bayesian partitioning of the covariate space incorporating a variable selection procedure that highlights the covariates that drive the clustering, and log-linear modelling with interaction terms. We derive theoretical results on this relation and discuss if they can be employed to assist log-linear model determination, demonstrating advantages and limitations with simulated and real data sets. The main advantage concerns sparse contingency tables. Inferences from clustering can potentially reduce the number of covariates considered and, subsequently, the number of competing log-linear mod...
This thesis is concerned with the analysis of cross-classified categorical data from complex sample ...
Traditional clustering methods focus on grouping subjects or (dependent) variables assuming independ...
International audienceIn model-based clustering, each cluster is modelled by a parametrised probabil...
AbstractThis manuscript is concerned with relating two approaches that can be used to explore comple...
This work was supported by MRC grant G1002319.This manuscript is concerned with relating two approac...
In several social and biomedical investigations the collected data can be classified into several ca...
We deal with two-way contingency tables having ordered column categories.We use a row effects model ...
Statistical analysis of data sets of high-dimensionality has met great interest over the past years,...
Generalized linear and additive models are very efficient regression tools but the selection of rele...
The currently available variable selection procedures in model-based clustering assume that the irre...
In multi-dimensional contingency tables sparse data occur frequently. For example, with bi-nary como...
Standard regression analyses are often plagued with problems encountered when one tries to make infe...
Log-linear models are useful for analyzing cross-classifications of counts arising in sociology, but...
Categorical data in contingency tables are collected in many investigations. In order to underst and...
Standard regression analyses are often plagued with problems encountered when one tries to make mean...
This thesis is concerned with the analysis of cross-classified categorical data from complex sample ...
Traditional clustering methods focus on grouping subjects or (dependent) variables assuming independ...
International audienceIn model-based clustering, each cluster is modelled by a parametrised probabil...
AbstractThis manuscript is concerned with relating two approaches that can be used to explore comple...
This work was supported by MRC grant G1002319.This manuscript is concerned with relating two approac...
In several social and biomedical investigations the collected data can be classified into several ca...
We deal with two-way contingency tables having ordered column categories.We use a row effects model ...
Statistical analysis of data sets of high-dimensionality has met great interest over the past years,...
Generalized linear and additive models are very efficient regression tools but the selection of rele...
The currently available variable selection procedures in model-based clustering assume that the irre...
In multi-dimensional contingency tables sparse data occur frequently. For example, with bi-nary como...
Standard regression analyses are often plagued with problems encountered when one tries to make infe...
Log-linear models are useful for analyzing cross-classifications of counts arising in sociology, but...
Categorical data in contingency tables are collected in many investigations. In order to underst and...
Standard regression analyses are often plagued with problems encountered when one tries to make mean...
This thesis is concerned with the analysis of cross-classified categorical data from complex sample ...
Traditional clustering methods focus on grouping subjects or (dependent) variables assuming independ...
International audienceIn model-based clustering, each cluster is modelled by a parametrised probabil...