This work was supported by MRC grant G1002319.This manuscript is concerned with relating two approaches that can be used to explore complex dependence structures between categorical variables, namely Bayesian partitioning of the covariate space incorporating a variable selection procedure that highlights the covariates that drive the clustering, and log-linear modelling with interaction terms. We derive theoretical results on this relation and discuss if they can be employed to assist log-linear model determination, demonstrating advantages and limitations with simulated and real data sets. The main advantage concerns sparse contingency tables. Inferences from clustering can potentially reduce the number of covariates considered and, subseq...
Categorical data in contingency tables are collected in many investigations. In order to underst and...
The R package bclust is useful for clustering high-dimensional continuous data. The package uses a p...
International audienceBeside CA and log-linear model, issued from the statistics domain, other resea...
AbstractThis manuscript is concerned with relating two approaches that can be used to explore comple...
This manuscript is concerned with relating two approaches that can be used to explore complex depend...
In several social and biomedical investigations the collected data can be classified into several ca...
In multi-dimensional contingency tables sparse data occur frequently. For example, with bi-nary como...
Generalized linear and additive models are very efficient regression tools but the selection of rele...
Statistical analysis of data sets of high-dimensionality has met great interest over the past years,...
Modeling interactions in regression models poses both computational as well as statistical challenge...
International audienceAn extension of the latent class model is presented for clustering categorical...
We deal with two-way contingency tables having ordered column categories.We use a row effects model ...
We propose a Bayesian approach to the Dirichlet-Multinomial (DM) regression model, which uses horses...
Thesis (Ph.D.)--University of Rochester. School of Medicine & Dentistry. Dept. of Biostatistics and ...
<p>Clustering methods are designed to separate heterogeneous data into groups of similar objects suc...
Categorical data in contingency tables are collected in many investigations. In order to underst and...
The R package bclust is useful for clustering high-dimensional continuous data. The package uses a p...
International audienceBeside CA and log-linear model, issued from the statistics domain, other resea...
AbstractThis manuscript is concerned with relating two approaches that can be used to explore comple...
This manuscript is concerned with relating two approaches that can be used to explore complex depend...
In several social and biomedical investigations the collected data can be classified into several ca...
In multi-dimensional contingency tables sparse data occur frequently. For example, with bi-nary como...
Generalized linear and additive models are very efficient regression tools but the selection of rele...
Statistical analysis of data sets of high-dimensionality has met great interest over the past years,...
Modeling interactions in regression models poses both computational as well as statistical challenge...
International audienceAn extension of the latent class model is presented for clustering categorical...
We deal with two-way contingency tables having ordered column categories.We use a row effects model ...
We propose a Bayesian approach to the Dirichlet-Multinomial (DM) regression model, which uses horses...
Thesis (Ph.D.)--University of Rochester. School of Medicine & Dentistry. Dept. of Biostatistics and ...
<p>Clustering methods are designed to separate heterogeneous data into groups of similar objects suc...
Categorical data in contingency tables are collected in many investigations. In order to underst and...
The R package bclust is useful for clustering high-dimensional continuous data. The package uses a p...
International audienceBeside CA and log-linear model, issued from the statistics domain, other resea...