Motivated by issues raised by the analysis of gene expressions data, this thesis focuses on the impact of dependence on the properties of multiple testing procedures for high-dimensional data. We propose a methodology based on a Factor Analysis model for the correlation structure. Model parameters are estimated thanks to an em algorithm and an ad hoc methodology allowing to determine the model that fits best the covariance structure is defined. Moreover, the factor structure provides a general framework to deal with dependence in multiple testing. Two main issues are more particularly considered : the estimation of the proportion of true null hypotheses, and the control of error rates. The proposed framework leads to less variability in the...
This thesis deals with statistical questions raised by the analysis of high-dimensional genomic data...
This thesis deals with statistical questions raised by the analysis of high-dimensional genomic data...
This thesis deals with statistical questions raised by the analysis of high-dimensional genomic data...
Motivé par des applications dans le domaine de l'analyse de données génomiques, ce travail de thèse ...
The R package FAMT (factor analysis for multiple testing) provides a powerful method for large-scale...
The R package FAMT (factor analysis for multiple testing) provides a powerful method for large-scale...
International audienceThe data generated by high-throughput biotechnologies are characterized by the...
International audienceThe data generated by high-throughput biotechnologies are characterized by the...
International audienceThe data generated by high-throughput biotechnologies are characterized by the...
International audienceThe data generated by high-throughput biotechnologies are characterized by the...
International audienceThe data generated by high-throughput biotechnologies are characterized by the...
International audienceThe data generated by high-throughput biotechnologies are characterized by the...
The R package FAMT (factor analysis for multiple testing) provides a powerful method for large-scale...
International audienceMultiple testing issues have long been considered almost exclusively in the co...
This thesis deals with statistical questions raised by the analysis of high-dimensional genomic data...
This thesis deals with statistical questions raised by the analysis of high-dimensional genomic data...
This thesis deals with statistical questions raised by the analysis of high-dimensional genomic data...
This thesis deals with statistical questions raised by the analysis of high-dimensional genomic data...
Motivé par des applications dans le domaine de l'analyse de données génomiques, ce travail de thèse ...
The R package FAMT (factor analysis for multiple testing) provides a powerful method for large-scale...
The R package FAMT (factor analysis for multiple testing) provides a powerful method for large-scale...
International audienceThe data generated by high-throughput biotechnologies are characterized by the...
International audienceThe data generated by high-throughput biotechnologies are characterized by the...
International audienceThe data generated by high-throughput biotechnologies are characterized by the...
International audienceThe data generated by high-throughput biotechnologies are characterized by the...
International audienceThe data generated by high-throughput biotechnologies are characterized by the...
International audienceThe data generated by high-throughput biotechnologies are characterized by the...
The R package FAMT (factor analysis for multiple testing) provides a powerful method for large-scale...
International audienceMultiple testing issues have long been considered almost exclusively in the co...
This thesis deals with statistical questions raised by the analysis of high-dimensional genomic data...
This thesis deals with statistical questions raised by the analysis of high-dimensional genomic data...
This thesis deals with statistical questions raised by the analysis of high-dimensional genomic data...
This thesis deals with statistical questions raised by the analysis of high-dimensional genomic data...