This thesis is devoted to the development of a new statistical model for segmentation/clustering problems. The objective is to partition the data into homogeneous regions and to cluster these regions into a finite number of groups. Segmentation/clustering problems are traditionally studied with hidden Markov models. We propose an alternative model which combines segmentation models and mixture models. We construct our model in the Gaussian case and we propose a generalization to discrete dependent variables. The parameters of the model are estimated by maximum likelihood with a hybrid algorithm based on dynamic programming and on the EM algorithm. We study a new model selection problem which is the simultaneous selection of the number of ...
We propose to model the output of transcriptome sequencing technologies (RNA-Seq) using the negative...
This work shows how one can determine an optimal combination of clustering algorithms by performing ...
Eukaryotic genomes display segmental patterns of variation in various properties, including GC conte...
Abstract. Microarray-CGH experiments are used to detect and map chromosomal imbalances, by hybridizi...
Statistical machine learning is a branch of mathematics concerned with developing algorithms for dat...
In many biological applications it is necessary to cluster DNA sequences into groups that represent ...
This work focuses on the problem of point and variable clustering, that is the grouping of either si...
I tackle the problem of partitioning a sequence into homogeneous segments, where homogeneity is defi...
M. Jean MACCARIO Président M. Dominique CELLIER Examinateur M. Christian GAUTIER Rapporteur Mme Chan...
We are interested in variable selection for clustering with Gaussian mixture models. This research i...
Genomic data from DNA microarray or sequencing technologies have two major characteristics: their hi...
Motivation: DNA segmentation, i.e. the partitioning of DNA in compositionally homogeneous segments, ...
The research work presented in this dissertation is on keeping with the statistical integration of p...
We address the issue of clustering individuals from " complex " observations in the sense that they ...
Nous proposons de modéliser les données issues des technologies de séquençage du transcriptome (RNA-...
We propose to model the output of transcriptome sequencing technologies (RNA-Seq) using the negative...
This work shows how one can determine an optimal combination of clustering algorithms by performing ...
Eukaryotic genomes display segmental patterns of variation in various properties, including GC conte...
Abstract. Microarray-CGH experiments are used to detect and map chromosomal imbalances, by hybridizi...
Statistical machine learning is a branch of mathematics concerned with developing algorithms for dat...
In many biological applications it is necessary to cluster DNA sequences into groups that represent ...
This work focuses on the problem of point and variable clustering, that is the grouping of either si...
I tackle the problem of partitioning a sequence into homogeneous segments, where homogeneity is defi...
M. Jean MACCARIO Président M. Dominique CELLIER Examinateur M. Christian GAUTIER Rapporteur Mme Chan...
We are interested in variable selection for clustering with Gaussian mixture models. This research i...
Genomic data from DNA microarray or sequencing technologies have two major characteristics: their hi...
Motivation: DNA segmentation, i.e. the partitioning of DNA in compositionally homogeneous segments, ...
The research work presented in this dissertation is on keeping with the statistical integration of p...
We address the issue of clustering individuals from " complex " observations in the sense that they ...
Nous proposons de modéliser les données issues des technologies de séquençage du transcriptome (RNA-...
We propose to model the output of transcriptome sequencing technologies (RNA-Seq) using the negative...
This work shows how one can determine an optimal combination of clustering algorithms by performing ...
Eukaryotic genomes display segmental patterns of variation in various properties, including GC conte...