In this thesis, I study the problem of genome inference from short-read DNA sequencing data, with the goal of accurate characterisation of genomic regions with high sequence diversity. I describe a set of novel algorithms based on a generalised reference genome that captures sequence variation within a species. In Chapter 3, I propose a novel data structure that extends the traditional reference genome with known variants, providing a compressed representation of genetic diversity. I present algorithms to match sequencing reads to this extended reference structure and infer a personalised reference genome within close genetic distance from the sample under analysis. Coupled with existing variant calling tools, this personalised reference co...
Bacterial genetic variation originates through multiple mechanisms, including mutations during repli...
Despite being founded in the early 1920's, the field of Population and Evolutionary Genetics is curr...
Computational pangenomics is an emerging research field that is changing the way computer scientists...
Genotype imputation is a statistical technique that is often used to increase the power and resoluti...
Malaria elimination strategies require surveillance of the parasite population for genetic changes t...
The scale of the problems which human genomics is asked to solve necessitates that the field develop...
Malaria elimination strategies require surveillance of the parasite population for genetic changes t...
International audienceThe acquisition of a collection of individual genome sequences taken from a po...
Computational genomics involves the development and application of computational methods for whole-g...
In studies of human genome variation, researchers attempt to identify the DNA sequence differences b...
Malaria elimination strategies require surveillance of the parasite population for genetic changes t...
Despite being founded in the early 1920's, the field of Population and Evolutionary Genetics is curr...
Malaria elimination strategies require surveillance of the parasite population for genetic changes t...
After the complete genome sequence for several species, including human, has been determined, genomi...
I develop a number of mathematical and statistical models for the study of genomic variation within ...
Bacterial genetic variation originates through multiple mechanisms, including mutations during repli...
Despite being founded in the early 1920's, the field of Population and Evolutionary Genetics is curr...
Computational pangenomics is an emerging research field that is changing the way computer scientists...
Genotype imputation is a statistical technique that is often used to increase the power and resoluti...
Malaria elimination strategies require surveillance of the parasite population for genetic changes t...
The scale of the problems which human genomics is asked to solve necessitates that the field develop...
Malaria elimination strategies require surveillance of the parasite population for genetic changes t...
International audienceThe acquisition of a collection of individual genome sequences taken from a po...
Computational genomics involves the development and application of computational methods for whole-g...
In studies of human genome variation, researchers attempt to identify the DNA sequence differences b...
Malaria elimination strategies require surveillance of the parasite population for genetic changes t...
Despite being founded in the early 1920's, the field of Population and Evolutionary Genetics is curr...
Malaria elimination strategies require surveillance of the parasite population for genetic changes t...
After the complete genome sequence for several species, including human, has been determined, genomi...
I develop a number of mathematical and statistical models for the study of genomic variation within ...
Bacterial genetic variation originates through multiple mechanisms, including mutations during repli...
Despite being founded in the early 1920's, the field of Population and Evolutionary Genetics is curr...
Computational pangenomics is an emerging research field that is changing the way computer scientists...