In this project, two different machine learning models were tested in an attempt at imputing missing genotype data from patients on two different panels. As the integrity of the patients had to be protected, initial training was done on data simulated from the 1000 Genomes Project. The first model consisted of two convolutional variational autoencoders and the latent representations of the networks were shuffled to force the networks to find the same patterns in the two datasets. This model was unfortunately unsuccessful at imputing the missing data. The second model was based on a UNet structure and was more successful at the task of imputation. This model had one encoder for each dataset, making each encoder specialized at finding pattern...
Well-powered genomic studies require genome-wide marker coverage across many individuals. For non-mo...
We propose a general, theoretically justified mechanism for processing missing data by neural networ...
The accuracy and computational complexity of five methods to impute missing genotypes in high densit...
In this project, two different machine learning models were tested in an attempt at imputing missing...
In this project, a model based on a convolutional neural network have been developed with the aim of...
Genotype imputation estimates the genotypes of unobserved variants using the genotype data of other ...
Mendelian randomization studies typically have low power. Where there are several valid candidate ge...
Missing data imputation has been a hot topic in the past decade, and many state-of-the-art works hav...
<div><p>Missing data are an unavoidable component of modern statistical genetics. Different array or...
Due to complex experimental settings, missing values are common in biomedical data. To handle this i...
Abstract Background Imputation of missing genotypes is becoming a very popular solution for synchron...
BACKGROUND: Imputation of missing genotypes is becoming a very popular solution for synchronizing ge...
Genotype imputation has a wide range of applications in genome-wide association study (GWAS), includ...
Imputation is an in silico method that can increase the power of association studies by inferring mi...
<p>SNP data from a SNP array (SoySNP50K) or whole-genome resequencing (WGS) were used as a reference...
Well-powered genomic studies require genome-wide marker coverage across many individuals. For non-mo...
We propose a general, theoretically justified mechanism for processing missing data by neural networ...
The accuracy and computational complexity of five methods to impute missing genotypes in high densit...
In this project, two different machine learning models were tested in an attempt at imputing missing...
In this project, a model based on a convolutional neural network have been developed with the aim of...
Genotype imputation estimates the genotypes of unobserved variants using the genotype data of other ...
Mendelian randomization studies typically have low power. Where there are several valid candidate ge...
Missing data imputation has been a hot topic in the past decade, and many state-of-the-art works hav...
<div><p>Missing data are an unavoidable component of modern statistical genetics. Different array or...
Due to complex experimental settings, missing values are common in biomedical data. To handle this i...
Abstract Background Imputation of missing genotypes is becoming a very popular solution for synchron...
BACKGROUND: Imputation of missing genotypes is becoming a very popular solution for synchronizing ge...
Genotype imputation has a wide range of applications in genome-wide association study (GWAS), includ...
Imputation is an in silico method that can increase the power of association studies by inferring mi...
<p>SNP data from a SNP array (SoySNP50K) or whole-genome resequencing (WGS) were used as a reference...
Well-powered genomic studies require genome-wide marker coverage across many individuals. For non-mo...
We propose a general, theoretically justified mechanism for processing missing data by neural networ...
The accuracy and computational complexity of five methods to impute missing genotypes in high densit...