Missing data is a common concern in health datasets, and its impact on good decision-making processes is well documented. Our study's contribution is a methodology for tackling missing data problems using a combination of synthetic dataset generation, missing data imputation and deep learning methods to resolve missing data challenges. Specifically, we conducted a series of experiments with these objectives; $a)$ generating a realistic synthetic dataset, $b)$ simulating data missingness, $c)$ recovering the missing data, and $d)$ analyzing imputation performance. Our methodology used a gaussian mixture model whose parameters were learned from a cleaned subset of a real demographic and health dataset to generate the synthetic data. We simula...
One important characteristic of good data is completeness. Missing data is a major problem in the cl...
Missing data is one of the most common issues encountered in data cleaning process especially when d...
Missing data are prevalent in many public health studies for various reasons. For example, some subj...
The analysis of digital health data with machine learning models can be used in clinical application...
Missing data is a universal problem in analysing Real-World Evidence (RWE) datasets. In RWE datasets...
In real-life situations, we often encounter data sets containing missing observations. Statistical m...
International audienceBACKGROUND: As databases grow larger, it becomes harder to fully control their...
Missing data is one of the most common issues encountered in data cleaning process especially when d...
The performance evaluation of imputation algorithms often involves the generation of missing values...
Cardiovascular disease (CVD) is a class of diseases that involve the heart or blood vessels and acco...
This work considers the problem of learning with missing data. Two main classes of approaches are co...
Many real-world datasets suffer from missing data, which can introduce uncertainty into ensuing anal...
[[abstract]]While there is an ample amount of medical information available for data mining, many of...
Philosophiae Doctor - PhD (Statistics and Population Studies)The aim of this study is to look at the...
Missing data is a major problem in real-world datasets, which hinders the performance of data analyt...
One important characteristic of good data is completeness. Missing data is a major problem in the cl...
Missing data is one of the most common issues encountered in data cleaning process especially when d...
Missing data are prevalent in many public health studies for various reasons. For example, some subj...
The analysis of digital health data with machine learning models can be used in clinical application...
Missing data is a universal problem in analysing Real-World Evidence (RWE) datasets. In RWE datasets...
In real-life situations, we often encounter data sets containing missing observations. Statistical m...
International audienceBACKGROUND: As databases grow larger, it becomes harder to fully control their...
Missing data is one of the most common issues encountered in data cleaning process especially when d...
The performance evaluation of imputation algorithms often involves the generation of missing values...
Cardiovascular disease (CVD) is a class of diseases that involve the heart or blood vessels and acco...
This work considers the problem of learning with missing data. Two main classes of approaches are co...
Many real-world datasets suffer from missing data, which can introduce uncertainty into ensuing anal...
[[abstract]]While there is an ample amount of medical information available for data mining, many of...
Philosophiae Doctor - PhD (Statistics and Population Studies)The aim of this study is to look at the...
Missing data is a major problem in real-world datasets, which hinders the performance of data analyt...
One important characteristic of good data is completeness. Missing data is a major problem in the cl...
Missing data is one of the most common issues encountered in data cleaning process especially when d...
Missing data are prevalent in many public health studies for various reasons. For example, some subj...