Background: Assurance of digital health interventions involves, amongst others, clinical validation, which requires large datasets to test the application in realistic clinical scenarios. Development of such datasets is time consuming and challenging in terms of maintaining patient anonymity and consent. Objective: The development of synthetic datasets that maintain the statistical properties of the real datasets. Method: An artificial neural network based, generative adversarial network was implemented and trained, using numerical and categorical variables, including ICD-9 codes from the MIMIC III dataset, to produce a synthetic dataset. Results: The synthetic dataset, exhibits a correlation matrix highly similar to the real dataset...
The work presented in this paper was funded by NHSX using the synthetic data generation and evaluati...
Modern machine and deep learning methods require large datasets to achieve reliable and robust resul...
Realistic synthetic data are increasingly being recognized as solutions to lack of data or privacy c...
Digital health applications can improve quality and effectiveness of healthcare, by offering a numbe...
Background Digital health applications can improve quality and effectiveness of healthcare, by offer...
Restrictions in sharing Patient Health Identifiers (PHI) limit cross-organizational re-use of free-t...
The development of healthcare patient digital twins in combination with machine learning technologie...
The development of healthcare patient digital twins in combination with machine learning technologie...
The development of healthcare patient digital twins in combination with machine learning technologie...
International audienceWe develop metrics for measuring the quality of synthetic health data for both...
High-quality tabular data is a crucial requirement for developing data-driven applications, especial...
International audienceWe develop metrics for measuring the quality of synthetic health data for both...
International audienceWe develop metrics for measuring the quality of synthetic health data for both...
High-quality tabular data is a crucial requirement for developing data-driven applications, especial...
Synthetic health data have the potential to mitigate privacy concerns when sharing data to support b...
The work presented in this paper was funded by NHSX using the synthetic data generation and evaluati...
Modern machine and deep learning methods require large datasets to achieve reliable and robust resul...
Realistic synthetic data are increasingly being recognized as solutions to lack of data or privacy c...
Digital health applications can improve quality and effectiveness of healthcare, by offering a numbe...
Background Digital health applications can improve quality and effectiveness of healthcare, by offer...
Restrictions in sharing Patient Health Identifiers (PHI) limit cross-organizational re-use of free-t...
The development of healthcare patient digital twins in combination with machine learning technologie...
The development of healthcare patient digital twins in combination with machine learning technologie...
The development of healthcare patient digital twins in combination with machine learning technologie...
International audienceWe develop metrics for measuring the quality of synthetic health data for both...
High-quality tabular data is a crucial requirement for developing data-driven applications, especial...
International audienceWe develop metrics for measuring the quality of synthetic health data for both...
International audienceWe develop metrics for measuring the quality of synthetic health data for both...
High-quality tabular data is a crucial requirement for developing data-driven applications, especial...
Synthetic health data have the potential to mitigate privacy concerns when sharing data to support b...
The work presented in this paper was funded by NHSX using the synthetic data generation and evaluati...
Modern machine and deep learning methods require large datasets to achieve reliable and robust resul...
Realistic synthetic data are increasingly being recognized as solutions to lack of data or privacy c...