International audienceWe examine the feasibility of using synthetic medical data generated by GANs in the classroom, to teach data science in health infor-matics. We present an end-to-end methodology to retain instructional utility, while preserving privacy to a level, which meets regulatory requirements: (1) a GAN is trained by a certified medical-data security-aware agent, inside a secure environment; (2) the final GAN model is used outside of the secure environment by external users (instructors or researchers) to generate synthetic data. This second step facilitates data handling for external users, by avoiding de-identification, which may require special user training, be costly, and/or cause loss of data fidelity. We benchmark our pro...
A large amount of personal health data that is highly valuable to the scientific community is still ...
Advances in computation have created high demand for large datasets, which in turn has sparked inter...
International audienceA vast amount of crucial information about patients resides solely in unstruct...
International audienceWe examine the feasibility of using synthetic medical data generated by GANs i...
International audienceWe develop metrics for measuring the quality of synthetic health data for both...
Synthetic health data have the potential to mitigate privacy concerns when sharing data to support b...
With the rapid advancements in machine learning, the health care paradigm is shifting from treatment...
High-quality tabular data is a crucial requirement for developing data-driven applications, especial...
International audienceGenerating synthetic data represents an attractive solution for creating open ...
Hospitals and General Practitioner (GP) surgeries within National Health Services (NHS), collect pat...
International audienceInstitutions collect massive learning traces but they may not disclose it for ...
The growing development of artificial intelligence (AI), particularly neural networks, is transformi...
These talks were presented for the Privacy Day Webinar 2022 sponsored by the American Statistical As...
A large amount of personal health data that is highly valuable to the scientific community is still ...
Advances in computation have created high demand for large datasets, which in turn has sparked inter...
International audienceA vast amount of crucial information about patients resides solely in unstruct...
International audienceWe examine the feasibility of using synthetic medical data generated by GANs i...
International audienceWe develop metrics for measuring the quality of synthetic health data for both...
Synthetic health data have the potential to mitigate privacy concerns when sharing data to support b...
With the rapid advancements in machine learning, the health care paradigm is shifting from treatment...
High-quality tabular data is a crucial requirement for developing data-driven applications, especial...
International audienceGenerating synthetic data represents an attractive solution for creating open ...
Hospitals and General Practitioner (GP) surgeries within National Health Services (NHS), collect pat...
International audienceInstitutions collect massive learning traces but they may not disclose it for ...
The growing development of artificial intelligence (AI), particularly neural networks, is transformi...
These talks were presented for the Privacy Day Webinar 2022 sponsored by the American Statistical As...
A large amount of personal health data that is highly valuable to the scientific community is still ...
Advances in computation have created high demand for large datasets, which in turn has sparked inter...
International audienceA vast amount of crucial information about patients resides solely in unstruct...