Differential privacy allows quantifying privacy loss resulting from accession of sensitive personal data. Repeated accesses to underlying data incur increasing loss. Releasing data as privacy-preserving synthetic data would avoid this limitation but would leave open the problem of designing what kind of synthetic data. We propose formulating the problem of private data release through probabilistic modeling. This approach transforms the problem of designing the synthetic data into choosing a model for the data, allowing also the inclusion of prior knowledge, which improves the quality of the synthetic data. We demonstrate empirically, in an epidemiological study, that statistical discoveries can be reliably reproduced from the synthetic dat...
The availability of genomic data is essential to progress in biomedical research, personalized medi...
Releasing sensitive data while preserving privacy is an important problem that has attracted conside...
These talks were presented for the Privacy Day Webinar 2022 sponsored by the American Statistical As...
Differential privacy allows quantifying privacy loss resulting from accession of sensitive personal ...
Summary: Differential privacy allows quantifying privacy loss resulting from accession of sensitive ...
Funding Information: This work was supported by the Academy of Finland (grants 325573 , 325572 , 319...
In a world where artificial intelligence and data science become omnipresent, data sharing is increa...
How can we share sensitive datasets in such a way as to maximize utility while simultaneously safegu...
How can we share sensitive datasets in such a way as to maximize utility while simultaneously safegu...
Synthetic data has been advertised as a silver-bullet solution to privacy-preserving data publishing...
The availability of large amounts of informative data is crucial for successful machine learning. Ho...
Methods for privacy-preserving data publishing and analysis trade off privacy risks for individuals ...
AI-based data synthesis has seen rapid progress over the last several years and is increasingly reco...
The availability of genomic data is essential to progress in biomedical research, personalized medi...
Privacy-preserving data publishing is an important problem that has been the focus of extensive stud...
The availability of genomic data is essential to progress in biomedical research, personalized medi...
Releasing sensitive data while preserving privacy is an important problem that has attracted conside...
These talks were presented for the Privacy Day Webinar 2022 sponsored by the American Statistical As...
Differential privacy allows quantifying privacy loss resulting from accession of sensitive personal ...
Summary: Differential privacy allows quantifying privacy loss resulting from accession of sensitive ...
Funding Information: This work was supported by the Academy of Finland (grants 325573 , 325572 , 319...
In a world where artificial intelligence and data science become omnipresent, data sharing is increa...
How can we share sensitive datasets in such a way as to maximize utility while simultaneously safegu...
How can we share sensitive datasets in such a way as to maximize utility while simultaneously safegu...
Synthetic data has been advertised as a silver-bullet solution to privacy-preserving data publishing...
The availability of large amounts of informative data is crucial for successful machine learning. Ho...
Methods for privacy-preserving data publishing and analysis trade off privacy risks for individuals ...
AI-based data synthesis has seen rapid progress over the last several years and is increasingly reco...
The availability of genomic data is essential to progress in biomedical research, personalized medi...
Privacy-preserving data publishing is an important problem that has been the focus of extensive stud...
The availability of genomic data is essential to progress in biomedical research, personalized medi...
Releasing sensitive data while preserving privacy is an important problem that has attracted conside...
These talks were presented for the Privacy Day Webinar 2022 sponsored by the American Statistical As...