Summary: Differential privacy allows quantifying privacy loss resulting from accession of sensitive personal data. Repeated accesses to underlying data incur increasing loss. Releasing data as privacy-preserving synthetic data would avoid this limitation but would leave open the problem of designing what kind of synthetic data. We propose formulating the problem of private data release through probabilistic modeling. This approach transforms the problem of designing the synthetic data into choosing a model for the data, allowing also the inclusion of prior knowledge, which improves the quality of the synthetic data. We demonstrate empirically, in an epidemiological study, that statistical discoveries can be reliably reproduced from the synt...
An individual's personal information is gathered by a multitude of different data collectors through...
When releasing data for public use, statistical agencies seek to reduce the risk of disclosure, whil...
Medical data often contain sensitive personal information about individuals, posing significant limi...
Differential privacy allows quantifying privacy loss resulting from accession of sensitive personal ...
Differential privacy allows quantifying privacy loss resulting from accession of sensitive personal ...
Funding Information: This work was supported by the Academy of Finland (grants 325573 , 325572 , 319...
How can we share sensitive datasets in such a way as to maximize utility while simultaneously safegu...
With the growing concerns over data privacy and new regulations like the General Data Protection Reg...
Synthetic data has been advertised as a silver-bullet solution to privacy-preserving data publishing...
These talks were presented for the Privacy Day Webinar 2022 sponsored by the American Statistical As...
The predictive potential of the many large datasets being held in healthcare, financial markets, soc...
When releasing data for public use, statistical agencies seek to reduce the risk of disclosure, whil...
An individual's personal information is gathered by a multitude of different data collectors through...
When releasing data for public use, statistical agencies seek to reduce the risk of disclosure, whil...
When releasing data for public use, statistical agencies seek to reduce the risk of disclosure, whil...
An individual's personal information is gathered by a multitude of different data collectors through...
When releasing data for public use, statistical agencies seek to reduce the risk of disclosure, whil...
Medical data often contain sensitive personal information about individuals, posing significant limi...
Differential privacy allows quantifying privacy loss resulting from accession of sensitive personal ...
Differential privacy allows quantifying privacy loss resulting from accession of sensitive personal ...
Funding Information: This work was supported by the Academy of Finland (grants 325573 , 325572 , 319...
How can we share sensitive datasets in such a way as to maximize utility while simultaneously safegu...
With the growing concerns over data privacy and new regulations like the General Data Protection Reg...
Synthetic data has been advertised as a silver-bullet solution to privacy-preserving data publishing...
These talks were presented for the Privacy Day Webinar 2022 sponsored by the American Statistical As...
The predictive potential of the many large datasets being held in healthcare, financial markets, soc...
When releasing data for public use, statistical agencies seek to reduce the risk of disclosure, whil...
An individual's personal information is gathered by a multitude of different data collectors through...
When releasing data for public use, statistical agencies seek to reduce the risk of disclosure, whil...
When releasing data for public use, statistical agencies seek to reduce the risk of disclosure, whil...
An individual's personal information is gathered by a multitude of different data collectors through...
When releasing data for public use, statistical agencies seek to reduce the risk of disclosure, whil...
Medical data often contain sensitive personal information about individuals, posing significant limi...