Synthetic data has gained significant momentum thanks to sophisticated machine learning tools that enable the synthesis of high-dimensional datasets. However, many generation techniques do not give the data controller control over what statistical patterns are captured, leading to concerns over privacy protection. While synthetic records are not linked to a particular real-world individual, they can reveal information about users indirectly which may be unacceptable for data owners. There is thus a need to empirically verify the privacy of synthetic data -- a particularly challenging task in high-dimensional data. In this paper we present a general framework for synthetic data generation that gives data controllers full control over which s...
Existing private synthetic data generation algorithms are agnostic to downstream tasks. However, end...
Abstract—Evaluating the performance of database systems is crucial when database vendors or research...
The U.S. Census Longitudinal Business Database (LBD) product contains employment and payroll informa...
AI-based data synthesis has seen rapid progress over the last several years and is increasingly reco...
Synthetic data generation is a powerful tool for privacy protection when considering public release ...
Synthetic data has been advertised as a silver-bullet solution to privacy-preserving data publishing...
How can we share sensitive datasets in such a way as to maximize utility while simultaneously safegu...
This explainer document aims to provide an overview of the current state of the rapidly expanding wo...
With the recent advances and increasing activities in data mining and analysis, the protection of th...
This is the final version. Available on open access from SAGE Publications via the DOI in this recor...
Archive for the synthetic data pre-conference workshop at the Open Science Festival on September 1, ...
In many contexts, confidentiality constraints severely restrict access to unique and valuable microd...
Synthetic data is emerging as the most promising solution to share individual-level data while safeg...
In a world where artificial intelligence and data science become omnipresent, data sharing is increa...
Differential privacy allows quantifying privacy loss resulting from accession of sensitive personal ...
Existing private synthetic data generation algorithms are agnostic to downstream tasks. However, end...
Abstract—Evaluating the performance of database systems is crucial when database vendors or research...
The U.S. Census Longitudinal Business Database (LBD) product contains employment and payroll informa...
AI-based data synthesis has seen rapid progress over the last several years and is increasingly reco...
Synthetic data generation is a powerful tool for privacy protection when considering public release ...
Synthetic data has been advertised as a silver-bullet solution to privacy-preserving data publishing...
How can we share sensitive datasets in such a way as to maximize utility while simultaneously safegu...
This explainer document aims to provide an overview of the current state of the rapidly expanding wo...
With the recent advances and increasing activities in data mining and analysis, the protection of th...
This is the final version. Available on open access from SAGE Publications via the DOI in this recor...
Archive for the synthetic data pre-conference workshop at the Open Science Festival on September 1, ...
In many contexts, confidentiality constraints severely restrict access to unique and valuable microd...
Synthetic data is emerging as the most promising solution to share individual-level data while safeg...
In a world where artificial intelligence and data science become omnipresent, data sharing is increa...
Differential privacy allows quantifying privacy loss resulting from accession of sensitive personal ...
Existing private synthetic data generation algorithms are agnostic to downstream tasks. However, end...
Abstract—Evaluating the performance of database systems is crucial when database vendors or research...
The U.S. Census Longitudinal Business Database (LBD) product contains employment and payroll informa...