The production of synthetic datasets has been proposed as a statistical disclosure control solution to generate public use files out of protected data, and as a tool to create "augmented datasets" to serve as input for micro-simulation models. Synthetic data have become an important instrument for ex-ante assessments of policy impact. The performance and acceptability of such a tool relies heavily on the quality of the synthetic populations, i.e., on the statistical similarity between the synthetic and the true population of interest. Multiple approaches and tools have been developed to generate synthetic data. These approaches can be categorized into three main groups: synthetic reconstruction, combinatorial optimization, and model-based g...
The analysis of large administrative data sets can provide researchers with answers to many research...
In contrast to the many public-use microdata samples available for individual and household data fro...
Microsimulations may involve a large number of agents. It is then practically impossible or too expe...
The production of synthetic datasets has been proposed as a statistical disclosure control solution ...
In many contexts, confidentiality constraints severely restrict access to unique and valuable microd...
In many contexts, confidentiality constraints severely restrict access to unique and valuable microd...
Open research data provide considerable scientific, societal, and economic benefits. However, disclo...
Simulation studies are widely used by statisticians to gain insight into the quality of developed me...
The modern paradigms of machine learning algorithms and artificial intelligence base their success o...
Datasets of different characteristics are needed by the research com-munity for experimental purpose...
In this presentation we will define Synthetic populations and illustrate the value it provides to mo...
To avoid disclosures, Rubin proposed creating multiple, synthetic data sets for public release so th...
The use of synthetic data sets are becoming ever more prevalent, as regulations such as the General ...
The Synthetic Population Catalyst (SPC) is an open-source tool for the simulation of populations. B...
Archive for the synthetic data pre-conference workshop at the Open Science Festival on September 1, ...
The analysis of large administrative data sets can provide researchers with answers to many research...
In contrast to the many public-use microdata samples available for individual and household data fro...
Microsimulations may involve a large number of agents. It is then practically impossible or too expe...
The production of synthetic datasets has been proposed as a statistical disclosure control solution ...
In many contexts, confidentiality constraints severely restrict access to unique and valuable microd...
In many contexts, confidentiality constraints severely restrict access to unique and valuable microd...
Open research data provide considerable scientific, societal, and economic benefits. However, disclo...
Simulation studies are widely used by statisticians to gain insight into the quality of developed me...
The modern paradigms of machine learning algorithms and artificial intelligence base their success o...
Datasets of different characteristics are needed by the research com-munity for experimental purpose...
In this presentation we will define Synthetic populations and illustrate the value it provides to mo...
To avoid disclosures, Rubin proposed creating multiple, synthetic data sets for public release so th...
The use of synthetic data sets are becoming ever more prevalent, as regulations such as the General ...
The Synthetic Population Catalyst (SPC) is an open-source tool for the simulation of populations. B...
Archive for the synthetic data pre-conference workshop at the Open Science Festival on September 1, ...
The analysis of large administrative data sets can provide researchers with answers to many research...
In contrast to the many public-use microdata samples available for individual and household data fro...
Microsimulations may involve a large number of agents. It is then practically impossible or too expe...