Datasets of different characteristics are needed by the research com-munity for experimental purposes. However, real data may be difficult to obtain due to privacy concerns. Moreover, real data may not meet specific characteristics which are needed to verify new approaches under certain conditions. Given these limitations, the use of synthetic data is a viable alternative to complement the real data. In this report, we describe the process followed to generate synthetic data using Benerator, a publicly available tool. The results show that the synthetic data preserves a high level of accuracy compared to the original data. The generated datasets correspond to microdata containing records with social, economic and demographic data which mimi...
In order to understand the health outcomes for distinct sub-groups of the population or across diffe...
Over the past decades, microsimulations have increasingly become an important component of modern em...
In many contexts, confidentiality constraints severely restrict access to unique and valuable microd...
In many contexts, confidentiality constraints severely restrict access to unique and valuable microd...
The production of synthetic datasets has been proposed as a statistical disclosure control solution ...
The production of synthetic datasets has been proposed as a statistical disclosure control solution ...
The access of quality data is valuable, and there is an increased attention to sharing data within t...
The Dutch Central Bureau of Statistics (CBS) possesses extremely valuable sensitive datasets with in...
The analysis of large administrative data sets can provide researchers with answers to many research...
Open research data provide considerable scientific, societal, and economic benefits. However, disclo...
New forms of administrative and linked data containing high levels of attribute and spatial detail p...
Presented at World Statistical Congress 2013.National Statistical offices (NSOs) create official sta...
A large portion of data collected by many organisations today is about people, and often contains pe...
With the recent advances and increasing activities in data mining and analysis, the protection of th...
New forms of administrative and linked data with high attribute and spatial detail present increased...
In order to understand the health outcomes for distinct sub-groups of the population or across diffe...
Over the past decades, microsimulations have increasingly become an important component of modern em...
In many contexts, confidentiality constraints severely restrict access to unique and valuable microd...
In many contexts, confidentiality constraints severely restrict access to unique and valuable microd...
The production of synthetic datasets has been proposed as a statistical disclosure control solution ...
The production of synthetic datasets has been proposed as a statistical disclosure control solution ...
The access of quality data is valuable, and there is an increased attention to sharing data within t...
The Dutch Central Bureau of Statistics (CBS) possesses extremely valuable sensitive datasets with in...
The analysis of large administrative data sets can provide researchers with answers to many research...
Open research data provide considerable scientific, societal, and economic benefits. However, disclo...
New forms of administrative and linked data containing high levels of attribute and spatial detail p...
Presented at World Statistical Congress 2013.National Statistical offices (NSOs) create official sta...
A large portion of data collected by many organisations today is about people, and often contains pe...
With the recent advances and increasing activities in data mining and analysis, the protection of th...
New forms of administrative and linked data with high attribute and spatial detail present increased...
In order to understand the health outcomes for distinct sub-groups of the population or across diffe...
Over the past decades, microsimulations have increasingly become an important component of modern em...
In many contexts, confidentiality constraints severely restrict access to unique and valuable microd...