peer-reviewedConducting extensive testing of anonymization techniques is critical to assess their robustness and identify the scenarios where they are most suitable. However, the access to real microdata is highly restricted and the one that is publicly-available is usually anonymized or aggregated; hence, reducing its value for testing purposes. In this paper, we present a framework (COCOA) for the generation of realistic synthetic microdata that allows to de ne multi-attribute relationships in order to preserve the functional dependencies of the data. We prove how COCOA is useful to strengthen the testing of anonymization techniques by broadening the number and diversity of the test scenarios. Results also show how COCOA is prac...
Publishing data about individuals, in a privacy-preserving way, has led to a large body of research....
Synthetic data has been advertised as a silver-bullet solution to privacy-preserving data publishing...
With the recent advances and increasing activities in data mining and analysis, the protection of th...
Conducting extensive testing of anonymization techniques is critical to assess their robustness and ...
UNESCO Chair in Data Privacy, International Conference, PSD 2016, Dubrovnik, Croatia, September 14–1...
AI-based data synthesis has seen rapid progress over the last several years and is increasingly reco...
The application of machine learning techniques to large and distributed data archives might result i...
Information surges and advances in machine learning tools have enable the collection and storage of ...
The collection, publication, and mining of personal data have become key drivers of innovation and v...
Synthetic data has gained significant momentum thanks to sophisticated machine learning tools that e...
Data about individuals is being increasingly collected and disseminated for purposes such as busines...
Privacy becomes a more and more serious concern in applications involving microdata. Recently, effic...
International audienceInstitutions collect massive learning traces but they may not disclose it for ...
Today, the publication of microdata poses a privacy threat. Vast research has striven to define the ...
In recent years, data published and shared with third parties to develop artificial intelligence (AI...
Publishing data about individuals, in a privacy-preserving way, has led to a large body of research....
Synthetic data has been advertised as a silver-bullet solution to privacy-preserving data publishing...
With the recent advances and increasing activities in data mining and analysis, the protection of th...
Conducting extensive testing of anonymization techniques is critical to assess their robustness and ...
UNESCO Chair in Data Privacy, International Conference, PSD 2016, Dubrovnik, Croatia, September 14–1...
AI-based data synthesis has seen rapid progress over the last several years and is increasingly reco...
The application of machine learning techniques to large and distributed data archives might result i...
Information surges and advances in machine learning tools have enable the collection and storage of ...
The collection, publication, and mining of personal data have become key drivers of innovation and v...
Synthetic data has gained significant momentum thanks to sophisticated machine learning tools that e...
Data about individuals is being increasingly collected and disseminated for purposes such as busines...
Privacy becomes a more and more serious concern in applications involving microdata. Recently, effic...
International audienceInstitutions collect massive learning traces but they may not disclose it for ...
Today, the publication of microdata poses a privacy threat. Vast research has striven to define the ...
In recent years, data published and shared with third parties to develop artificial intelligence (AI...
Publishing data about individuals, in a privacy-preserving way, has led to a large body of research....
Synthetic data has been advertised as a silver-bullet solution to privacy-preserving data publishing...
With the recent advances and increasing activities in data mining and analysis, the protection of th...