The development of platforms and techniques for emerging Big Data and Machine Learning applications requires the availability of real-life datasets. A possible solution is to synthesize datasets that reflect patterns of real ones using a two-step approach: first, a real dataset X is analyzed to derive relevant patterns Z and, then, to use such patterns for reconstructing a new dataset X\u27 that preserves the main characteristics of X. This survey explores two possible approaches: (1) Constraint-based generation and (2) probabilistic generative modeling. The former is devised using inverse mining (IFM) techniques, and consists of generating a dataset satisfying given support constraints on the itemsets of an input set, that are typically th...
In this thesis, we investigate various approaches for generative modeling, with a special emphasis o...
Data-driven and model-driven methodologies can be regarded as competitive fields since they tackle s...
Probabilistic reasoning, among methodologies used within the domain of artificial intelligence, is r...
The development of platforms and techniques for emerging Big Data and Machine Learning applications ...
The real data are not always available/accessible/sufficient or in many cases they are incomplete an...
The development of novel platforms and techniques for emerging “Big Data” applications requires the ...
[EN] Machine learning has becoming a trending topic in the last years, being now one of the most de...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
We introduce marginalization models (MaMs), a new family of generative models for high-dimensional d...
This paper proposes three different data generators, tailored to transactional datasets, based on ex...
Pattern mining is a fundamental data mining task with applications in several domains. In this work,...
Despite recent advances, goal-directed generation of structured discrete data remains challenging. F...
Douzas, G., Lechleitner, M., & Bacao, F. (2022). Improving the quality of predictive models in small...
Over the past decade, the growth of data has been phenomenal. The amount of data which the world acc...
Pattern mining is a fundamental data mining task with applications in several domains. In this work,...
In this thesis, we investigate various approaches for generative modeling, with a special emphasis o...
Data-driven and model-driven methodologies can be regarded as competitive fields since they tackle s...
Probabilistic reasoning, among methodologies used within the domain of artificial intelligence, is r...
The development of platforms and techniques for emerging Big Data and Machine Learning applications ...
The real data are not always available/accessible/sufficient or in many cases they are incomplete an...
The development of novel platforms and techniques for emerging “Big Data” applications requires the ...
[EN] Machine learning has becoming a trending topic in the last years, being now one of the most de...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
We introduce marginalization models (MaMs), a new family of generative models for high-dimensional d...
This paper proposes three different data generators, tailored to transactional datasets, based on ex...
Pattern mining is a fundamental data mining task with applications in several domains. In this work,...
Despite recent advances, goal-directed generation of structured discrete data remains challenging. F...
Douzas, G., Lechleitner, M., & Bacao, F. (2022). Improving the quality of predictive models in small...
Over the past decade, the growth of data has been phenomenal. The amount of data which the world acc...
Pattern mining is a fundamental data mining task with applications in several domains. In this work,...
In this thesis, we investigate various approaches for generative modeling, with a special emphasis o...
Data-driven and model-driven methodologies can be regarded as competitive fields since they tackle s...
Probabilistic reasoning, among methodologies used within the domain of artificial intelligence, is r...