Douzas, G., Lechleitner, M., & Bacao, F. (2022). Improving the quality of predictive models in small data GSDOT: A new algorithm for generating synthetic data. PLoS ONE, 17(4), 1-15. [e0265626]. https://doi.org/10.1371/journal.pone.0265626In the age of the data deluge there are still many domains and applications restricted to the use of small datasets. The ability to harness these small datasets to solve problems through the use of supervised learning methods can have a significant impact in many important areas. The insufficient size of training data usually results in unsatisfactory performance of machine learning algorithms. The current research work aims to contribute to mitigate the small data problem through the creation of artificia...
The advent of data mining and machine learning has highlighted the value of large and varied sources...
It is difficult for learning models to achieve high classification performances with imbalanced data...
Neural networks have been applied successfully in many fields. However, satisfactory results can onl...
Douzas, G., Rauch, R., & Bacao, F. (2021). G-SOMO: An oversampling approach based on self-organized ...
Dissertation presented as the partial requirement for obtaining a Master's degree in Data Science a...
The real data are not always available/accessible/sufficient or in many cases they are incomplete an...
The field of machine learning has made a lot of progress in the recent years. As it is used more fre...
Recent breakthroughs in synthetic data generation approaches made it possible to produce highly phot...
Developing predictive models for classification problems considering imbalanced datasets is one of t...
[[abstract]]It is difficult for learning models to achieve high classification performances with imb...
The last few years have witnessed the rise of the big data era, which features the prevalence of dat...
Deep generative models, which target reproducing the given data distribution to produce novel sample...
Within machine learning, the problem of class imbalance refers to the scenario in which one or more ...
The development of platforms and techniques for emerging Big Data and Machine Learning applications ...
Over the past decade, deep learning has pro- foundly transformed the landscape of science and tech-...
The advent of data mining and machine learning has highlighted the value of large and varied sources...
It is difficult for learning models to achieve high classification performances with imbalanced data...
Neural networks have been applied successfully in many fields. However, satisfactory results can onl...
Douzas, G., Rauch, R., & Bacao, F. (2021). G-SOMO: An oversampling approach based on self-organized ...
Dissertation presented as the partial requirement for obtaining a Master's degree in Data Science a...
The real data are not always available/accessible/sufficient or in many cases they are incomplete an...
The field of machine learning has made a lot of progress in the recent years. As it is used more fre...
Recent breakthroughs in synthetic data generation approaches made it possible to produce highly phot...
Developing predictive models for classification problems considering imbalanced datasets is one of t...
[[abstract]]It is difficult for learning models to achieve high classification performances with imb...
The last few years have witnessed the rise of the big data era, which features the prevalence of dat...
Deep generative models, which target reproducing the given data distribution to produce novel sample...
Within machine learning, the problem of class imbalance refers to the scenario in which one or more ...
The development of platforms and techniques for emerging Big Data and Machine Learning applications ...
Over the past decade, deep learning has pro- foundly transformed the landscape of science and tech-...
The advent of data mining and machine learning has highlighted the value of large and varied sources...
It is difficult for learning models to achieve high classification performances with imbalanced data...
Neural networks have been applied successfully in many fields. However, satisfactory results can onl...