Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, February, 2020Cataloged from PDF version of thesis.Includes bibliographical references (pages 55-62).This thesis is motivated by two major trends in data science: easy access to tremendous amounts of unstructured data and the effectiveness of Machine Learning (ML) in data driven applications. As a result, there is a growing need to integrate ML models and data curation into a homogeneous system such that the model informs the choice and extent of data curation. The bottleneck in designing such a system is to efficiently discern what additional information would result in improving the generalization of the ML models. We design an ...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
Abstract. We provide a methodology which integrates dynamic feature generation from relational datab...
1095-1100Multi-relational classification is highly challengeable task in data mining, because so muc...
© 2020, VLDB Endowment. Automatic machine learning (AML) is a family of techniques to automate the p...
Automatic machine learning is a subfield of machine learning that automates the common procedures fa...
Machine Learning (ML) applications require high-quality datasets. Automated data augmentation techni...
The democratization of data science, and in particular of the machine learning pipeline, has focused...
Machine Learning (ML) requires a certain number of features (i.e., attributes) to train the model. O...
One fundamental limitation of classical statistical modeling is the assumption that data is represen...
Various features come from relational data often used to enhance the prediction of statistical model...
In this talk, I will make the case for a first-principles approach to machine learning over relation...
Machine learning (ML) has emerged as a vital asset for researchers to analyze and extract valuable i...
We consider the problem of computing machine learning models over multi-relational databases. The ma...
Regularization is one of the key concepts in machine learning, but so far it has received only littl...
Enterprise data analytics is a booming area in the data man-agement industry. Many companies are rac...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
Abstract. We provide a methodology which integrates dynamic feature generation from relational datab...
1095-1100Multi-relational classification is highly challengeable task in data mining, because so muc...
© 2020, VLDB Endowment. Automatic machine learning (AML) is a family of techniques to automate the p...
Automatic machine learning is a subfield of machine learning that automates the common procedures fa...
Machine Learning (ML) applications require high-quality datasets. Automated data augmentation techni...
The democratization of data science, and in particular of the machine learning pipeline, has focused...
Machine Learning (ML) requires a certain number of features (i.e., attributes) to train the model. O...
One fundamental limitation of classical statistical modeling is the assumption that data is represen...
Various features come from relational data often used to enhance the prediction of statistical model...
In this talk, I will make the case for a first-principles approach to machine learning over relation...
Machine learning (ML) has emerged as a vital asset for researchers to analyze and extract valuable i...
We consider the problem of computing machine learning models over multi-relational databases. The ma...
Regularization is one of the key concepts in machine learning, but so far it has received only littl...
Enterprise data analytics is a booming area in the data man-agement industry. Many companies are rac...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
Abstract. We provide a methodology which integrates dynamic feature generation from relational datab...
1095-1100Multi-relational classification is highly challengeable task in data mining, because so muc...