Nowadays, the major challenge in machine learning is the ‘Big Data’ challenge. The big data problems due to large number of data points or large number of features in each data point, or both, the training of models have become very slow. The training time has two major components: Time to access the data and time to process (learn from) the data. So far, the research has focused only on the second part, i.e., learning from the data. In this paper, we have proposed one possible solution to handle the big data problems in machine learning. The idea is to reduce the training time through reducing data access time by proposing systematic sampling and cyclic/sequential sampling to select mini-batches from the dataset. To prove the effectiveness...
Recent years have witnessed huge advances in machine learning (ML) and its applications, especially ...
The amount of data being generated and stored is growing exponentially, owed in part to the continui...
We describe two techniques that signicantly improve the running time of several stan-dard machine-le...
Nowadays, the major challenge in machine learning is the ‘Big Data’ challenge. The big data problems...
One of the fundamental machine learning tasks is that of predictive classification. Given that organ...
Machine learning has achieved tremendous successes and played increasingly essential roles in many a...
This work considers optimization methods for large-scale machine learning (ML). Optimization in ML ...
We examine the learning-curve sampling method, an approach for applying machinelearning algorithms t...
Thesis (Ph.D.)--University of Washington, 2018To learn from large datasets, modern machine learning ...
Given a large data set and a classification learning algorithm, Progressive Sampling (PS) uses incre...
The rapid development of modern information technology has significantly facilitated the generation,...
This paper reviews the appropriateness for application to large data sets of standard machine learni...
Importance sampling, a variant of online sampling, is often used in neural network training to impro...
Abstract This research paper presents an innovative approach to gradient descent known as ‘‘Sample G...
University of Minnesota Ph.D. dissertation. April 2020. Major: Computer Science. Advisor: Arindam Ba...
Recent years have witnessed huge advances in machine learning (ML) and its applications, especially ...
The amount of data being generated and stored is growing exponentially, owed in part to the continui...
We describe two techniques that signicantly improve the running time of several stan-dard machine-le...
Nowadays, the major challenge in machine learning is the ‘Big Data’ challenge. The big data problems...
One of the fundamental machine learning tasks is that of predictive classification. Given that organ...
Machine learning has achieved tremendous successes and played increasingly essential roles in many a...
This work considers optimization methods for large-scale machine learning (ML). Optimization in ML ...
We examine the learning-curve sampling method, an approach for applying machinelearning algorithms t...
Thesis (Ph.D.)--University of Washington, 2018To learn from large datasets, modern machine learning ...
Given a large data set and a classification learning algorithm, Progressive Sampling (PS) uses incre...
The rapid development of modern information technology has significantly facilitated the generation,...
This paper reviews the appropriateness for application to large data sets of standard machine learni...
Importance sampling, a variant of online sampling, is often used in neural network training to impro...
Abstract This research paper presents an innovative approach to gradient descent known as ‘‘Sample G...
University of Minnesota Ph.D. dissertation. April 2020. Major: Computer Science. Advisor: Arindam Ba...
Recent years have witnessed huge advances in machine learning (ML) and its applications, especially ...
The amount of data being generated and stored is growing exponentially, owed in part to the continui...
We describe two techniques that signicantly improve the running time of several stan-dard machine-le...