High-performance computing (HPC) and machine learning (ML) have been widely adopted by both academia and industries to address enormous data problems at extreme scales. While research has reported on the interactions of HPC and ML, achieving high performance and scalability for parallel and distributed ML algorithms is still a challenging task. This dissertation first summarizes the major challenges for applying HPC to ML applications: 1) poor performance and scalability, 2) loss of the convergence rate, 3) lower quality of the trained model, and 4) a lack of performance optimization techniques designed for specific applications. Researchers can address the four challenges in new ML applications. This dissertation shows how to solve them fo...
Nowadays, many real-world applications can be represented as machine learning and graph processing (...
Training machine learning (ML) algorithms is a computationally intensive process, which is frequentl...
Implementing machine learning algorithms for large data, such as the Web graph and social networks, ...
Machine learning (ML) is a cornerstone of the new data revolution. Most attempts to scale machine le...
Large scale machine learning has many characteristics that can be exploited in the system designs to...
Conventionally programmed systems (e.g. robots) are not able to adapt to unforeseen changes in their...
To support large-scale machine learning, distributed training is a promising approach as large-scale...
Machine learning algorithms are now being deployed in practically all areas of our lives. Part of th...
Many machine learning algorithms iteratively process datapoints and transform global model parameter...
Big Data has been a catalyst force for the Machine Learning (ML) area, forcing us to rethink existin...
This thesis proposes several optimization methods that utilize parallel algorithms for large-scale m...
Machine learning algorithms are very successful in solving classification and regression problems, h...
Machine learning approaches have been widely adopted in recent years due to their capability of lear...
SystemML aims at declarative, large-scale machine learning (ML) on top of MapReduce, where high-leve...
The interdisciplinary field of neuroscience has made significant progress in recent decades, providi...
Nowadays, many real-world applications can be represented as machine learning and graph processing (...
Training machine learning (ML) algorithms is a computationally intensive process, which is frequentl...
Implementing machine learning algorithms for large data, such as the Web graph and social networks, ...
Machine learning (ML) is a cornerstone of the new data revolution. Most attempts to scale machine le...
Large scale machine learning has many characteristics that can be exploited in the system designs to...
Conventionally programmed systems (e.g. robots) are not able to adapt to unforeseen changes in their...
To support large-scale machine learning, distributed training is a promising approach as large-scale...
Machine learning algorithms are now being deployed in practically all areas of our lives. Part of th...
Many machine learning algorithms iteratively process datapoints and transform global model parameter...
Big Data has been a catalyst force for the Machine Learning (ML) area, forcing us to rethink existin...
This thesis proposes several optimization methods that utilize parallel algorithms for large-scale m...
Machine learning algorithms are very successful in solving classification and regression problems, h...
Machine learning approaches have been widely adopted in recent years due to their capability of lear...
SystemML aims at declarative, large-scale machine learning (ML) on top of MapReduce, where high-leve...
The interdisciplinary field of neuroscience has made significant progress in recent decades, providi...
Nowadays, many real-world applications can be represented as machine learning and graph processing (...
Training machine learning (ML) algorithms is a computationally intensive process, which is frequentl...
Implementing machine learning algorithms for large data, such as the Web graph and social networks, ...