The recent dramatic progress in machine learning is partly attributed to the availability of high-performance computers and development tools. The accelerated linear algebra (XLA) compiler is one such tool: it automatically optimises array operations (mostly by fusing them to reduce memory operations) and compiles the optimised operations into high-performance programs specific to the target computing platform. Like machine-learning models, numerical models are often expressed as array operations, so their performance can also be boosted by XLA. This study is the first of its kind to examine the efficiency of XLA for numerical models, and it does so stringently by comparing XLA's performance with that of optimal implementations. Two s...
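To make the idea of a numerical model expressed as array operations concrete, the following is a minimal sketch in JAX, which uses XLA as its compiler backend. The function `diffusion_step`, the coefficient `alpha`, and the grid size are hypothetical choices made for illustration; they are not taken from the study and are not necessarily related to the models it examines.

```python
import jax
import jax.numpy as jnp


def diffusion_step(u, alpha=0.1):
    """One explicit step of 1D diffusion with periodic boundaries (illustrative only)."""
    # Written as whole-array operations: executed one by one, each roll and
    # each arithmetic operation may produce its own temporary array.
    laplacian = jnp.roll(u, 1) - 2.0 * u + jnp.roll(u, -1)
    return u + alpha * laplacian


# jax.jit traces the function and passes the whole computation to XLA,
# which is then free to fuse the element-wise operations into fewer kernels.
diffusion_step_xla = jax.jit(diffusion_step)

# Hypothetical initial condition: one sine period on a 64-point periodic grid.
u0 = jnp.sin(jnp.linspace(0.0, 2.0 * jnp.pi, 64, endpoint=False))

u1_eager = diffusion_step(u0)      # operations dispatched one at a time
u1_xla = diffusion_step_xla(u0)    # single XLA-compiled program
```

Both calls compute the same update; only the way the operations are scheduled differs. Whether and how much the compiled version is faster depends on the array size and the target platform, so any timing comparison should be measured rather than assumed.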
This dissertation incorporates two research projects: performance modeling and prediction for dense ...
Abstract: On modern architectures, the performance of 32-bit operations is often at least twice as fa...
Research areas: Heterogeneous Computing, Statistical Machine Learning, Accelerator Design. A growing n...
Machine Learning involves analysing large sets of training data to make predictions and decisions to...
Training machine learning (ML) algorithms is a computationally intensive process, which is frequentl...
Computers are powerful tools which perform fast, accurate calculations over huge sets of data. Howev...
This thesis describes novel techniques and test implementations for optimizing numerically intensive...
As Central Processing Units (CPUs) and Graphics Processing Units (GPUs) get progressively better, d...
The resurgence of machine learning since the late 1990s has been enabled by significant advances in ...
Machine learning has risen to prominence in recent years thanks to advancements in computer technolo...