The recent dramatic progress in machine learning is partially attributed to the availability of high-performant computers and development tools. The accelerated linear algebra (XLA) compiler is one such tool that automatically optimises array operations (mostly fusion to reduce memory operations) and compiles the optimised operations into high-performant programs specific to target computing platforms. Like machine-learning models, numerical models are often expressed in array operations, and thus their performance can be boosted by XLA. This study is the first of its kind to examine the efficiency of XLA for numerical models, and the efficiency is examined stringently by comparing its performance with that of optimal implementations. Two s...
International audienceIn this work, numerical algebraic operations are performed by using several li...
This dissertation details contributions made by the author to the field of computer science while wo...
Relative efficiency of XLA for numerical models (hollow = f64; filled = f32; circles = HEAT1D; trian...
The recent dramatic progress in machine learning is partially attributed to the availability of high...
The recent dramatic progress in machine learning is partially attributed to the availability of high...
The recent dramatic progress in machine learning is partially attributed to the availability of high...
The recent dramatic progress in machine learning is partially attributed to the availability of high...
The recent dramatic progress in machine learning is partially attributed to the availability of high...
Abstract On modern architectures, the performance of 32-bit operations is often at least twice as fa...
This thesis describes novel techniques and test implementations for optimizing numerically intensive...
A plethora of program analysis and optimization techniques rely on linear programming at their heart...
Computers are powerful tools which perform fast, accurate calculations over huge sets of data. Howev...
Machine Learning involves analysing large sets of training data to make predictions and decisions to...
This dissertation incorporates two research projects: performance modeling and prediction for dense ...
Training machine learning (ML) algorithms is a computationally intensive process, which is frequentl...
International audienceIn this work, numerical algebraic operations are performed by using several li...
This dissertation details contributions made by the author to the field of computer science while wo...
Relative efficiency of XLA for numerical models (hollow = f64; filled = f32; circles = HEAT1D; trian...
The recent dramatic progress in machine learning is partially attributed to the availability of high...
The recent dramatic progress in machine learning is partially attributed to the availability of high...
The recent dramatic progress in machine learning is partially attributed to the availability of high...
The recent dramatic progress in machine learning is partially attributed to the availability of high...
The recent dramatic progress in machine learning is partially attributed to the availability of high...
Abstract On modern architectures, the performance of 32-bit operations is often at least twice as fa...
This thesis describes novel techniques and test implementations for optimizing numerically intensive...
A plethora of program analysis and optimization techniques rely on linear programming at their heart...
Computers are powerful tools which perform fast, accurate calculations over huge sets of data. Howev...
Machine Learning involves analysing large sets of training data to make predictions and decisions to...
This dissertation incorporates two research projects: performance modeling and prediction for dense ...
Training machine learning (ML) algorithms is a computationally intensive process, which is frequentl...
International audienceIn this work, numerical algebraic operations are performed by using several li...
This dissertation details contributions made by the author to the field of computer science while wo...
Relative efficiency of XLA for numerical models (hollow = f64; filled = f32; circles = HEAT1D; trian...