This work impliments GPU optimizations for the Cholesky decomposition and its derivative in the Stan Math library (Carpenter et al. 2015). The Stan library’s No-U-Turn sampler (NUTS) typically explores the target distribution more efficiently than alternative samplers, though it is computationally more expensive per log probability evaluation. This research is motivated by large Gaussian Process (GP) models, where the log probability evaluation is very expensive and dominated by the inversion of the covariance matrix typically done within the Cholesky decomposition. Experimental results show that GPU optimizations are not optimal for small n × m matrices, however N = 5000 matrices can see speedups of 6x while retaining precision. This is th...
We describe an efficient Bayesian parallel GPU implementation of two classic statistical models-the ...
We propose two high-level application programming interfaces (APIs) to use a graphics processing uni...
© 2019, Pleiades Publishing, Ltd. Practical applicability of many statistical algorithms is limited ...
Our presentations details the current state of and future work on the OpenCL-based framework that al...
This article describes advances in statistical computation for large-scale data analy-sis in structu...
As Central Processing Units (CPUs) and Graphical Processing Units (GPUs) get progressively better, d...
Abstract—Currently, state of the art libraries, like MAGMA, focus on very large linear algebra probl...
AbstractSolving a large number of relatively small linear systems has recently drawn more attention ...
A graphical processing unit (GPU) is a hardware device normally used to manipulate computer memory f...
In this work, we evaluate OpenCL as a programming tool for developing performance-portable applicati...
[[abstract]]In linear algebra, Cholesky factorization is useful in solving a system of equations wit...
The recent dramatic progress in machine learning is partially attributed to the availability of high...
Abstract: The graphics processing unit (GPU) has emerged as a power-ful and cost effective processor...
GPUs are getting more and more important in scientific computing, slowly growing from peripheral acc...
Data analysis is a rising field of interest for computer science research due to the growing amount ...
We describe an efficient Bayesian parallel GPU implementation of two classic statistical models-the ...
We propose two high-level application programming interfaces (APIs) to use a graphics processing uni...
© 2019, Pleiades Publishing, Ltd. Practical applicability of many statistical algorithms is limited ...
Our presentations details the current state of and future work on the OpenCL-based framework that al...
This article describes advances in statistical computation for large-scale data analy-sis in structu...
As Central Processing Units (CPUs) and Graphical Processing Units (GPUs) get progressively better, d...
Abstract—Currently, state of the art libraries, like MAGMA, focus on very large linear algebra probl...
AbstractSolving a large number of relatively small linear systems has recently drawn more attention ...
A graphical processing unit (GPU) is a hardware device normally used to manipulate computer memory f...
In this work, we evaluate OpenCL as a programming tool for developing performance-portable applicati...
[[abstract]]In linear algebra, Cholesky factorization is useful in solving a system of equations wit...
The recent dramatic progress in machine learning is partially attributed to the availability of high...
Abstract: The graphics processing unit (GPU) has emerged as a power-ful and cost effective processor...
GPUs are getting more and more important in scientific computing, slowly growing from peripheral acc...
Data analysis is a rising field of interest for computer science research due to the growing amount ...
We describe an efficient Bayesian parallel GPU implementation of two classic statistical models-the ...
We propose two high-level application programming interfaces (APIs) to use a graphics processing uni...
© 2019, Pleiades Publishing, Ltd. Practical applicability of many statistical algorithms is limited ...