We present the design and optimization of a linear solver on General Purpose GPUs for the efficient and high-throughput evaluation of the marginalized graph kernel between pairs of labeled graphs. The solver implements a preconditioned conjugate gradient (PCG) method to compute the solution to a generalized Laplacian equation associated with the tensor product of two graphs. To cope with the gap between the instruction throughput and the memory bandwidth of current generation GPUs, our solver forms the tensor product linear system on-the-fly without storing it in memory when performing matrix-vector dot product operations in PCG. Such on-the-fly computation is accomplished by using threads in a warp to cooperatively stream the adjacency and...
Extended version of EuroGPU symposium article, in the International Conference on Parallel Computing...
Abstract—Krylov subspace solvers are often the method of choice when solving sparse linear systems i...
Graphs provide the ability to extract valuable met-rics from the structural properties of the underl...
We present the design and optimization of a linear solver on General Purpose GPUs for the efficient ...
Abstract. The limiting factor for efficiency of sparse linear solvers is the memory bandwidth. In th...
Scaling up the sparse matrix-vector multiplication kernel on modern Graphics Processing Units (GPU) ...
The original publication is available at www.springerlink.comInternational audienceA wide class of g...
Achieving high performance and performance portability for large-scale scientific applications is a ...
to appearInternational audienceA wide class of numerical methods needs to solve a linear system, whe...
Graphics processing units (GPUs) are used as accelerators for algorithms in which the same instructi...
International audienceWhereas most today parallel High Performance Computing (HPC) software is writt...
High-performance implementations of graph algorithms are challenging to implement on new parallel ha...
Abstract. Linear systems are required to solve in many scientific applications and the solution of t...
The modern GPUs are well suited for intensive computational tasks and massive parallel computation. ...
The graphics processing unit (GPU) was initially designed for raster-based graphics com- putations, ...
Extended version of EuroGPU symposium article, in the International Conference on Parallel Computing...
Abstract—Krylov subspace solvers are often the method of choice when solving sparse linear systems i...
Graphs provide the ability to extract valuable met-rics from the structural properties of the underl...
We present the design and optimization of a linear solver on General Purpose GPUs for the efficient ...
Abstract. The limiting factor for efficiency of sparse linear solvers is the memory bandwidth. In th...
Scaling up the sparse matrix-vector multiplication kernel on modern Graphics Processing Units (GPU) ...
The original publication is available at www.springerlink.comInternational audienceA wide class of g...
Achieving high performance and performance portability for large-scale scientific applications is a ...
to appearInternational audienceA wide class of numerical methods needs to solve a linear system, whe...
Graphics processing units (GPUs) are used as accelerators for algorithms in which the same instructi...
International audienceWhereas most today parallel High Performance Computing (HPC) software is writt...
High-performance implementations of graph algorithms are challenging to implement on new parallel ha...
Abstract. Linear systems are required to solve in many scientific applications and the solution of t...
The modern GPUs are well suited for intensive computational tasks and massive parallel computation. ...
The graphics processing unit (GPU) was initially designed for raster-based graphics com- putations, ...
Extended version of EuroGPU symposium article, in the International Conference on Parallel Computing...
Abstract—Krylov subspace solvers are often the method of choice when solving sparse linear systems i...
Graphs provide the ability to extract valuable met-rics from the structural properties of the underl...