We present a technique for designing memory-bound algorithms with high data reuse on Graphics Processing Units (GPUs) equipped with close-to-ALU software-managed memory. The approach is based on the efficient use of this memory through the implementation of a software-managed cache. We also present an analytical model for performance analysis of such algorithms. We apply this technique to the implementation of the GPU-based solver of the sum-product or marginalize a product of functions (MPF) problem, which arises in a wide variety of real-life applications in artificial intelligence, statistics, image processing, and digital communications. Our motivation to accelerate MPF originated in the context of the analysis of genetic diseases, ...
Graphics Processing Units (GPUs) are becoming more and more prevalent in general-purpose computing. ...
This paper presents a novel optimizing compiler for general purpose computation on graphics processi...
Background: In the last few years, the Non-negative Matrix Factorization (NMF) technique has gained ...
This paper deals with solving large instances of the Linear Sum Assignment Problems (LSAPs) under re...
GPUs have become popular due to their high computational power. Data scientists rely on GPUs to proc...
The graphics processing unit (GPU) was initially designed for raster-based graphics com- putations, ...
GPUs have become popular due to their high computational power. Data scientists rely on GPUs to proc...
The last few years has seen an explosion of effort in designing algorithms that harness the power of...
Multiple precision (MP) arithmetic is a core building block of a wide variety of algorithms in compu...
The computation power from graphics processing units (GPUs) has become prevalent in many fields of c...
Stencil computations form the basis for computer simulations across almost every field of science, s...
Abstract. In this paper we modify a fast heuristic solver for the Linear Sum Assignment Problem (LSA...
MATLAB is an array language that is being increasingly used for prototyping and developing code for ...
Developing high performance GPGPU programs is challenging for application developers since the perfo...
Developing high performance GPGPU programs is challenging for application developers since the perfo...
Graphics Processing Units (GPUs) are becoming more and more prevalent in general-purpose computing. ...
This paper presents a novel optimizing compiler for general purpose computation on graphics processi...
Background: In the last few years, the Non-negative Matrix Factorization (NMF) technique has gained ...
This paper deals with solving large instances of the Linear Sum Assignment Problems (LSAPs) under re...
GPUs have become popular due to their high computational power. Data scientists rely on GPUs to proc...
The graphics processing unit (GPU) was initially designed for raster-based graphics com- putations, ...
GPUs have become popular due to their high computational power. Data scientists rely on GPUs to proc...
The last few years has seen an explosion of effort in designing algorithms that harness the power of...
Multiple precision (MP) arithmetic is a core building block of a wide variety of algorithms in compu...
The computation power from graphics processing units (GPUs) has become prevalent in many fields of c...
Stencil computations form the basis for computer simulations across almost every field of science, s...
Abstract. In this paper we modify a fast heuristic solver for the Linear Sum Assignment Problem (LSA...
MATLAB is an array language that is being increasingly used for prototyping and developing code for ...
Developing high performance GPGPU programs is challenging for application developers since the perfo...
Developing high performance GPGPU programs is challenging for application developers since the perfo...
Graphics Processing Units (GPUs) are becoming more and more prevalent in general-purpose computing. ...
This paper presents a novel optimizing compiler for general purpose computation on graphics processi...
Background: In the last few years, the Non-negative Matrix Factorization (NMF) technique has gained ...