GPUs are gaining fast adoption as high-performance computing architectures, mainly because of their impressive peak performance. Yet most applications only achieve small fractions of this performance. While both programmers and architects have clear opinions about the causes of this performance gap, finding and quantifying the real problems remains a topic for performance modeling tools. In this paper, we sketch the landscape of modern GPUs' performance limiters and optimization opportunities, and dive into details on modeling attempts for GPU-based systems. We highlight the specific features of the relevant contributions in this field, along with the optimization and design spaces they explore. We further use typical kernel examples with v...
Communicated by Susumu Matsumae In this paper we describe our performance-breakdown model for GPU pr...
One of the major challenges faced by the HPC community today is user-friendly and accurate heterogen...
We present an efficient model to analyze and improve the performance of general-purpose computation ...
Computing systems today rely on massively parallel and heterogeneous architectures to promise very h...
We develop a microbenchmark-based performance model for NVIDIA GeForce 200-series GPUs. Our model id...
We develop a microbenchmark-based performance model for NVIDIA GeForce 200-series GPUs. Our model id...
The increasing programmability, performance, and cost/effectiveness of GPUs have led to a widespread...
High-level tools for analyzing and predicting the performance GPU-accelerated applications are scarc...
This work presents an in-depth study of the analytical models for the performance estimation of GPUs...
Abstract — GPU has become a first-order computing plat-form. Nonetheless, not many performance model...
Abstract—To exploit the abundant computational power of the world’s fastest supercomputers, an even ...
The significant growth in computational power of mod-ern Graphics Processing Units(GPUs) coupled wit...
The significant growth in computational power of modern Graphics Processing Units (GPUs) coupled wit...
Abstract. Using Graphics Processing Units (GPUs) to solve general purpose problems has received sign...
Part 6: Poster SessionsInternational audienceUsing Graphics Processing Units (GPUs) to solve general...
Communicated by Susumu Matsumae In this paper we describe our performance-breakdown model for GPU pr...
One of the major challenges faced by the HPC community today is user-friendly and accurate heterogen...
We present an efficient model to analyze and improve the performance of general-purpose computation ...
Computing systems today rely on massively parallel and heterogeneous architectures to promise very h...
We develop a microbenchmark-based performance model for NVIDIA GeForce 200-series GPUs. Our model id...
We develop a microbenchmark-based performance model for NVIDIA GeForce 200-series GPUs. Our model id...
The increasing programmability, performance, and cost/effectiveness of GPUs have led to a widespread...
High-level tools for analyzing and predicting the performance GPU-accelerated applications are scarc...
This work presents an in-depth study of the analytical models for the performance estimation of GPUs...
Abstract — GPU has become a first-order computing plat-form. Nonetheless, not many performance model...
Abstract—To exploit the abundant computational power of the world’s fastest supercomputers, an even ...
The significant growth in computational power of mod-ern Graphics Processing Units(GPUs) coupled wit...
The significant growth in computational power of modern Graphics Processing Units (GPUs) coupled wit...
Abstract. Using Graphics Processing Units (GPUs) to solve general purpose problems has received sign...
Part 6: Poster SessionsInternational audienceUsing Graphics Processing Units (GPUs) to solve general...
Communicated by Susumu Matsumae In this paper we describe our performance-breakdown model for GPU pr...
One of the major challenges faced by the HPC community today is user-friendly and accurate heterogen...
We present an efficient model to analyze and improve the performance of general-purpose computation ...