Tuning GPU applications is a very challenging task as any source-code optimization can sensibly impact performance, power, and energy consumption of the GPU device. Such an impact also depends on the GPU on which the application is run. This paper presents a suite of microbenchmarks that provides the actual characteristics of specific GPU device components (e.g., arithmetic instruction units, memories, etc.) in terms of throughput, power, and energy consumption. It shows how the suite can be combined to standard profiler information to efficiently drive the application tuning by considering the three design constraints (power, performance, energy consumption) and the characteristics of the target GPU device
Massive GPU acceleration processors have been used in high-performance computing systems. The Dennar...
On-chip parallelism with GPU accelerators is now ubiquitous and has received significant attention i...
Dynamic voltage and frequency scaling (DVFS) is an important solution to balance performance and ene...
GPU-accelerated applications are becoming increasingly common in high-performance computing as well ...
The increasing programmability, performance, and cost/effectiveness of GPUs have led to a widespread...
GPUs are widely being used to meet the ever increasing demands of High performance computing. High-e...
Graphics Processing Units (GPUs) have revolutionized the computing landscape over the past decade. H...
General-purpose GPUs (GPGPUs) are becoming prevalent in mainstream computing, and performance per wa...
Abstract—Graphics processing units (GPUs) provide an order-of-magnitude improvement on peak performa...
Low-power GPUs have become ubiquitous, they can be found in domains ranging from wearable and mobile...
Energy optimization is an increasingly important aspect of today’s high-performance computing applic...
Power-performance efficiency has become a central focus that is challenging in heterogeneous process...
The overarching goal of this thesis is to provide an algorithm-centric approach to analyzing the rel...
Abstract- Future computing systems, from handhelds to su-percomputers, will undoubtedly be more para...
Existing architectural power models for GPUs count activities such as executing floating point or in...
Massive GPU acceleration processors have been used in high-performance computing systems. The Dennar...
On-chip parallelism with GPU accelerators is now ubiquitous and has received significant attention i...
Dynamic voltage and frequency scaling (DVFS) is an important solution to balance performance and ene...
GPU-accelerated applications are becoming increasingly common in high-performance computing as well ...
The increasing programmability, performance, and cost/effectiveness of GPUs have led to a widespread...
GPUs are widely being used to meet the ever increasing demands of High performance computing. High-e...
Graphics Processing Units (GPUs) have revolutionized the computing landscape over the past decade. H...
General-purpose GPUs (GPGPUs) are becoming prevalent in mainstream computing, and performance per wa...
Abstract—Graphics processing units (GPUs) provide an order-of-magnitude improvement on peak performa...
Low-power GPUs have become ubiquitous, they can be found in domains ranging from wearable and mobile...
Energy optimization is an increasingly important aspect of today’s high-performance computing applic...
Power-performance efficiency has become a central focus that is challenging in heterogeneous process...
The overarching goal of this thesis is to provide an algorithm-centric approach to analyzing the rel...
Abstract- Future computing systems, from handhelds to su-percomputers, will undoubtedly be more para...
Existing architectural power models for GPUs count activities such as executing floating point or in...
Massive GPU acceleration processors have been used in high-performance computing systems. The Dennar...
On-chip parallelism with GPU accelerators is now ubiquitous and has received significant attention i...
Dynamic voltage and frequency scaling (DVFS) is an important solution to balance performance and ene...