Modern data centers are increasingly employing GPUs to accelerate services. These GPUs are commonly used to processes Neural Network-based requests such as, image classification, speech recognition and natural language processing. However, current GPUs have poor built in power management and are not optimized for varying request levels that are typical in data centers and cloud computing. In this work, we first characterize dynamic power management on real GPUs. We show a non linear diminishing relationship between frequency and power. To overcome this constraint, we explore the possible effects of Thread Block scaling to increase throughput. Our Thread Block scaling characterization shows the number of thread blocks per request can be limi...
Computational power of embedded graphics processing units (GPUs) in mobile system-on-chips has been ...
General-purpose graphics processing units (GPGPUs), due to their enormous parallelism, have found ub...
Improving energy efficiency is an ongoing challenge in HPC because of the ever-increasing need for p...
Power-performance efficiency has become a central focus that is challenging in heterogeneous process...
Thread parallel hardware, as the Graphics Processing Units (GPUs), greatly outperform CPUs in provid...
General-purpose GPUs (GPGPUs) are becoming prevalent in mainstream computing, and performance per wa...
Graph analysis is a fundamental building block in numerous computing domains. Recent research has lo...
Graphics processing units (GPUs) provide signifi-cant improvements in performance and performance-pe...
Graphic Processing Units (GPUs) are widely used in high performance computing, due to their high com...
Graphics processing units (GPUs) have become prevalent in modern computing systems. While their high...
The largest part of routers and switches, today deployed in production networks, has very limited en...
Today, hardware accelerators are widely accepted as a cost-effective solution for emerging applicati...
The massive parallelism provided by general-purpose GPUs (GPGPUs) possessing numerous compute thread...
Energy optimization is an increasingly important aspect of today's high-performance computing applic...
Graphics Processing Units (GPUs) have been predominantly accepted for various general purpose applic...
Computational power of embedded graphics processing units (GPUs) in mobile system-on-chips has been ...
General-purpose graphics processing units (GPGPUs), due to their enormous parallelism, have found ub...
Improving energy efficiency is an ongoing challenge in HPC because of the ever-increasing need for p...
Power-performance efficiency has become a central focus that is challenging in heterogeneous process...
Thread parallel hardware, as the Graphics Processing Units (GPUs), greatly outperform CPUs in provid...
General-purpose GPUs (GPGPUs) are becoming prevalent in mainstream computing, and performance per wa...
Graph analysis is a fundamental building block in numerous computing domains. Recent research has lo...
Graphics processing units (GPUs) provide signifi-cant improvements in performance and performance-pe...
Graphic Processing Units (GPUs) are widely used in high performance computing, due to their high com...
Graphics processing units (GPUs) have become prevalent in modern computing systems. While their high...
The largest part of routers and switches, today deployed in production networks, has very limited en...
Today, hardware accelerators are widely accepted as a cost-effective solution for emerging applicati...
The massive parallelism provided by general-purpose GPUs (GPGPUs) possessing numerous compute thread...
Energy optimization is an increasingly important aspect of today's high-performance computing applic...
Graphics Processing Units (GPUs) have been predominantly accepted for various general purpose applic...
Computational power of embedded graphics processing units (GPUs) in mobile system-on-chips has been ...
General-purpose graphics processing units (GPGPUs), due to their enormous parallelism, have found ub...
Improving energy efficiency is an ongoing challenge in HPC because of the ever-increasing need for p...