General purpose graphics processing units (GP-GPU), owing to their enormous thread-level parallelism, can significantly improve the power consumption at the near-threshold (NTC) operating region, while offering close to a super-threshold performance. However, process variation (PV) can drastically reduce the GPU performance at NTC. In this work, choke points—a unique device-level characteristic of PV at NTC—that can exacerbate the warp criticality problem in GPUs have been explored. It is shown that the modern warp schedulers cannot tackle the choke point induced critical warps in an NTC GPU. Additionally, Choke Point Aware Warp Speculator, a circuit-architectural solution is proposed to dynamically predict the critical warps in GPUs, and a...
High throughput architectures rely on high thread-level parallelism (TLP) to hide execution latencie...
The ability to perform fast context-switching and massive multi-threading is the forte of modern GPU...
The ability to perform fast context-switching and massive multi-threading is the forte of modern GPU...
General-purpose graphics processing units (GPGPUs), due to their enormous parallelism, have found ub...
Over the last decade, General Purpose Graphics Processing Units (GPGPUs) have garnered a substantial...
Over the last decade, Graphics Processing Units (GPUs) have been used extensively in gaming consoles...
Over the last decade, Graphics Processing Units (GPUs) have been used extensively in gaming consoles...
Over the last decade, Graphics Processing Units (GPUs) have been used extensively in gaming consoles...
abstract: With the massive multithreading execution feature, graphics processing units (GPUs) have b...
The growing market of portable devices and smart wearables has contributed to innovation and develop...
There has been a tremendous growth in the use of Graphics Processing Units (GPU) for the acceleratio...
The growing market of portable devices and smart wearables has contributed to innovation and develop...
Many applications with regular parallelism have been shown to benefit from using Graphics Processing...
Many applications with regular parallelism have been shown to benefit from using Graphics Processing...
Near-threshold computing (NTC) has the potential to significantly improve efficiency in high through...
High throughput architectures rely on high thread-level parallelism (TLP) to hide execution latencie...
The ability to perform fast context-switching and massive multi-threading is the forte of modern GPU...
The ability to perform fast context-switching and massive multi-threading is the forte of modern GPU...
General-purpose graphics processing units (GPGPUs), due to their enormous parallelism, have found ub...
Over the last decade, General Purpose Graphics Processing Units (GPGPUs) have garnered a substantial...
Over the last decade, Graphics Processing Units (GPUs) have been used extensively in gaming consoles...
Over the last decade, Graphics Processing Units (GPUs) have been used extensively in gaming consoles...
Over the last decade, Graphics Processing Units (GPUs) have been used extensively in gaming consoles...
abstract: With the massive multithreading execution feature, graphics processing units (GPUs) have b...
The growing market of portable devices and smart wearables has contributed to innovation and develop...
There has been a tremendous growth in the use of Graphics Processing Units (GPU) for the acceleratio...
The growing market of portable devices and smart wearables has contributed to innovation and develop...
Many applications with regular parallelism have been shown to benefit from using Graphics Processing...
Many applications with regular parallelism have been shown to benefit from using Graphics Processing...
Near-threshold computing (NTC) has the potential to significantly improve efficiency in high through...
High throughput architectures rely on high thread-level parallelism (TLP) to hide execution latencie...
The ability to perform fast context-switching and massive multi-threading is the forte of modern GPU...
The ability to perform fast context-switching and massive multi-threading is the forte of modern GPU...