Heterogeneous systems combine general-purpose CPUs with domain-specific accelerators like GPUs. Recent heterogeneous system designs have enabled GPUs to request OS services, but the domain-specific nature of accelerators means that they must rely on the CPUs to handle these requests. Such system service requests can unintentionally harm the performance of unrelated CPU applications. Tests on a real heterogeneous processor demonstrate that GPU system service requests can degrade contemporaneous CPU application performance by up to 44% and can reduce energy efficiency by limiting CPU sleep time. The reliance on busy CPU cores to perform the system services can also slow down GPU work by up to 18%. This new form of interference is found only i...
Integrated CPU-GPU architecture provides excellent acceleration capabilities for data parallel appli...
Reconfigurable heterogeneous systems-on-chips (SoCs) integrating multiple accelerators are cost-effe...
<p>The continued growth of the computational capability of throughput processors has made throughput...
<p>Heterogeneous architectures consisting of general-purpose CPUs and throughput-optimized GPUs are ...
Abstract—Heterogeneous architectures consisting of general-purpose CPUs and throughput-optimized GPU...
Graphic Processing Units (GPUs) are widely used in high performance computing, due to their high com...
Accelerated computing has become pervasive for increasing the computational power and energy efficie...
In High-Performance Computing (HPC), GPU-based accelerators are pervasive for two reasons: first, GP...
Power-performance efficiency has become a central focus that is challenging in heterogeneous process...
In heterogeneous CPU+GPU SoCs where a single DRAM is shared between both devices, concurrent memory ...
Heterogeneous parallel architectures like those comprised of CPUs and GPUs are a tantalizing compute...
In recent years the power wall has prevented the continued scaling of single core performance. This ...
This paper examines energy management in a heterogeneous processor consisting of an integrated CPU-G...
High compute-density with massive thread-level parallelism of Graphics Processing Units (GPUs) is be...
Current heterogeneous CPU-GPU architectures integrate general purpose CPUs and highly thread-level p...
Integrated CPU-GPU architecture provides excellent acceleration capabilities for data parallel appli...
Reconfigurable heterogeneous systems-on-chips (SoCs) integrating multiple accelerators are cost-effe...
<p>The continued growth of the computational capability of throughput processors has made throughput...
<p>Heterogeneous architectures consisting of general-purpose CPUs and throughput-optimized GPUs are ...
Abstract—Heterogeneous architectures consisting of general-purpose CPUs and throughput-optimized GPU...
Graphic Processing Units (GPUs) are widely used in high performance computing, due to their high com...
Accelerated computing has become pervasive for increasing the computational power and energy efficie...
In High-Performance Computing (HPC), GPU-based accelerators are pervasive for two reasons: first, GP...
Power-performance efficiency has become a central focus that is challenging in heterogeneous process...
In heterogeneous CPU+GPU SoCs where a single DRAM is shared between both devices, concurrent memory ...
Heterogeneous parallel architectures like those comprised of CPUs and GPUs are a tantalizing compute...
In recent years the power wall has prevented the continued scaling of single core performance. This ...
This paper examines energy management in a heterogeneous processor consisting of an integrated CPU-G...
High compute-density with massive thread-level parallelism of Graphics Processing Units (GPUs) is be...
Current heterogeneous CPU-GPU architectures integrate general purpose CPUs and highly thread-level p...
Integrated CPU-GPU architecture provides excellent acceleration capabilities for data parallel appli...
Reconfigurable heterogeneous systems-on-chips (SoCs) integrating multiple accelerators are cost-effe...
<p>The continued growth of the computational capability of throughput processors has made throughput...