Emerging heterogeneous hardware systems and applications that have shared data between multiple CPU cores and computation accelerators bring the need for efficient and flexible cache coherence support. Since different devices like CPUs, GPUs and accelerators have diverse memory demands and different data-sharing patterns, Spandex was proposed to efficiently integrate devices with different cache coherence protocols. The flexibility, simplicity and scalability of Spandex make it suitable for maintaining cache coherence in complicated SoCs. In addition, the introduction of Flexible Coherence Specialization (FCS) in Spandex further improves the granularity of flexibility from device granularity to address and request granularity. However, even...
In embedded system-on-a-chip (SoC) applications, the demand for integrating heterogeneous processors...
Cache coherence and memory consistency are of the most decisive and challenging issues in the design...
In this paper, we study a hardware-supported, compilerdirected (HSCD) cache coherence scheme, which ...
System-on-a-chip (SoC) designs is characterized by heavy reuse of IP blocks to satisfy specific comp...
Many future heterogeneous systems will integrate CPUs and GPUs physically on a single chip and logic...
Graphics Processing Units (GPUs) have been shown to be effective at achieving large speedups over co...
The end of Dennard scaling and Moore's law has motivated a rise in the use of parallelism and hardwa...
Field-Programmable Gate Arrays (FPGAs) systems now comprise many processing elements that are proce...
New generation System-on-Chips will be extremely complex devices, composed from complex subsystems, ...
For a long time, most discrete accelerators have been attached to host systems using various generat...
In a multiprocessor system-on-chip (MPSoC) private caches introduce the cache coherence problem. Her...
In modern techniques of building processors, manufactures using more than one processor in the integ...
Cache attacks are widespread on microprocessors and multi-processor system-on-chips but have not yet...
2018-02-23Graphics Processing Units (GPUs) are designed primarily to execute multimedia, and game re...
Cache coherence is one of the main challenges to tackle when designing a shared-memory multiprocesso...
In embedded system-on-a-chip (SoC) applications, the demand for integrating heterogeneous processors...
Cache coherence and memory consistency are of the most decisive and challenging issues in the design...
In this paper, we study a hardware-supported, compilerdirected (HSCD) cache coherence scheme, which ...
System-on-a-chip (SoC) designs is characterized by heavy reuse of IP blocks to satisfy specific comp...
Many future heterogeneous systems will integrate CPUs and GPUs physically on a single chip and logic...
Graphics Processing Units (GPUs) have been shown to be effective at achieving large speedups over co...
The end of Dennard scaling and Moore's law has motivated a rise in the use of parallelism and hardwa...
Field-Programmable Gate Arrays (FPGAs) systems now comprise many processing elements that are proce...
New generation System-on-Chips will be extremely complex devices, composed from complex subsystems, ...
For a long time, most discrete accelerators have been attached to host systems using various generat...
In a multiprocessor system-on-chip (MPSoC) private caches introduce the cache coherence problem. Her...
In modern techniques of building processors, manufactures using more than one processor in the integ...
Cache attacks are widespread on microprocessors and multi-processor system-on-chips but have not yet...
2018-02-23Graphics Processing Units (GPUs) are designed primarily to execute multimedia, and game re...
Cache coherence is one of the main challenges to tackle when designing a shared-memory multiprocesso...
In embedded system-on-a-chip (SoC) applications, the demand for integrating heterogeneous processors...
Cache coherence and memory consistency are of the most decisive and challenging issues in the design...
In this paper, we study a hardware-supported, compilerdirected (HSCD) cache coherence scheme, which ...