Modern systems-on-chip (SoCs) include not only general-purpose CPUs but also specialized hardware accelerators. Typically, there are three coherence model choices to integrate an accelerator with the memory hierarchy: no coherence, coherent with the last-level cache (LLC), and private cache based full coherence. However, there has been very limited research on finding which coherence models are optimal for the accelerators of a complex many-Accelerator SoC. This paper focuses on determining a cost-Aware coherence interface for an SoC and its target application: find the best coherence models for the accelerators that optimize their power and performance, considering both workload characteristics and system-level contention. A novel comprehe...
There is a large, emerging, and commercially relevant class of applications which stands to be enabl...
With the advancement of design and fabrication of high-performance integrated circuits technology, i...
Journal ArticleImprovements in semiconductor technology have made it possible to include multiple p...
The modern system-on-chip (SoC) of the current exascale computing era is complex. These SoCs not onl...
International audienceModern SoC systems consist of general-purpose processor cores augmented with l...
System-on-a-chip (SoC) designs is characterized by heavy reuse of IP blocks to satisfy specific comp...
The end of Dennard scaling and Moore's law has motivated a rise in the use of parallelism and hardwa...
Cache coherence is one of the main challenges to tackle when designing a shared-memory multiprocesso...
Caches have the potential to provide multiprocessors with an automatic mechanism for reducing both n...
This work describes a cache architecture and memory model for 1000+ core microprocessors. Our appro...
Multicore computing have presented many challenges for system designers; one of which is data consis...
Shared memory is a common interprocessor communi-cation paradigm for single-chip multi-processor pla...
New generation System-on-Chips will be extremely complex devices, composed from complex subsystems, ...
In heterogeneous computer architectures, the serial part of an application is coupled with domain-sp...
Graphics Processing Units (GPUs) have been shown to be effective at achieving large speedups over co...
There is a large, emerging, and commercially relevant class of applications which stands to be enabl...
With the advancement of design and fabrication of high-performance integrated circuits technology, i...
Journal ArticleImprovements in semiconductor technology have made it possible to include multiple p...
The modern system-on-chip (SoC) of the current exascale computing era is complex. These SoCs not onl...
International audienceModern SoC systems consist of general-purpose processor cores augmented with l...
System-on-a-chip (SoC) designs is characterized by heavy reuse of IP blocks to satisfy specific comp...
The end of Dennard scaling and Moore's law has motivated a rise in the use of parallelism and hardwa...
Cache coherence is one of the main challenges to tackle when designing a shared-memory multiprocesso...
Caches have the potential to provide multiprocessors with an automatic mechanism for reducing both n...
This work describes a cache architecture and memory model for 1000+ core microprocessors. Our appro...
Multicore computing have presented many challenges for system designers; one of which is data consis...
Shared memory is a common interprocessor communi-cation paradigm for single-chip multi-processor pla...
New generation System-on-Chips will be extremely complex devices, composed from complex subsystems, ...
In heterogeneous computer architectures, the serial part of an application is coupled with domain-sp...
Graphics Processing Units (GPUs) have been shown to be effective at achieving large speedups over co...
There is a large, emerging, and commercially relevant class of applications which stands to be enabl...
With the advancement of design and fabrication of high-performance integrated circuits technology, i...
Journal ArticleImprovements in semiconductor technology have made it possible to include multiple p...