The increasing gap between processor and memory performance has led to new architectural models for memory-intensive applications. In this paper, we explore the performance of a set of memory-intensive benchmarks and use them to compare the performance of conventional cache-based microprocessors to a mixed logic and DRAM processor called VIRAM. The benchmarks are based on problem statements, rather than specific implementations, and in each case we explore the fundamental hardware requirements of the problem, as well as alternative algorithms and data structures that can help expose fine-grained parallelism or simplify memory access patterns. The benchmarks are characterized by their memory access patterns, their basic structures, ...
While preparing material for an article describing fundamental performance characteristics of Cray P...
This paper presents the architecture of a high-performance intermediate-level memory subsystem. The ...
Microprocessors and memory systems suffer from a growing gap in performance. We introduce Active Pag...
Computer memory systems are increasingly a bottleneck limiting application performance. IRAM archite...
Many high performance applications run well below the peak arithmetic performance of the underlying ...
Many high performance applications run well below the peak arithmetic performance of the underlying...
Computer memory systems are increasingly a bottleneck limiting the performance of applications and o...
Many modern workloads, such as neural networks, databases, and graph processing, are fundamentally m...
Computer memory is organized into a hierarchy. At the highest level are the processor registers, nex...
Abstract|As the performance gap between processors and main memory continues to widen, increasingly ...
The latest CPUs(computer cpu processors) employ multiple cores, massively superscalar pipelines, out...
Dynamic Random Access Memories (DRAM) are the dominant solid-state memory devices used for primary m...
Many high performance applications run well below the peak arithmetic performance of the underlying...
Abstract: Two trends call into question the current practice of microprocessors and DRAMs being fabr...
Application performance on modern microprocessors depends heavily on performance related characteris...
While preparing material for an article describing fundamental performance characteristics of Cray P...
This paper presents the architecture of a high-performance intermediate-level memory subsystem. The ...
Microprocessors and memory systems suffer from a growing gap in performance. We introduce Active Pag...
Computer memory systems are increasingly a bottleneck limiting application performance. IRAM archite...
Many high performance applications run well below the peak arithmetic performance of the underlying ...
Many high performance applications run well below the peak arithmetic performance of the underlying...
Computer memory systems are increasingly a bottleneck limiting the performance of applications and o...
Many modern workloads, such as neural networks, databases, and graph processing, are fundamentally m...
Computer memory is organized into a hierarchy. At the highest level are the processor registers, nex...
Abstract|As the performance gap between processors and main memory continues to widen, increasingly ...
The latest CPUs(computer cpu processors) employ multiple cores, massively superscalar pipelines, out...
Dynamic Random Access Memories (DRAM) are the dominant solid-state memory devices used for primary m...
Many high performance applications run well below the peak arithmetic performance of the underlying...
Abstract: Two trends call into question the current practice of microprocessors and DRAMs being fabr...
Application performance on modern microprocessors depends heavily on performance related characteris...
While preparing material for an article describing fundamental performance characteristics of Cray P...
This paper presents the architecture of a high-performance intermediate-level memory subsystem. The ...
Microprocessors and memory systems suffer from a growing gap in performance. We introduce Active Pag...