The memory requirements of emerging applications, especially in the domain of machine learn- ing workloads, is outpacing the capacity of traditional memory devices like DRAM. At the same time, heterogeneity in the memory hierarchy is emerging on multiple fronts both with high-capacity, low-bandwidth devices like Intel Optane Data-Center (DC) Persistent Memory Modules (PMM), and low-capacity, high-bandwidth devices like High Bandwidth Memory (HBM). A fundamental question introduced by this heterogeneity is: how do we efficiently manage application data to fully exploit the properties of the underlying memory technologies? This work explores techniques and ideas towards answering this question and understanding the performance implications of ...
There has been a recent emergence of applications from the domain of machine learning, data mining, ...
We propose to overcome the memory capacity limitation of GPUs with a Heterogeneous Memory Stack (HMS...
To address the 'memory wall' problem of future systems, vendors are creating heterogeneous memory st...
The memory requirements of emerging applications, especially in the domain of machine learn- ing wor...
Conventional compute and memory systems scaling to achieve higher performance and lower cost and pow...
Persistent Memory (PMEM), also known as Non-Volatile Memory (NVM), can deliver higher density and lo...
Hardware heterogeneity is becoming an increasingly common feature in high-performance computing syst...
The memory system has been evolving at a fast pace recently, driven by the emergence of large-scale ...
DRAM caches are important for enabling effective heterogeneous memory systems that can transparently...
Many promising memory technologies, such as non-volatile, storage-class memories and high-bandwidth,...
International audienceOver the past decades, the performance gap between the memory subsystem and co...
pre-printThe DRAM main memory system in modern servers is largely homogeneous. In recent years, DRAM...
As device technologies scale in the nanometer era, the current off-chip DRAM technologies are very c...
Training machine learning (ML) algorithms is a computationally intensive process, which is frequentl...
Achieving high application performance depends on the combination of memory footprint, instruction m...
There has been a recent emergence of applications from the domain of machine learning, data mining, ...
We propose to overcome the memory capacity limitation of GPUs with a Heterogeneous Memory Stack (HMS...
To address the 'memory wall' problem of future systems, vendors are creating heterogeneous memory st...
The memory requirements of emerging applications, especially in the domain of machine learn- ing wor...
Conventional compute and memory systems scaling to achieve higher performance and lower cost and pow...
Persistent Memory (PMEM), also known as Non-Volatile Memory (NVM), can deliver higher density and lo...
Hardware heterogeneity is becoming an increasingly common feature in high-performance computing syst...
The memory system has been evolving at a fast pace recently, driven by the emergence of large-scale ...
DRAM caches are important for enabling effective heterogeneous memory systems that can transparently...
Many promising memory technologies, such as non-volatile, storage-class memories and high-bandwidth,...
International audienceOver the past decades, the performance gap between the memory subsystem and co...
pre-printThe DRAM main memory system in modern servers is largely homogeneous. In recent years, DRAM...
As device technologies scale in the nanometer era, the current off-chip DRAM technologies are very c...
Training machine learning (ML) algorithms is a computationally intensive process, which is frequentl...
Achieving high application performance depends on the combination of memory footprint, instruction m...
There has been a recent emergence of applications from the domain of machine learning, data mining, ...
We propose to overcome the memory capacity limitation of GPUs with a Heterogeneous Memory Stack (HMS...
To address the 'memory wall' problem of future systems, vendors are creating heterogeneous memory st...