A major trend in high performance computer architecture over the last two decades is the migration of memory in the form of high speed caches onto the microprocessor semiconductor die. Where temporal locality in the computation is high, caches prove very effective at hiding memory access latency and contention for communication resources. However where temporal locality is absent, caches may exhibit low hit rates resulting in poor operational efficiency. Vector computing exploiting pipelined arithmetic units and memory access address this challenge for certain forms of data access patterns, for example involving long contiguous data sets exhibiting high spatial locality. But for many advanced applications for science, technology, and nation...
The dominance of machine learning and the ending of Moore's law have renewed interests in Processor ...
Journal ArticleConventional microarchitectures choose a single memory hierarchy design point target...
A number of vector supercomputers feature very large memories. Unfortunately the large capacity memo...
A major trend in high performance computer architecture over the last two decades is the migration o...
Many high performance applications run well below the peak arithmetic performance of the underlying ...
Many high performance applications run well below the peak arithmetic performance of the underlying...
The explosive increase in data volume in emerging applications poses grand challenges to computing s...
Increasing levels of VLSI integration present new opportunities, and new challenges, for designers o...
Processing-in-memory (PIM) has been explored for decades by computer architects, yet it has never se...
Many modern workloads, such as neural networks, databases, and graph processing, are fundamentally m...
This paper discusses the importance of memory access optimizations which are shown to be highly effe...
While both processing and memory architectures are rapidly improving in performance, memory architec...
This report details the accomplishments of the 'Building More Powerful Less Expensive Supercomputers...
Many modern workloads, such as neural networks, databases, and graph processing, are fundamentally m...
International audienceThis paper introduces a new combination of software and hardware PIM (Process-...
The dominance of machine learning and the ending of Moore's law have renewed interests in Processor ...
Journal ArticleConventional microarchitectures choose a single memory hierarchy design point target...
A number of vector supercomputers feature very large memories. Unfortunately the large capacity memo...
A major trend in high performance computer architecture over the last two decades is the migration o...
Many high performance applications run well below the peak arithmetic performance of the underlying ...
Many high performance applications run well below the peak arithmetic performance of the underlying...
The explosive increase in data volume in emerging applications poses grand challenges to computing s...
Increasing levels of VLSI integration present new opportunities, and new challenges, for designers o...
Processing-in-memory (PIM) has been explored for decades by computer architects, yet it has never se...
Many modern workloads, such as neural networks, databases, and graph processing, are fundamentally m...
This paper discusses the importance of memory access optimizations which are shown to be highly effe...
While both processing and memory architectures are rapidly improving in performance, memory architec...
This report details the accomplishments of the 'Building More Powerful Less Expensive Supercomputers...
Many modern workloads, such as neural networks, databases, and graph processing, are fundamentally m...
International audienceThis paper introduces a new combination of software and hardware PIM (Process-...
The dominance of machine learning and the ending of Moore's law have renewed interests in Processor ...
Journal ArticleConventional microarchitectures choose a single memory hierarchy design point target...
A number of vector supercomputers feature very large memories. Unfortunately the large capacity memo...