Memory-intensive operations and their memory access latency are often the performance bottleneck in parallel applications. In this paper, we investigate the concept of an active memory operation, an active data processing operation performed on the memory side. Using active memory operations, we can replace multiple memory access transactions over the on-chip network, and the related computations on the processor side, with a smaller number of high-level transactions and computations on the memory side. To realize the concept, we have designed a special-purpose processor, called the active memory processor, which is tightly coupled with the memory and executes the active memory operations. In our case studies, we have applied the conce...
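As a rough illustration of the transaction-count argument in the abstract above, the following toy model compares a processor-side reduction, which issues one memory transaction per element, with a memory-side reduction issued as a single high-level request. The class `ToyMemory`, its `active_sum` method, and the one-transaction-per-request cost model are assumptions made for this sketch; they are not the paper's active memory processor or its instruction set.

```python
from dataclasses import dataclass


@dataclass
class ToyMemory:
    """Hypothetical memory module that counts on-chip network transactions."""
    cells: list
    transactions: int = 0

    def read(self, addr):
        # Assumed cost model: one network transaction per element read.
        self.transactions += 1
        return self.cells[addr]

    def active_sum(self, start, count):
        # Assumed cost model: one high-level transaction for the whole
        # request; the reduction itself runs on the memory side.
        self.transactions += 1
        return sum(self.cells[start:start + count])


def processor_side_sum(mem, start, count):
    """Fetch every element over the network, then add on the processor."""
    total = 0
    for addr in range(start, start + count):
        total += mem.read(addr)
    return total


def memory_side_sum(mem, start, count):
    """Send one request and let the memory side compute the sum."""
    return mem.active_sum(start, count)


if __name__ == "__main__":
    baseline = ToyMemory(list(range(8)))
    active = ToyMemory(list(range(8)))
    print(processor_side_sum(baseline, 0, 8), baseline.transactions)
    print(memory_side_sum(active, 0, 8), active.transactions)
```

Under these assumptions both variants return the same result, while the number of network transactions drops from one per element to one per operation. Real savings would depend on the interconnect, the memory organization, and the operation being offloaded.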
We introduce an architectural approach to improve memory system performance in both uniprocessor and...
We introduce an architectural approach to improve memory system performance in both uniprocessor and...
The associative memory (AM) system is a computing device made of hundreds of AM ASIC chips designed...
The performance of modern microprocessors is increasingly limited by their inability to hide main me...
Recent advances in emulated shared memory architectures have made it possible to exploit the full po...
The explosion of data availability and fast data analytic requirements led to the advent of data-int...
Our recent work on uniprocessor and single-node multiprocessor (SMP) active memory systems uses addr...
This paper presents a new concept called active data bitwidth, which is the effective data length of...
Inability to hide main memory latency has been increasingly limiting the performance of modern proce...
The increasing performance and decreasing cost of processors and memory are causing system intellige...
We show how key insights from our research into active memory systems, coupled with emerging trends ...
This paper presents a new concept called active data bitwidth, which is the effective data length of...
The increasing performance and decreasing cost of processors and memory are causing system intellige...
The associative memory (AM) chip is an ASIC device specifically designed to perform ``pattern matching'...
Processing-in-memory architectures promise increased computing performance at decreased costs in ene...