Many architectural ideas that appear to be useful from a hardware standpoint fail to achieve wide acceptance due to lack of compiler support. In this paper we explore the design of the VIRAM architecture from the perspective of compiler writers, describing some of the code generation problems that arise in VIRAM and their solutions in the VIRAM compiler. VIRAM is a single chip system designed primarily for multi-media. It combines vector processing with mixed logic and DRAM to acheive high performance with relatively low energy, area, and design complexity. The paper focuses on two aspects of the VIRAM compiler and architecture. The first problem is to take advantage of the onchip bandwidth for memory-intensive applications, include those w...
Most of the current processor architectures have word-addressable internal memories and wide data pa...
In numerous mobile applications involving complex video, image, signal, communication or security pr...
Despite the success of parallel architectures and domain-specific accelerators in boosting the perfo...
This work presents two emerging media microprocessors, VIRAM and Imagine, and comparesthe implement...
This work presents two emerging media microprocessors, VIRAM and Imagine, and compares the implement...
Many high performance applications run well below the peak arithmetic performance of the underlying...
Many high performance applications run well below the peak arithmetic performance of the underlying ...
Many high performance applications run well below the peak arithmetic performance of the underlying...
Before it can achieve wide acceptance,parallel computation must be made significantly easier to prog...
Power consumption and fabrication limitations are increasingly playing significant roles in the desi...
Vector IRAM integrates vector processing with embedded DRAM on a single chip to provide high multime...
To accomodate standards changes and algorithmic im-provements, functional reconfigurability is incre...
The increasing gap between processor and memory performance has led to new architectural models for...
The growing interest that multimedia processing has experimented during the last decade is motivatin...
We present a taxonomy and modular implementation approach for data-parallel accelerators, including ...
Most of the current processor architectures have word-addressable internal memories and wide data pa...
In numerous mobile applications involving complex video, image, signal, communication or security pr...
Despite the success of parallel architectures and domain-specific accelerators in boosting the perfo...
This work presents two emerging media microprocessors, VIRAM and Imagine, and comparesthe implement...
This work presents two emerging media microprocessors, VIRAM and Imagine, and compares the implement...
Many high performance applications run well below the peak arithmetic performance of the underlying...
Many high performance applications run well below the peak arithmetic performance of the underlying ...
Many high performance applications run well below the peak arithmetic performance of the underlying...
Before it can achieve wide acceptance,parallel computation must be made significantly easier to prog...
Power consumption and fabrication limitations are increasingly playing significant roles in the desi...
Vector IRAM integrates vector processing with embedded DRAM on a single chip to provide high multime...
To accomodate standards changes and algorithmic im-provements, functional reconfigurability is incre...
The increasing gap between processor and memory performance has led to new architectural models for...
The growing interest that multimedia processing has experimented during the last decade is motivatin...
We present a taxonomy and modular implementation approach for data-parallel accelerators, including ...
Most of the current processor architectures have word-addressable internal memories and wide data pa...
In numerous mobile applications involving complex video, image, signal, communication or security pr...
Despite the success of parallel architectures and domain-specific accelerators in boosting the perfo...