In this paper we study the performance improvements and trade-offs derived from an optimized mapping approach applied on a parametric coarse grained reconfigurable array architecture. The processing elements ’ local register files and the processing elements’ interconnection network is exploited for caching memory data values with data reuse opportunities. The data reused values are transferred through the processing elements’ interconnection network hence, relieving the bus from the burden of transferring these values. A novel mapping algorithm is also proposed that uses a modulo scheduling technique. This algorithm targets on a flexible architecture template which permits experimental exploration over different architecture alternatives. ...
Exploring the design space of an optimized compiler approach for mesh-like coarse-grained reconfigur...
Abstract Coarse-Grained Reconfigurable Array (CGRA) architectures accelerate the same inner loops th...
This paper introduces a method which can be used to map applications written in a high level source ...
Abstract: It is widely known that bandwidth limitations degrade parallel systems ’ performance. This...
This paper presents a compiler methodology for memory-aware mapping on 2-Dimensional coarse-grained ...
Coarse-grained reconfigurable arrays (CGRAs) are a very promising platform, providing both up to 10-...
With the increasing demand for flexible yet highly efficient architecture platforms for media applic...
Coarse-Grained Reconfigurable Arrays (CGRAs) are a very promising platform, providing both, up to 10...
UnrestrictedConfigurable architectures offer the unique opportunity of realizing hardware designs ta...
Coarse-grain reconfigurable architectures consist of a large number of processing elements (PEs) con...
Coarse-grained reconfigurable architectures can enhance the performance of critical loops and comput...
This thesis investigates the impact of the global and local register file architecture on a reconfig...
Today the most commonly used system architectures in data processing can be divided into three categ...
We propose that, in order to meet high computational demands, the application development has to be ...
This paper presents various novel techniques for improving coarse-grained reconfigurable architectur...
Exploring the design space of an optimized compiler approach for mesh-like coarse-grained reconfigur...
Abstract Coarse-Grained Reconfigurable Array (CGRA) architectures accelerate the same inner loops th...
This paper introduces a method which can be used to map applications written in a high level source ...
Abstract: It is widely known that bandwidth limitations degrade parallel systems ’ performance. This...
This paper presents a compiler methodology for memory-aware mapping on 2-Dimensional coarse-grained ...
Coarse-grained reconfigurable arrays (CGRAs) are a very promising platform, providing both up to 10-...
With the increasing demand for flexible yet highly efficient architecture platforms for media applic...
Coarse-Grained Reconfigurable Arrays (CGRAs) are a very promising platform, providing both, up to 10...
UnrestrictedConfigurable architectures offer the unique opportunity of realizing hardware designs ta...
Coarse-grain reconfigurable architectures consist of a large number of processing elements (PEs) con...
Coarse-grained reconfigurable architectures can enhance the performance of critical loops and comput...
This thesis investigates the impact of the global and local register file architecture on a reconfig...
Today the most commonly used system architectures in data processing can be divided into three categ...
We propose that, in order to meet high computational demands, the application development has to be ...
This paper presents various novel techniques for improving coarse-grained reconfigurable architectur...
Exploring the design space of an optimized compiler approach for mesh-like coarse-grained reconfigur...
Abstract Coarse-Grained Reconfigurable Array (CGRA) architectures accelerate the same inner loops th...
This paper introduces a method which can be used to map applications written in a high level source ...