Distributed local memories, or scratchpads, have been shown to e#ectively reduce cost and power consumption of application-specific accelerators while maintaining performance. The design of the local memory organization must take several factors into account, including the memory bandwidth and size requirements of the program and the distribution of program data among the memories. In addition, when register structures and function units in the accelerator are clustered, the e#ects of intercluster communication should be taken into account. This work proposes a technique to synthesize the local memory architecture of a clustered accelerator using a phase-ordered approach. First, the dataflow graph is pre-partitioned to define a performance-...
As we witness the breakdown of Dennard scaling, we can no longer get faster computers by shrinking t...
International audiencePower and programming challenges make heterogeneous multi-cores composed of co...
In many domains, accelerators---such as graphic processing units (GPUs) and field programmable gate ...
In modern system-on-chip architectures, specialized accelerators are increasingly used to improve pe...
While programmable accelerators such as application-specific processors and reconfigurable architect...
Memory-intensive implementations often require access to an external, off-chip memory which can subs...
Memory partitioning is an eective approach to memory energy optimization in embedded systems. Spatia...
Abstract: Memory management is one of the key challenges in the design of embed-ded systems where me...
In order to meet the requirements concerning both performance and energy consumption in embedded sy...
The design of specialized accelerators is essential to the success of many modern Systems-on-Chip. E...
Power constraints led to the end of exponential growth in single–processor performance, which charac...
This paper deals with methods of automated memory synthesis for Custom Computing Machines (CCMs). We...
Abstract The combination of growing transistor counts and limited power budget within a silicon die ...
In light of the failure of Dennard scaling and recent slowdown of Moore's Law, both industry and aca...
The adoption of High-Level Synthesis (HLS) tools has significantly reduced accelerator design time. ...
As we witness the breakdown of Dennard scaling, we can no longer get faster computers by shrinking t...
International audiencePower and programming challenges make heterogeneous multi-cores composed of co...
In many domains, accelerators---such as graphic processing units (GPUs) and field programmable gate ...
In modern system-on-chip architectures, specialized accelerators are increasingly used to improve pe...
While programmable accelerators such as application-specific processors and reconfigurable architect...
Memory-intensive implementations often require access to an external, off-chip memory which can subs...
Memory partitioning is an eective approach to memory energy optimization in embedded systems. Spatia...
Abstract: Memory management is one of the key challenges in the design of embed-ded systems where me...
In order to meet the requirements concerning both performance and energy consumption in embedded sy...
The design of specialized accelerators is essential to the success of many modern Systems-on-Chip. E...
Power constraints led to the end of exponential growth in single–processor performance, which charac...
This paper deals with methods of automated memory synthesis for Custom Computing Machines (CCMs). We...
Abstract The combination of growing transistor counts and limited power budget within a silicon die ...
In light of the failure of Dennard scaling and recent slowdown of Moore's Law, both industry and aca...
The adoption of High-Level Synthesis (HLS) tools has significantly reduced accelerator design time. ...
As we witness the breakdown of Dennard scaling, we can no longer get faster computers by shrinking t...
International audiencePower and programming challenges make heterogeneous multi-cores composed of co...
In many domains, accelerators---such as graphic processing units (GPUs) and field programmable gate ...