Abstract. Increasing on-chip wire delay along with the distributed nature of processing elements, makes instruction scheduling for tiled dataflow architectures very crucial. Our analysis reveals that careful placement of frequently executed sections of applications, and dynamic resource contention tracking can significantly improve the performance of the application. The former reduces the operand network latency, while the latter reduces stalls due to contention for processing elements. We augment one of the most recent instruction scheduling algorithms —hierarchical instruction scheduling —to better exploit spatial locality between instructions within a loop, thereby reducing expensive com-munication overhead by 6.5 % and increasing avera...
This thesis presents a novel approach to the instruction scheduling problem for dynamic issue proces...
Link to published version: http://ieeexplore.ieee.org/iel2/390/6075/00236705.pdf?tp=&arnumber=236705...
CMOS technology scaling poses challenges in designing dynamically scheduled cores that can sustain b...
Instruction scheduling and Software pipelining are important compilation techniques which reorder in...
The processor speeds continue to improve at a faster rate than the memory access times. The issue of...
This work examines the interaction of compiler scheduling techniques with processor features such as...
Clustering is a common technique to overcome the wire delay problem incurred by the evolution of tec...
textIncreasing bandwidth and decreasing latency are two orthogonal techniques for improving program...
Recent evidence indicates that the exploitation of locality in dataflow programs could have a dramat...
Traditional list schedulers order instructions based on an optimistic estimate of the load latency i...
Graduation date: 2007Dynamic multithreaded processors attempt to increase the performance of a singl...
Recent studies show that very long instruction word (VLIW) architectures, which inherently have wide...
To maximize the performance of wide-issue superscalar out-of-order microprocessors, the issue stage ...
The processor speeds continue to improve at a faster rate than the memory access times. The issue o...
Instruction scheduling algorithms are used in compilers to reduce run-time delays for the compiled c...
This thesis presents a novel approach to the instruction scheduling problem for dynamic issue proces...
Link to published version: http://ieeexplore.ieee.org/iel2/390/6075/00236705.pdf?tp=&arnumber=236705...
CMOS technology scaling poses challenges in designing dynamically scheduled cores that can sustain b...
Instruction scheduling and Software pipelining are important compilation techniques which reorder in...
The processor speeds continue to improve at a faster rate than the memory access times. The issue of...
This work examines the interaction of compiler scheduling techniques with processor features such as...
Clustering is a common technique to overcome the wire delay problem incurred by the evolution of tec...
textIncreasing bandwidth and decreasing latency are two orthogonal techniques for improving program...
Recent evidence indicates that the exploitation of locality in dataflow programs could have a dramat...
Traditional list schedulers order instructions based on an optimistic estimate of the load latency i...
Graduation date: 2007Dynamic multithreaded processors attempt to increase the performance of a singl...
Recent studies show that very long instruction word (VLIW) architectures, which inherently have wide...
To maximize the performance of wide-issue superscalar out-of-order microprocessors, the issue stage ...
The processor speeds continue to improve at a faster rate than the memory access times. The issue o...
Instruction scheduling algorithms are used in compilers to reduce run-time delays for the compiled c...
This thesis presents a novel approach to the instruction scheduling problem for dynamic issue proces...
Link to published version: http://ieeexplore.ieee.org/iel2/390/6075/00236705.pdf?tp=&arnumber=236705...
CMOS technology scaling poses challenges in designing dynamically scheduled cores that can sustain b...