Abstract. We present a new technique to automatically optimize parallel software for multi-core processors. We have implemented the technique for Bamboo, a task-based extension to Java. Optimizing applications for multi-core processors requires balancing the competing concerns of parallelism and communication costs. Bamboo uses high-level simulation to explore how to best trade off these competing concerns for an application. The compiler begins by generating several initial candidate implementations. The compiler then uses high-level simulation with profile statistics to evaluate these candidate implementations. It uses an as-built critical path analysis to automatically identify opportunities to improve the candidate implementation and t...
Multiple multi-threaded tasks constitute a modern many-core application. An accompanying generic Dir...
The recent transformation from an environment where gains in computational performance came from inc...
In the area of parallel processing, performance has been the primary goal, and parallel software wri...
146 p. Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2008. My work discusses various str...
This paper explains the programming aspects of a promising Java-based programming and execution fram...
The performance of a computer system is important. One way of improving performance is to use multip...
Multi-core processors are everywhere now, and researchers all over the world are finding ways to benchm...
With modern chip design facing the so-called frequency, power, and other walls, multi-core system...
In order to speed up the execution of Java applications, JIT compilers compile method bytecodes int...
Future integrated systems will contain billions of transistors, composing tens to hundreds of IP cor...
Key Results: A simulation platform for microprocessor architectures written in Java. The simulation...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/16...
The goal of this dissertation is to give programmers the ability to achieve high performance by focu...
The performance of parallel code significantly depends on the parallel task granularity (PTG). If th...
We are interested in the issues on the bytecode transformation for performance improveme...