While the chip multiprocessor (CMP) has quickly become the predominant processor architecture, its continuing success largely depends on the parallelizability of complex programs. We present a framework that extracts coarse-grain, function-level parallelism to exploit the parallel resources of a CMP. The framework applies a profile-driven control and data dependence analysis between large code regions. It targets coarse-grain parallelism by finding do-across parallelism in the outer loops of a program, which can then be exploited in a pipelined fashion. To identify this parallelism, the framework merges inter-dependent code regions, reducing the overall loop structure to a preset template. The actual parallelization is guided by ...
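The idea of merging inter-dependent code regions can be illustrated with a minimal sketch. It is not the framework's actual algorithm. It assumes the profiler yields a list of (producer, consumer) dependences between named regions of one outer-loop body, groups regions that lie on a common dependence cycle, and orders the groups so that every remaining dependence points forward, as a pipeline would require. The function names `reachable` and `pipeline_stages` and the region names are hypothetical.

```python
from collections import defaultdict


def reachable(start, edges):
    """Return every node reachable from `start` (including itself)."""
    seen, stack = {start}, [start]
    while stack:
        node = stack.pop()
        for nxt in edges[node]:
            if nxt not in seen:
                seen.add(nxt)
                stack.append(nxt)
    return seen


def pipeline_stages(regions, deps):
    """Group regions into candidate pipeline stages.

    regions: region names in program order (hypothetical input format).
    deps: (producer, consumer) pairs, assumed to come from a profiling run;
          every name used in deps is assumed to appear in regions.
    """
    succ, pred = defaultdict(set), defaultdict(set)
    for producer, consumer in deps:
        succ[producer].add(consumer)
        pred[consumer].add(producer)

    # Step 1: merge regions that depend on each other through a cycle.
    group_of, groups = {}, []
    for region in regions:
        if region in group_of:
            continue
        members = reachable(region, succ) & reachable(region, pred)
        for member in members:
            group_of[member] = len(groups)
        groups.append([r for r in regions if r in members])

    # Step 2: order the merged groups so each dependence points to a later stage.
    waiting_on = {g: set() for g in range(len(groups))}
    for producer, consumer in deps:
        if group_of[producer] != group_of[consumer]:
            waiting_on[group_of[consumer]].add(group_of[producer])

    stages, done = [], set()
    while len(done) < len(groups):
        for g in range(len(groups)):
            if g not in done and waiting_on[g] <= done:
                stages.append(groups[g])
                done.add(g)
                break
    return stages


# Hypothetical outer-loop body: "parse" and "update" feed each other, so they merge.
example = pipeline_stages(
    ["read", "parse", "update", "emit"],
    [("read", "read"), ("read", "parse"), ("parse", "update"),
     ("update", "parse"), ("update", "emit")],
)
print(example)
```

In this sketch each resulting group could run as one pipeline stage on its own core, with different stages working on different iterations of the outer loop at the same time. Because the dependences are assumed to come from profiling, they hold only for the observed inputs.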
The end of Dennard scaling also brought an end to frequency scaling as a means to improve performanc...
Parallel computers can provide impressive speedups, but unfortunately such speedups are difficult to...
As multicore processors are deployed in mainstream computing, the need for software tools to help pa...
With the rise of chip-multiprocessors, the problem of parallelizing general-purpose programs has onc...
With the rise of chip multiprocessors (CMPs), the amount of parallel computing power will increase s...
Traditional static analysis fails to auto-parallelize programs with a complex control and data flow....
This paper describes a tool using one or more executions of a sequential progr...
Thesis (Ph. D.)--University of Rochester. Dept. of Computer Science, 2012. Speculative parallelizatio...
Traditional parallelism detection in compilers is performed by means of static analysis and more spe...
Current parallelizing compilers cannot identify a significant fraction of parallelizable loops becau...