With the multicore trend, the need for automatic parallelization is more pronounced, especially for legacy and proprietary code where no source code is available and/or the code is already running and restarting it is not an option. In this paper, we engineer a mechanism that transforms, at runtime, a frequently executed for-loop with no data dependencies in a binary program into a parallel loop, using on-stack replacement. The mechanism requires no source code, no debugging information, and no program restart; it also needs no static instrumentation or statically gathered information. The mechanism is implemented using the Padrone binary modification system and pthreads, where the remaining iterations of the loop are executed ...
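The core idea of handing the remaining iterations of a dependency-free loop to pthreads can be illustrated at source level. The sketch below is not the paper's mechanism: it does not use Padrone or on-stack replacement, and it works on C source rather than on a running binary. The names `loop_body`, `chunk_t`, `worker` and `run_remaining_parallel` are hypothetical, and the even split into contiguous chunks is an assumption, since the abstract is cut off before it says how the iterations are distributed.

```c
#include <pthread.h>
#include <stdlib.h>

/* Hypothetical loop body: iteration i touches only data[i],
   so there are no dependencies between iterations. */
static void loop_body(double *data, long i) {
    data[i] = data[i] * 2.0 + 1.0;
}

/* Work handed to one thread: the half-open range [begin, end). */
typedef struct {
    double *data;
    long begin;
    long end;
    int spawned;   /* 1 if a thread was created for this chunk */
} chunk_t;

static void *worker(void *arg) {
    chunk_t *c = (chunk_t *)arg;
    for (long i = c->begin; i < c->end; i++)
        loop_body(c->data, i);
    return NULL;
}

/* Execute the remaining iterations [current, limit) of the loop on
   nthreads threads. `current` stands for the iteration at which the
   sequential loop was interrupted (an assumption of this sketch).
   Returns 0 on success, -1 on bad arguments or allocation failure. */
int run_remaining_parallel(double *data, long current, long limit, int nthreads) {
    if (data == NULL || nthreads < 1 || current > limit)
        return -1;

    pthread_t *tids = malloc((size_t)nthreads * sizeof *tids);
    chunk_t *chunks = malloc((size_t)nthreads * sizeof *chunks);
    if (tids == NULL || chunks == NULL) {
        free(tids);
        free(chunks);
        return -1;
    }

    long remaining = limit - current;
    long base = remaining / nthreads;    /* minimum chunk length */
    long extra = remaining % nthreads;   /* first `extra` chunks get one more */
    long next = current;

    for (int t = 0; t < nthreads; t++) {
        long len = base + (t < extra ? 1 : 0);
        chunks[t].data = data;
        chunks[t].begin = next;
        chunks[t].end = next + len;
        next += len;
        chunks[t].spawned =
            (pthread_create(&tids[t], NULL, worker, &chunks[t]) == 0);
        if (!chunks[t].spawned)
            worker(&chunks[t]);          /* fall back: run this chunk inline */
    }

    /* Wait for every thread that was actually created. */
    for (int t = 0; t < nthreads; t++)
        if (chunks[t].spawned)
            pthread_join(tids[t], NULL);

    free(tids);
    free(chunks);
    return 0;
}
```

A caller that has already run iterations 0 to 99 sequentially would invoke `run_remaining_parallel(data, 100, n, 4)` to finish the loop on four threads. Because each iteration writes only its own element, the threads need no locking; a loop with cross-iteration dependencies would not be safe to split this way.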
Traditional static analysis fails to auto-parallelize programs with a complex control and data flow....
In this paper, an approach to the problem of exploiting parallelism within nested loops is ...
An empirical study is presented that examines the potential to parallelize general-purpose ...
Nowadays almost every device has parallel architecture, hence parallelization ...
Runtime compilation has opportunities to parallelize code which are generally ...
The goal of parallelizing, or restructuring, compilers is to detect and exploit parallelism in seque...
Many automatic software parallelization systems have been proposed in the past...
Parallelizing compilers promise to exploit the parallelism available in a given program, particularl...
Speculative parallelization is a classic strategy for automatically parallelizing codes that...
Loops are the main source of parallelism in scientific programs. Hence, several techniques were dev...
Current parallelizing compilers cannot identify a significant fraction of parallelizable loops becau...
Current parallelizing compilers cannot identify a significant fraction of parallelizable loops becau...
We present a transformational system for extracting parallelism from programs. Our transformations g...
This paper describes a tool using one or more executions of a sequential progr...