Abstract. Load elimination is a classical compiler transformation that is increasing in importance for multi-core and many-core architectures. The effect of the transformation is to replace a memory access, such as a read of an object field or an array element, by a read of a compiler-generated temporary that can be allocated in faster and more energy-efficient storage structures such as registers and local memories (scratchpads). Unfortunately, current just-in-time and dynamic compilers perform load elimination only in limited situations. In particular, they usually make worst-case assumptions about potential side effects arising from parallel constructs and method calls. These two constraints interact with each other since parallel constr...
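As an illustration of the transformation described in this abstract, the following is a minimal hand-written sketch, not the paper's algorithm. The `Point` class, the two functions, and the temporaries `t_x` and `t_y` are hypothetical names chosen for the example. The rewrite is only valid under the assumption that nothing writes `p.x` or `p.y` while the loop runs (no side-effecting calls, no concurrent tasks), which is exactly the kind of fact a compiler must establish before eliminating the loads.

```python
class Point:
    """Hypothetical object with two fields that are read repeatedly."""

    def __init__(self, x, y):
        self.x = x
        self.y = y


def norm_sq_before(p, n):
    """Original form: every iteration re-reads the fields p.x and p.y."""
    total = 0
    for _ in range(n):
        total += p.x * p.x + p.y * p.y  # four field loads per iteration
    return total


def norm_sq_after(p, n):
    """After load elimination: each field is read once into a temporary.

    Assumes no write to p.x or p.y can occur during the loop.
    """
    t_x = p.x  # stands in for a compiler-generated temporary
    t_y = p.y
    total = 0
    for _ in range(n):
        total += t_x * t_x + t_y * t_y  # no field loads inside the loop
    return total
```

Both functions return the same value when the no-write assumption holds. If a method call or a parallel task inside the loop could modify `p`, the second form could read stale values, which is why worst-case assumptions about side effects block the optimization.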
Distributed memory parallel architectures support a memory model where some memory accesses are loca...
This paper presents compiler algorithms to optimize out-of-core programs. These algorithms consider ...
Most current compiler analysis techniques are unable to cope with the semantics introduced by explic...
The power, frequency, and memory wall problems have caused a major shift in mainstream computing by ...
Abstract. Partial redundancy elimination can reduce the number of loads corresponding to field and a...
Program redundancy analysis and optimization have been an important component in optimizing compiler...
Partial Redundancy Elimination (PRE) is a general scheme for suppressing partial redundancies which ...
There has been a proliferation of task-parallel programming systems to address the requirements of ...
This paper describes transformation techniques for out-of-core programs (i.e., those that deal with...
Achieving high performance in task-parallel runtime systems, especially with high degrees of paralle...
Abstract. Dynamic compilation is becoming a dominant compilation technique. Runtime compilation has ...
In this paper we analyze the effect of compiler optimizations on fine grain parallelism in scalar pr...
It is possible to reduce the computation time of data parallel programs by dividing the computation ...
This paper describes optimization techniques for translating out-of-core programs written in a data ...