The increasing complexity of modern hardware requires sophisticated programming techniques for programs to run efficiently. At the same time, increased power of modern hardware enables more advanced analyses to be included in compilers. This thesis focuses on one particular optimisation technique that improves utilisation of vector units. The foundation of this technique is the ability to chose memory mappings for data structures of a given program. Usually programming languages use a fixed layout for logical data structures in physical memory. Such a static mapping often has a negative effect on usability of vector units. In this thesis we consider a compiler for a programming language that allows every data structure in a program...
Abstract. Heterogeneity is a confirmed trend of computing systems. Byte-code formats and just-in-tim...
In the last years, there has been much effort in commercial compilers (icc, gcc) to exploit efficien...
This paper describes methods to adapt existing optimizing compilers for sequential languages to prod...
SIMD (Single Instruction, Multiple Data) instruction sets are ubiquitous on modern hardware, but rar...
As an effective way of utilizing data parallelism in applications, SIMD architecture has been adopte...
International audienceFrom a high level point of view, developers define objects they manipulate in ...
Data-parallel programming is more important than ever since serial performance is stagnating. All ma...
International audienceSIMD processor units have become ubiquitous. Using SIMD instructions is the ke...
This work establishes a scalable, easy to use and efficient approach for exploiting SIMD capabilitie...
A long-standing challenge in High-Performance Computing (HPC) is the simultaneous achievement of ...
As an effective way of utilizing data parallelism in applications, SIMD architecture has been adopte...
SIMD extensions were added to microprocessors in the mid '90s to speed-up data-parallel code by vect...
Expressing algorithms using immutable arrays greatly simplifies the challenges of automatic SIMD vec...
Modern CPUs are equipped with Single Instruction Multiple Data (SIMD) engines operating on short vec...
Data locality and parallelism are critical optimization objectives for performance on modern multi-c...
Abstract. Heterogeneity is a confirmed trend of computing systems. Byte-code formats and just-in-tim...
In the last years, there has been much effort in commercial compilers (icc, gcc) to exploit efficien...
This paper describes methods to adapt existing optimizing compilers for sequential languages to prod...
SIMD (Single Instruction, Multiple Data) instruction sets are ubiquitous on modern hardware, but rar...
As an effective way of utilizing data parallelism in applications, SIMD architecture has been adopte...
International audienceFrom a high level point of view, developers define objects they manipulate in ...
Data-parallel programming is more important than ever since serial performance is stagnating. All ma...
International audienceSIMD processor units have become ubiquitous. Using SIMD instructions is the ke...
This work establishes a scalable, easy to use and efficient approach for exploiting SIMD capabilitie...
A long-standing challenge in High-Performance Computing (HPC) is the simultaneous achievement of ...
As an effective way of utilizing data parallelism in applications, SIMD architecture has been adopte...
SIMD extensions were added to microprocessors in the mid '90s to speed-up data-parallel code by vect...
Expressing algorithms using immutable arrays greatly simplifies the challenges of automatic SIMD vec...
Modern CPUs are equipped with Single Instruction Multiple Data (SIMD) engines operating on short vec...
Data locality and parallelism are critical optimization objectives for performance on modern multi-c...
Abstract. Heterogeneity is a confirmed trend of computing systems. Byte-code formats and just-in-tim...
In the last years, there has been much effort in commercial compilers (icc, gcc) to exploit efficien...
This paper describes methods to adapt existing optimizing compilers for sequential languages to prod...