This thesis studies data-parallelism in tensor assignments. Building on an existing domain-specific language for tensor calculations developed at the Chair of Compiler Construction, an extension is proposed to detect so-called compatible statements, which characterize when a statement is data-parallel. The correctness of the extension is shown using a type system, and a conjecture about its optimality is proposed. Based on the extension, two optimizations that exploit the data-parallelism are described. These optimizations reduce the memory used during computation, thereby reducing cache misses and improving runtime. The achievable speedup depends mainly on the complexity of the kernel and the size of the tensors. For simple kernels like multiplic...
Improving data locality of tensor data structures is a crucial optimization for maximizing the perfo...
Tensors are higher-dimensional analogs of matrices, and represent a key data abstraction for many ap...
The aim of this thesis is the development of parallel algorithms for tensor arithmetic (as, e.g., do...
This thesis considers two problems in numerical linear algebra and high performance computing (HPC):...
This dissertation is concerned with the development of novel high-performance algorithms for tensor ...
This thesis targets the design of parallelizable algorithms and communication-efficient parallel sch...