Thesis (Master's)--University of Washington, 2019Previous work has developed a tool, the Tensor Template Library (TTL), which uses variadic expression template metaprogramming to capture tensor behaviors clearly and in a manner resembling the mathematical abstraction engineers are familiar with while concealing the cumbersome looping structures, in an optimized manner. This has utility in simulating physical systems in material sci-ence via finite element modelling, but with applications in systems with large numbers of small, dense tensors. The initial work of this author was to update the TTL to operate within a graphics processing unit (GPU), build a test suite to verify those updates compiled and generated correct output in a GPU enviro...
The graphics processing unit (GPU) was initially designed for raster-based graphics com- putations, ...
There has been a surge in the demand for a Domain Specific Architecture due to wide ranging deep lea...
We present Template Numerical Library (TNL, www.tnl-project.org) with native support of modern paral...
Computational intensive applications such as pattern recognition, and natural language processing, a...
In the CFD solver Nek5000, the computation is dominated by the evaluation of small tensor operations...
AbstractWe present a computational framework for high-performance tensor contractions on GPUs. High-...
Virtual prototyping, the iterative process of using computer-aided (CAx) modeling, simulation, and v...
We present a C++ library, TLoops, which uses a hierarchy of expression templates to represent operat...
Tensor Cores have been an important unit to accelerate Fused Matrix Multiplication Accumulation (MMA...
Tensor Cores (TCUs) are specialized units first introduced by NVIDIA in the Volta microarchitecture ...
Template methods have opened up a new way of building C++ libraries. These methods allow the librari...
Tensors are higher-dimensional analogs of matrices, and represent a key data abstraction for many ap...
Tensor algorithms are a rapidly growing field of research with applications in many scientific domai...
As the adoption of parallel and heterogeneous systems increases, programming such systems also becom...
Complex tensor contraction expressions arise in accurate electronic structure models in quantum chem...
The graphics processing unit (GPU) was initially designed for raster-based graphics com- putations, ...
There has been a surge in the demand for a Domain Specific Architecture due to wide ranging deep lea...
We present Template Numerical Library (TNL, www.tnl-project.org) with native support of modern paral...
Computational intensive applications such as pattern recognition, and natural language processing, a...
In the CFD solver Nek5000, the computation is dominated by the evaluation of small tensor operations...
AbstractWe present a computational framework for high-performance tensor contractions on GPUs. High-...
Virtual prototyping, the iterative process of using computer-aided (CAx) modeling, simulation, and v...
We present a C++ library, TLoops, which uses a hierarchy of expression templates to represent operat...
Tensor Cores have been an important unit to accelerate Fused Matrix Multiplication Accumulation (MMA...
Tensor Cores (TCUs) are specialized units first introduced by NVIDIA in the Volta microarchitecture ...
Template methods have opened up a new way of building C++ libraries. These methods allow the librari...
Tensors are higher-dimensional analogs of matrices, and represent a key data abstraction for many ap...
Tensor algorithms are a rapidly growing field of research with applications in many scientific domai...
As the adoption of parallel and heterogeneous systems increases, programming such systems also becom...
Complex tensor contraction expressions arise in accurate electronic structure models in quantum chem...
The graphics processing unit (GPU) was initially designed for raster-based graphics com- putations, ...
There has been a surge in the demand for a Domain Specific Architecture due to wide ranging deep lea...
We present Template Numerical Library (TNL, www.tnl-project.org) with native support of modern paral...