Contractions of nonsymmetric tensors are reducible to matrix multiplication, however, ‘fully symmetric contractions’ in which the tensors are symmetric and the result is symmetrized can be done with fewer operations. The ‘direct evaluation algorithm’ for fully symmetric contractions exploits equivalence between terms in the contraction equation to obtain a lower computation cost than the cost associated with nonsymmetric contractions. The ‘symmetry preserving algorithm’ lowers the cost even further via an algebraic reorganization of the contraction equation. We derive vertical (between memory and cache) and horizontal (interprocessor) communication lower bounds for both of these algorithms. We demonstrate that any load balanced parallel sch...
Coupled-cluster methods provide highly accurate models of molecular structure through explicit numer...
AbstractThe optimization of tensor expressions with hundreds of terms is required for the developmen...
Global communication optimization for tensor contraction expressions under memory constraint
Contractions of nonsymmetric tensors are reducible to matrix mul-tiplication, however, ‘fully symmet...
This thesis targets the design of parallelizable algorithms and communication-efficient parallel sch...
This thesis targets the design of parallelizable algorithms and communication-efficient parallel sch...
This thesis targets the design of parallelizable algorithms and communication-efficient parallel sch...
Abstract. Symmetric tensor operations arise in a wide variety of computations. However, the benefits...
Abstract. Complex tensor contraction expressions arise in accurate electronic structure models in qu...
International audienceMultiple Tensor-Times-Matrix (Multi-TTM) is a key computation in algorithms fo...
International audienceMultiple Tensor-Times-Matrix (Multi-TTM) is a key computation in algorithms fo...
International audienceMultiple Tensor-Times-Matrix (Multi-TTM) is a key computation in algorithms fo...
International audienceIn this paper, we focus on the parallel communication cost of multiplying a ma...
This dissertation is concerned with the development of novel high-performance algorithms for tensor ...
This dissertation is concerned with the development of novel high-performance algorithms for tensor ...
Coupled-cluster methods provide highly accurate models of molecular structure through explicit numer...
AbstractThe optimization of tensor expressions with hundreds of terms is required for the developmen...
Global communication optimization for tensor contraction expressions under memory constraint
Contractions of nonsymmetric tensors are reducible to matrix mul-tiplication, however, ‘fully symmet...
This thesis targets the design of parallelizable algorithms and communication-efficient parallel sch...
This thesis targets the design of parallelizable algorithms and communication-efficient parallel sch...
This thesis targets the design of parallelizable algorithms and communication-efficient parallel sch...
Abstract. Symmetric tensor operations arise in a wide variety of computations. However, the benefits...
Abstract. Complex tensor contraction expressions arise in accurate electronic structure models in qu...
International audienceMultiple Tensor-Times-Matrix (Multi-TTM) is a key computation in algorithms fo...
International audienceMultiple Tensor-Times-Matrix (Multi-TTM) is a key computation in algorithms fo...
International audienceMultiple Tensor-Times-Matrix (Multi-TTM) is a key computation in algorithms fo...
International audienceIn this paper, we focus on the parallel communication cost of multiplying a ma...
This dissertation is concerned with the development of novel high-performance algorithms for tensor ...
This dissertation is concerned with the development of novel high-performance algorithms for tensor ...
Coupled-cluster methods provide highly accurate models of molecular structure through explicit numer...
AbstractThe optimization of tensor expressions with hundreds of terms is required for the developmen...
Global communication optimization for tensor contraction expressions under memory constraint