This thesis targets the design of parallelizable algorithms and communication-efficient parallel schedules for numerical linear algebra and for computations with higher-order tensors. Communication is a growing bottleneck in the execution of most algorithms on parallel computers. It manifests as data movement through the network connecting processors, data movement through the memory hierarchy of each processor, and synchronization between processors. We provide a rigorous theoretical model of communication and derive both lower bounds and algorithms in this model. Our analysis concerns two broad areas: linear algebra and tensor contractions. We demonstrate the practical quality of the new theoretically-improve...
Multiple Tensor-Times-Matrix (Multi-TTM) is a key computation in algorithms fo...
Dense linear algebra computations are essential to nearly every problem in scientific computing and ...
This thesis considers two problems in numerical linear algebra and high performance computing (HPC):...
Contractions of nonsymmetric tensors are reducible to matrix multiplication; however, ‘fully symmetr...
This electronic version was submitted by the student author. The certified thesis is available in th...
Contractions of nonsymmetric tensors are reducible to matrix multiplication; however, ‘fully symmet...
Coupled-cluster methods provide highly accurate models of molecular structure through explicit numer...
140 pages. Tensor algebra lives at the heart of big data applications. Where classical machine learnin...
Parallel matrix multiplication is one of the most studied fundamental problems in distributed and h...
Multilinear algebra kernel performance on modern massively-parallel systems is determined mainly by ...