We extend a two-level task partitioning previously applied to the inversion of dense matrices via Gauss–Jordan elimination to the more challenging QR factorization, as well as to the initial orthogonal reduction to band form found in the singular value decomposition. Our new task-parallel algorithms leverage the tasking mechanism currently available in OpenMP to exploit "nested" task parallelism, with a first outer level that operates on matrix panels and a second inner level that processes the matrix either by µ-panels or by tiles, in order to expose a large number of independent tasks. We present a detailed performance analysis, including execution traces, which shows that the two-level refinement into fine-grain tasks allows for an improved...
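To illustrate the two-level ("nested") OpenMP tasking scheme the abstract describes, the following is a minimal, hypothetical sketch and not the authors' QR or band-reduction code: an outer level of coarse tasks over matrix panels, each of which spawns an inner level of fine-grain tasks over tiles of that panel. The panel width PB, tile size TB, and the placeholder tile_update kernel are assumptions chosen only for illustration.

```c
/* Minimal sketch (not the authors' implementation) of nested OpenMP
 * task parallelism: outer tasks over matrix panels, inner tasks over
 * tiles within each panel.  The kernel is a placeholder block update,
 * not an actual QR or band-reduction operation. */
#include <stdio.h>
#include <stdlib.h>
#include <omp.h>

#define N   1024          /* matrix dimension (assumed)  */
#define PB  256           /* outer panel width (assumed) */
#define TB  64            /* inner tile size (assumed)   */

/* Placeholder fine-grain kernel applied to one tile of a panel. */
static void tile_update(double *A, int n, int row, int col, int tb)
{
    for (int i = row; i < row + tb && i < n; i++)
        for (int j = col; j < col + tb && j < n; j++)
            A[i * n + j] *= 0.5;   /* stand-in for the real update */
}

int main(void)
{
    double *A = malloc((size_t)N * N * sizeof *A);
    for (int i = 0; i < N * N; i++) A[i] = 1.0;

    #pragma omp parallel
    #pragma omp single
    {
        /* Outer level: one coarse task per panel of PB columns. */
        for (int pc = 0; pc < N; pc += PB) {
            #pragma omp task firstprivate(pc)
            {
                /* Inner level: fine-grain tasks over TB x TB tiles of
                 * the current panel, exposing more independent work. */
                for (int r = 0; r < N; r += TB)
                    for (int c = pc; c < pc + PB && c < N; c += TB) {
                        #pragma omp task firstprivate(r, c)
                        tile_update(A, N, r, c, TB);
                    }
                #pragma omp taskwait   /* finish this panel's tiles */
            }
        }
    }   /* implicit barrier: all panel tasks complete here */

    printf("A[0] = %f\n", A[0]);
    free(A);
    return 0;
}
```

Compiled with an OpenMP-enabled compiler (e.g., gcc -fopenmp), the outer tasks keep scheduling overhead low at the panel level while the inner tasks supply enough fine-grain work to keep many cores busy, which is the motivation for the two-level refinement.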