Upcoming Exascale target in High Performance Computing (HPC) and disruptive achievements in artificial intelligence give emergence of alternative non-conventional many-core architectures, with energy efficiency typical of embedded systems, and providing the same software ecosystem as classic HPC platforms. A key enabler of energy-efficient computing on many-core architectures is the exploitation of data locality, specifically the use of scratchpad memories in combination with DMA engines in order to overlap computation and communication. Such software paradigm raises considerable programming challenges to both the vendor and the application developer. In this thesis, we tackle the memory transfer and performance issues, as well as the progr...
This paper presents the design and implementation of several fundamental dense linear algebra (DLA) ...
We present an auto-tuning approach to optimize application performance on emerging multicore archite...
International audienceNowadays GPUs have dominated the market considering the computing/power metric...
Upcoming Exascale target in High Performance Computing (HPC) and disruptive achievements in artifici...
La prochaine cible de Exascale en calcul haute performance (High Performance Computing - HPC) et des...
Information systems and High-Performance Computing (HPC) infrastructures play an active role in the ...
In a previous PPoPP paper we showed how the FLAME method-ology, combined with the SuperMatrix runtim...
Parallelism in today's computer architectures is ubiquitous whether it be in supercomputers, worksta...
We present an auto-tuning approach to optimize application performance on emerging multicore archite...
Abstract. Traditional parallel programming methodologies for improv-ing performance assume cache-bas...
Lattice Boltzmann method (LBM) is an important computational fluid dynamics (CFD) approach to solvin...
The objective of high performance computing (HPC) is to ensure that the computational power of hardw...
Along with the traditional CPU cores, processing units of different architectures have been employed...
We present an auto-tuning approach to optimize application performance on emerging multicore archite...
The high performance computing (HPC) community is obsessed over the general matrix-matrix multiply (...
This paper presents the design and implementation of several fundamental dense linear algebra (DLA) ...
We present an auto-tuning approach to optimize application performance on emerging multicore archite...
International audienceNowadays GPUs have dominated the market considering the computing/power metric...
Upcoming Exascale target in High Performance Computing (HPC) and disruptive achievements in artifici...
La prochaine cible de Exascale en calcul haute performance (High Performance Computing - HPC) et des...
Information systems and High-Performance Computing (HPC) infrastructures play an active role in the ...
In a previous PPoPP paper we showed how the FLAME method-ology, combined with the SuperMatrix runtim...
Parallelism in today's computer architectures is ubiquitous whether it be in supercomputers, worksta...
We present an auto-tuning approach to optimize application performance on emerging multicore archite...
Abstract. Traditional parallel programming methodologies for improv-ing performance assume cache-bas...
Lattice Boltzmann method (LBM) is an important computational fluid dynamics (CFD) approach to solvin...
The objective of high performance computing (HPC) is to ensure that the computational power of hardw...
Along with the traditional CPU cores, processing units of different architectures have been employed...
We present an auto-tuning approach to optimize application performance on emerging multicore archite...
The high performance computing (HPC) community is obsessed over the general matrix-matrix multiply (...
This paper presents the design and implementation of several fundamental dense linear algebra (DLA) ...
We present an auto-tuning approach to optimize application performance on emerging multicore archite...
International audienceNowadays GPUs have dominated the market considering the computing/power metric...