The multidimensional positive definite advection transport algorithm (MPDATA) belongs to the group of nonoscillatory forward-in-time algorithms and performs a sequence of stencil computations. MPDATA is one of the major parts of the dynamic core of the EULAG geophysical model. In this work, we outline an approach to adaptation of the 3D MPDATA algorithm to the Intel MIC architecture. In order to utilize available computing resources, we propose the (3 + 1)D decomposition of MPDATA heterogeneous stencil computations. This approach is based on combination of the loop tiling and fusion techniques. It allows us to ease memory/communication bounds and better exploit the theoretical floating point efficiency of target computing platforms. An impo...
This article provides a comprehensive study of the impact of performance optimizations on the energy...
This paper presents the design and implementation of several fundamental dense linear algebra (DLA) ...
This paper describes an approach for acceleration of the Hybrid Total FETI (HTFETI) domain decomposi...
Partial Differential Equations (PDEs) are widely used to simulate many scenarios in science and engi...
International audienceDuring recent years computer processors have become increasingly complex (mult...
| openaire: EC/H2020/818665/EU//UniSDyn Funding Information: This work was supported by the Academy ...
This thesis is dedicated to the implementation of high performance algorithms on the Intel Xeon Phi ...
\u3cp\u3eReal-world weather forecasting applications consist of compound stencil kernels that do not...
Abstract. This paper presents the design and implementation of several funda-mental dense linear alg...
ISBN : 9783662480953International audienceThis work presents a hybrid MPI/OpenMP parallelization str...
Deep and shallow convection calculations occupy significant times in atmosphere models. These calcul...
We propose an approach to estimate the power consumption of algorithms, as a function of the frequen...
Real-world weather forecasting applications consist of compound stencil kernels that do not perform ...
APOLLO3® and TRIPOLI-4® are registered trademark of CEAInternational audienceIn this paper we analyz...
The Intel R Xeon PhiTM is the first processor based on Intel’s MIC (Many Integrated Cores) architect...
This article provides a comprehensive study of the impact of performance optimizations on the energy...
This paper presents the design and implementation of several fundamental dense linear algebra (DLA) ...
This paper describes an approach for acceleration of the Hybrid Total FETI (HTFETI) domain decomposi...
Partial Differential Equations (PDEs) are widely used to simulate many scenarios in science and engi...
International audienceDuring recent years computer processors have become increasingly complex (mult...
| openaire: EC/H2020/818665/EU//UniSDyn Funding Information: This work was supported by the Academy ...
This thesis is dedicated to the implementation of high performance algorithms on the Intel Xeon Phi ...
\u3cp\u3eReal-world weather forecasting applications consist of compound stencil kernels that do not...
Abstract. This paper presents the design and implementation of several funda-mental dense linear alg...
ISBN : 9783662480953International audienceThis work presents a hybrid MPI/OpenMP parallelization str...
Deep and shallow convection calculations occupy significant times in atmosphere models. These calcul...
We propose an approach to estimate the power consumption of algorithms, as a function of the frequen...
Real-world weather forecasting applications consist of compound stencil kernels that do not perform ...
APOLLO3® and TRIPOLI-4® are registered trademark of CEAInternational audienceIn this paper we analyz...
The Intel R Xeon PhiTM is the first processor based on Intel’s MIC (Many Integrated Cores) architect...
This article provides a comprehensive study of the impact of performance optimizations on the energy...
This paper presents the design and implementation of several fundamental dense linear algebra (DLA) ...
This paper describes an approach for acceleration of the Hybrid Total FETI (HTFETI) domain decomposi...