In the paper we study the performance of the regularized boundary element quadrature routines implemented in the BEM4I library developed by the authors. Apart from the results obtained on the classical multi-core architecture represented by the Intel Xeon processors we concentrate on the portability of the code to the many-core family Intel Xeon Phi. Contrary to the GP-GPU programming accelerating many scientific codes, the standard x86 architecture of the Xeon Phi processors allows to reuse the already existing multi-core implementation. Although in many cases a simple recompilation would lead to an inefficient utilization of the Xeon Phi, the effort invested in the optimization usually leads to a better performance on the multi-core Xeon ...
AbstractModern multicore and manycore processors exhibit multiple levels of parallelism through a wi...
The objective of this paper is to extend, in the context of multicore architectures, the concepts of...
Over the past few years, energy consumption has become the main limiting factor for computing in gen...
The Intel R Xeon PhiTM is the first processor based on Intel’s MIC (Many Integrated Cores) architect...
Although parallelization of computationally intensive algorithms has become a standard with the scie...
The article is devoted to the vectorization of calculations for Intel Xeon Phi Knights Landing (KNL)...
Partial Differential Equations (PDEs) are widely used to simulate many scenarios in science and engi...
In this paper, we describe an efficient analytic evaluation of boundary integral operators. Firstly,...
We investigate a domain decomposition method (DDM) of finite element method (FEM) using Intel's...
International audienceOn the road to exascale, coprocessors are increasingly becoming key building b...
This paper presents the design and implementation of several fundamental dense linear algebra (DLA) ...
This thesis describes novel techniques and test implementations for optimizing numerically intensive...
Modern multicore and manycore processors exhibit multiple levels of parallelism through a wide range...
The finite element method (FEM) is one of the most commonly used techniques for the solution of part...
AbstractThe boundary element method has its origins in the boundary integral equation method [1] and...
AbstractModern multicore and manycore processors exhibit multiple levels of parallelism through a wi...
The objective of this paper is to extend, in the context of multicore architectures, the concepts of...
Over the past few years, energy consumption has become the main limiting factor for computing in gen...
The Intel R Xeon PhiTM is the first processor based on Intel’s MIC (Many Integrated Cores) architect...
Although parallelization of computationally intensive algorithms has become a standard with the scie...
The article is devoted to the vectorization of calculations for Intel Xeon Phi Knights Landing (KNL)...
Partial Differential Equations (PDEs) are widely used to simulate many scenarios in science and engi...
In this paper, we describe an efficient analytic evaluation of boundary integral operators. Firstly,...
We investigate a domain decomposition method (DDM) of finite element method (FEM) using Intel's...
International audienceOn the road to exascale, coprocessors are increasingly becoming key building b...
This paper presents the design and implementation of several fundamental dense linear algebra (DLA) ...
This thesis describes novel techniques and test implementations for optimizing numerically intensive...
Modern multicore and manycore processors exhibit multiple levels of parallelism through a wide range...
The finite element method (FEM) is one of the most commonly used techniques for the solution of part...
AbstractThe boundary element method has its origins in the boundary integral equation method [1] and...
AbstractModern multicore and manycore processors exhibit multiple levels of parallelism through a wi...
The objective of this paper is to extend, in the context of multicore architectures, the concepts of...
Over the past few years, energy consumption has become the main limiting factor for computing in gen...