The Knights Landing (KNL) is the codename for the latest generation of Intel processors based on Intel Many Integrated Core (MIC) architecture. It relies on massive thread and data parallelism, and fast on-chip memory. This processor operates in standalone mode, booting an off-the-shelf Linux operating system. The KNL peak performance is very high – approximately 3 Tflops in double precision and 6 Tflops in single precision – but sustained performance depends critically on how well all parallel features of the processor are exploited by real-life applications. We assess the performance of this processor for Lattice Boltzmann codes, widely used in computational fluid-dynamics. In our OpenMP code we consider several memory data-layouts that m...
Lattice Boltzmann Methods (LBM) are an established mesoscopic approach for simulating a wide variety...
We have been developing an advanced scientific code called "ARTED" for an electron dynamics simulati...
Energy consumption of processors and memories is quickly becoming a limiting factor in the deploymen...
In this paper we report results of the analysis of computational performances and energy efficiency ...
In this paper we report on our early experience on porting, optimizing and benchmarking a Lattice Bo...
Accelerators are quickly emerging as the leading technology to further boost computing performances;...
In this paper we report results of the analysis of computational performances and energy efficiency ...
Abstract In this paper we report on our early experience on porting, optimizing and benchmarking a...
AbstractIn this paper we report on our early experience on porting, optimizing and benchmarking a La...
A recent trend in scientific computing is the increasingly important role of co-processors, original...
This paper presents the performance analysis for both the computing performance and the energy effic...
COKA plans to study possible ways to efficiently use up-coming Intel many-core MIC architectures for...
COKA plans to study possible ways to efficiently use up-coming Intel many-core MIC architectures for...
In this paper we address the problem of identifying and exploiting techniques that optimize the perf...
In this work we focus on energy performance of the Knights Landing Xeon Phi, the latest many-core ar...
Lattice Boltzmann Methods (LBM) are an established mesoscopic approach for simulating a wide variety...
We have been developing an advanced scientific code called "ARTED" for an electron dynamics simulati...
Energy consumption of processors and memories is quickly becoming a limiting factor in the deploymen...
In this paper we report results of the analysis of computational performances and energy efficiency ...
In this paper we report on our early experience on porting, optimizing and benchmarking a Lattice Bo...
Accelerators are quickly emerging as the leading technology to further boost computing performances;...
In this paper we report results of the analysis of computational performances and energy efficiency ...
Abstract In this paper we report on our early experience on porting, optimizing and benchmarking a...
AbstractIn this paper we report on our early experience on porting, optimizing and benchmarking a La...
A recent trend in scientific computing is the increasingly important role of co-processors, original...
This paper presents the performance analysis for both the computing performance and the energy effic...
COKA plans to study possible ways to efficiently use up-coming Intel many-core MIC architectures for...
COKA plans to study possible ways to efficiently use up-coming Intel many-core MIC architectures for...
In this paper we address the problem of identifying and exploiting techniques that optimize the perf...
In this work we focus on energy performance of the Knights Landing Xeon Phi, the latest many-core ar...
Lattice Boltzmann Methods (LBM) are an established mesoscopic approach for simulating a wide variety...
We have been developing an advanced scientific code called "ARTED" for an electron dynamics simulati...
Energy consumption of processors and memories is quickly becoming a limiting factor in the deploymen...