Applications that operate on meshes are very popular in High Performance Computing (HPC) environments. In the past, many techniques have been developed in order to optimize the memory accesses for these datasets. Different loop transformations and domain decompositions are com- monly used for structured meshes. However, unstructured grids are more challenging. The memory accesses, based on the mesh connectivity, do not map well to the usual lin- ear memory model. This work presents a method to improve the memory performance which is suitable for HPC codes that operate on meshes. We develop a method to adjust the sequence in which the data are used inside the algorithm, by means of traversing and sorting the mesh. This sorted mesh can be tra...
In this paper we present a novel algorithm for adaptive mesh refinement in computational physics mes...
This paper describes a scheme to implement a shared address space of size m on an n-node mesh, with ...
Applications that exhibit regular memory access patterns usually benefit transparently from hardware...
Mesh traversal is a common and essential geometry processing problem in computer graphics. The trave...
AbstractThis paper addresses two key parallelization challenges the unstructured mesh-based ocean mo...
Many real-life applications of processor-arrays suffer from memory bandwidth limitations. In many ca...
We present a case study to improve the cache efficiency for a simulation on a tetrahedral bisection-...
Abstract—Increasingly, the main bottleneck limiting performance on emerging multi-core and many-core...
Abstract. Irregular and sparse scientific computing programs frequently experience performance losse...
Unstructured meshes provide a distinct advantage for handing complex geometries. However, the low ca...
We present a novel method for computing cache-oblivious layouts of large meshes that improve the per...
An important class of scientific codes access memory in an irregular manner. Because irregular acce...
Unstructured-mesh based numerical algorithms such as finite volume and finite element algorithms for...
Bus traffic between the graphics subsystem and memory can become a bottleneck when rendering geometr...
Progressive mesh is a data structure that encodes a continuous spectrum of mesh approximations. Slid...
In this paper we present a novel algorithm for adaptive mesh refinement in computational physics mes...
This paper describes a scheme to implement a shared address space of size m on an n-node mesh, with ...
Applications that exhibit regular memory access patterns usually benefit transparently from hardware...
Mesh traversal is a common and essential geometry processing problem in computer graphics. The trave...
AbstractThis paper addresses two key parallelization challenges the unstructured mesh-based ocean mo...
Many real-life applications of processor-arrays suffer from memory bandwidth limitations. In many ca...
We present a case study to improve the cache efficiency for a simulation on a tetrahedral bisection-...
Abstract—Increasingly, the main bottleneck limiting performance on emerging multi-core and many-core...
Abstract. Irregular and sparse scientific computing programs frequently experience performance losse...
Unstructured meshes provide a distinct advantage for handing complex geometries. However, the low ca...
We present a novel method for computing cache-oblivious layouts of large meshes that improve the per...
An important class of scientific codes access memory in an irregular manner. Because irregular acce...
Unstructured-mesh based numerical algorithms such as finite volume and finite element algorithms for...
Bus traffic between the graphics subsystem and memory can become a bottleneck when rendering geometr...
Progressive mesh is a data structure that encodes a continuous spectrum of mesh approximations. Slid...
In this paper we present a novel algorithm for adaptive mesh refinement in computational physics mes...
This paper describes a scheme to implement a shared address space of size m on an n-node mesh, with ...
Applications that exhibit regular memory access patterns usually benefit transparently from hardware...