International audienceTechnology evolution gives an easy access to high performance dedicated computing machines using, for example, GPUs or FPGAS. When designing algorithms dealing with highly structured multidimensional data, the real bottleneck is often linked to memory access. The strategies implemented in standard CPU cache architectures are no longer efficient due to the parallelism level and the inherent structure of data. This article presents the so-called "n-Dimensional Adaptive and Predictive Cache" (nD-AP Cache) architecture aiming at efficient data access for grid traversal. A theoretical model of the 3D version of the cache was setup in order to predict the cache efficiency for given statistical characteristics of the access s...
The central data structures for many applications in scientific computing are large multidimensional...
Abstract—On-chip caches are commonly used in computer systems to hide long off-chip memory access la...
Present and future distributed applications need to deal with very large PetaBytes (PB) datasets and...
International audienceSystem On Chip designs commonly use high performance data processing engines a...
International audiencePre-fetching in a memory hierarchy is known to alleviate the “memory wall” par...
International audienceWith the emergence of manycore architectures, the need of on-chip memories suc...
The parallelization of processors has led to a increased need of external memory bandwidth. As the n...
A major challenge when designing general purpose graphics hardware is to allow efficient access to t...
A well-designed cache system has positive impacts on the 3D real-time rendering engine. As the amoun...
Abstract. Many current computer designs employ caches and a hierarchical memory architec-ture. The s...
Figure 1: Three example scenes rendered using sample caching. Using caching during walkthrough anima...
. Many current computer designs employ caches and a hierarchical memory architecture. The speed of a...
AbstractIt is crucial to optimize stencil computations since they are the core (and most computation...
One important bottleneck when visualizing large data sets is the data transfer between processor and...
International audienceCaching is an important element to tackle performance issues in largely distri...
The central data structures for many applications in scientific computing are large multidimensional...
Abstract—On-chip caches are commonly used in computer systems to hide long off-chip memory access la...
Present and future distributed applications need to deal with very large PetaBytes (PB) datasets and...
International audienceSystem On Chip designs commonly use high performance data processing engines a...
International audiencePre-fetching in a memory hierarchy is known to alleviate the “memory wall” par...
International audienceWith the emergence of manycore architectures, the need of on-chip memories suc...
The parallelization of processors has led to a increased need of external memory bandwidth. As the n...
A major challenge when designing general purpose graphics hardware is to allow efficient access to t...
A well-designed cache system has positive impacts on the 3D real-time rendering engine. As the amoun...
Abstract. Many current computer designs employ caches and a hierarchical memory architec-ture. The s...
Figure 1: Three example scenes rendered using sample caching. Using caching during walkthrough anima...
. Many current computer designs employ caches and a hierarchical memory architecture. The speed of a...
AbstractIt is crucial to optimize stencil computations since they are the core (and most computation...
One important bottleneck when visualizing large data sets is the data transfer between processor and...
International audienceCaching is an important element to tackle performance issues in largely distri...
The central data structures for many applications in scientific computing are large multidimensional...
Abstract—On-chip caches are commonly used in computer systems to hide long off-chip memory access la...
Present and future distributed applications need to deal with very large PetaBytes (PB) datasets and...