The current trend in parallel computers is that systems with a large shared memory are becoming more and more popular. A shared memory system can be either a uniform memory architecture (UMA) or a cache coherent non-uniform memory architecture (cc-NUMA). In the present thesis, the performance of parallel PDE solvers on cc-NUMA computers is studied. In particular, we consider the shared namespace programming model, represented by OpenMP. Since the main memory is physically, or geographically distributed over several multi-processor nodes, the latency for local memory accesses is smaller than for remote accesses. Therefore, the geographical locality of the data becomes important. The questions posed in this thesis are: (1) How large is the in...
The OpenMP programming model is based upon the assumption of uniform memory access. Virtually all cu...
Cache Coherent Non-Uniform Memory Access (CC-NUMA) architectures have received strong interests from...
OpenMP has established itself as the de facto standard for parallel programming on shared-memory pla...
A trend in parallel computer architecture is that systems with a large shared memory are becoming mo...
International audienceWe introduce shared-memory parallelism in a parallel distributed-memory solver...
Scientific computing is used frequently in an increasing number of disciplines to accelerate scienti...
Computer simulations that solve partial differential equations (PDEs) are common in many fields of s...
Solving Partial Differential Equations (PDEs) is an important problem in many fields of science and ...
Abstract. OpenMP has become the dominant standard for shared memory pro-gramming. It is traditionall...
We consider computations associated with data parallel iterative solvers used for the numerical solu...
The majority of current HPC applications are composed of complex and irregular data structures that ...
Non-uniform memory access (NUMA) architectures are modern shared-memory, multi-core machines offerin...
Although there exist several approaches to rapidly solving the N-body problem, and a diversity of im...
Today's supercomputers often consists of clusters of SMP nodes. Both OpenMP and MPI are programming ...
We consider computations associated with data parallel iterative solvers used for the numerical solu...
The OpenMP programming model is based upon the assumption of uniform memory access. Virtually all cu...
Cache Coherent Non-Uniform Memory Access (CC-NUMA) architectures have received strong interests from...
OpenMP has established itself as the de facto standard for parallel programming on shared-memory pla...
A trend in parallel computer architecture is that systems with a large shared memory are becoming mo...
International audienceWe introduce shared-memory parallelism in a parallel distributed-memory solver...
Scientific computing is used frequently in an increasing number of disciplines to accelerate scienti...
Computer simulations that solve partial differential equations (PDEs) are common in many fields of s...
Solving Partial Differential Equations (PDEs) is an important problem in many fields of science and ...
Abstract. OpenMP has become the dominant standard for shared memory pro-gramming. It is traditionall...
We consider computations associated with data parallel iterative solvers used for the numerical solu...
The majority of current HPC applications are composed of complex and irregular data structures that ...
Non-uniform memory access (NUMA) architectures are modern shared-memory, multi-core machines offerin...
Although there exist several approaches to rapidly solving the N-body problem, and a diversity of im...
Today's supercomputers often consists of clusters of SMP nodes. Both OpenMP and MPI are programming ...
We consider computations associated with data parallel iterative solvers used for the numerical solu...
The OpenMP programming model is based upon the assumption of uniform memory access. Virtually all cu...
Cache Coherent Non-Uniform Memory Access (CC-NUMA) architectures have received strong interests from...
OpenMP has established itself as the de facto standard for parallel programming on shared-memory pla...