Microprocessor designs are now changing to reflect the ending of Dennard Scaling. This leads to a reconsideration of design tradeoffs for designing discretization methods for PDEs based on simplified performance models like Roofline.In this work we carry out an end-to-end analysis and implementation study on a Cray XC40 with Intel⃝R XeonTM E5-2698 v3 processors for the Method of Local Corrections (MLC). MLC is a non-iterative method for solving Poisson’s Equation on locally rectangular meshes. The Roofline model predicts that MLC should have faster time to solution than traditional iterative methods such as Geometric Multigrid. We find that Roofline is a useful guide for performance engineering and obtain perfor- mance within a factor of 3 ...
New adaptive local refinement (ALR) strategies are developed, the goal of which is to reach a given ...
The life-cycle of a partial differential equation (PDE) solver is often characterized by three devel...
Preconditioned iterative solver is one of the most powerful choice such as IC (Incomplete Cholesky) ...
Microprocessor designs are now changing to reflect the ending of Dennard Scaling. This leads to a re...
We present a second-order accurate algorithm for solving the free-space Poisson’s equation on a loca...
We present a second-order accurate algorithm for solving the free-space Poisson's equation on a loc...
We present a second-order accurate algorithm for solving thefree-space Poisson's equation on a local...
We study the local defect correction (LDC) method, introduced in [7]. We focus on the behavior of LD...
Two acceleration techniques, based on additive corrections are evaluated with a multithreaded 2D Poi...
AbstractTwo acceleration techniques, based on additive corrections are evaluated with a multithreade...
n. Itroduction The architectural differences between a serial and a parallel machine raise a number ...
AbstractMultiprocessor systems offer large gains in performance if algorithms for real problems can ...
Two block cyclic reduction linear system solvers are considered and implemented using the OpenCL fr...
In this paper, we discuss some of the issues in obtaining high performance for block-structured adap...
The end of Dennard scaling signaled a shift in HPC supercomputer architectures from systems built fr...
New adaptive local refinement (ALR) strategies are developed, the goal of which is to reach a given ...
The life-cycle of a partial differential equation (PDE) solver is often characterized by three devel...
Preconditioned iterative solver is one of the most powerful choice such as IC (Incomplete Cholesky) ...
Microprocessor designs are now changing to reflect the ending of Dennard Scaling. This leads to a re...
We present a second-order accurate algorithm for solving the free-space Poisson’s equation on a loca...
We present a second-order accurate algorithm for solving the free-space Poisson's equation on a loc...
We present a second-order accurate algorithm for solving thefree-space Poisson's equation on a local...
We study the local defect correction (LDC) method, introduced in [7]. We focus on the behavior of LD...
Two acceleration techniques, based on additive corrections are evaluated with a multithreaded 2D Poi...
AbstractTwo acceleration techniques, based on additive corrections are evaluated with a multithreade...
n. Itroduction The architectural differences between a serial and a parallel machine raise a number ...
AbstractMultiprocessor systems offer large gains in performance if algorithms for real problems can ...
Two block cyclic reduction linear system solvers are considered and implemented using the OpenCL fr...
In this paper, we discuss some of the issues in obtaining high performance for block-structured adap...
The end of Dennard scaling signaled a shift in HPC supercomputer architectures from systems built fr...
New adaptive local refinement (ALR) strategies are developed, the goal of which is to reach a given ...
The life-cycle of a partial differential equation (PDE) solver is often characterized by three devel...
Preconditioned iterative solver is one of the most powerful choice such as IC (Incomplete Cholesky) ...