This paper discusses the implementation of one-factor and three-factor PDE models on GPUs. Both explicit and im-plicit time-marching methods are considered, with the lat-ter requiring the solution of multiple tridiagonal systems of equations. Because of the small amount of data involved, one-factor models are primarily compute-limited, with a very good fraction of the peak compute capability being achieved. The key to the performance lies in the heavy use of registers and shuffle instructions for the explicit method, and a non-standard hybrid Thomas/PCR algorithm for solving the tridi-agonal systems for the implicit solver. The three-factor problems involve much more data, and hence their execution is more evenly balanced between com-putati...
This paper presents the use of three-dimensional indexing available in graphic processing units (GPU...
This paper discusses the main performance barriers for solving a large number of independent ordinar...
The Fast Multipole Method allows the rapid evaluation of sums of radial basis functions centered at ...
This paper discusses the implementation of one-factor and three-factor PDE models on GPUs. Both expl...
The solution of tridiagonal system of equations using graphic processing units (GPU) is assessed. Th...
We study the performance of three parallel algorithms and their hybrid variants for solving tridiago...
A new high-performance general-purpose graphics processing unit (GPGPU) computational fluid dynamics...
Coupling commodity CPUs and modern GPUs give you heterogeneous systems that are cheap, high-performa...
Engineering, scientific, and financial applications often require the simultaneous solution of a lar...
This thesis spans several research areas, where the main topics being parallel programming based on ...
The standard simplex method is a well-known optimization algorithm for solving linear programming mo...
International audienceThe Simplex algorithm is a well known method to solve linear programming (LP) ...
Abstract—The advent of general purpose graphics processing units (GPGPU’s) brings about a whole new ...
The advances in multi-core architecture for general-purpose computing in the past decade have tremen...
A computational Fluid Dynamics (CFD) code for steady simulations solves a set of non-linear partial ...
This paper presents the use of three-dimensional indexing available in graphic processing units (GPU...
This paper discusses the main performance barriers for solving a large number of independent ordinar...
The Fast Multipole Method allows the rapid evaluation of sums of radial basis functions centered at ...
This paper discusses the implementation of one-factor and three-factor PDE models on GPUs. Both expl...
The solution of tridiagonal system of equations using graphic processing units (GPU) is assessed. Th...
We study the performance of three parallel algorithms and their hybrid variants for solving tridiago...
A new high-performance general-purpose graphics processing unit (GPGPU) computational fluid dynamics...
Coupling commodity CPUs and modern GPUs give you heterogeneous systems that are cheap, high-performa...
Engineering, scientific, and financial applications often require the simultaneous solution of a lar...
This thesis spans several research areas, where the main topics being parallel programming based on ...
The standard simplex method is a well-known optimization algorithm for solving linear programming mo...
International audienceThe Simplex algorithm is a well known method to solve linear programming (LP) ...
Abstract—The advent of general purpose graphics processing units (GPGPU’s) brings about a whole new ...
The advances in multi-core architecture for general-purpose computing in the past decade have tremen...
A computational Fluid Dynamics (CFD) code for steady simulations solves a set of non-linear partial ...
This paper presents the use of three-dimensional indexing available in graphic processing units (GPU...
This paper discusses the main performance barriers for solving a large number of independent ordinar...
The Fast Multipole Method allows the rapid evaluation of sums of radial basis functions centered at ...