A new direct linear equation solver is proposed for GPUs. The proposed solver is applied to mechanical system analysis. In contrast to the DFS post-order traversal which is widely used for conventional implementation of supernodal and multifrontal methods, the BFS reverse-level-order traversal has been adopted to obtain more parallelism and a more adaptive control of data size. The proposed implementation allows solving large problems efficiently on many kinds of GPUs. Separators are divided into smaller blocks to further improve the parallel efficiency. Numerical experiments show that the proposed method takes smaller factorization time than CHOLMOD in general and has better operational availability than SPQR. Mechanical dynamic analysis h...
We present graphics processing unit (GPU) data structures and algorithms to efficiently solve sparse...
This paper describes a numerical method for the parallel solution of the differential measure inclus...
Abstract—We have previously suggested mixed precision iterative solvers specifically tailored to the...
This research proposes an implementation of effective direct linear equation solver for mechanical m...
This research proposes an effective implementation of linear equation solver for an implicit integra...
It is the purpose of this paper to provide an acceleration of waveform relaxation (WR) methods for ...
In finite element software one has to solve a system of non-linear equations, which is commonly simp...
International audienceWe present a set of methods to implement an implicit Finite Element solver on ...
This paper describes an approach to numerically approximate the time evolution of multibody systems ...
This paper presents a Graphics Processing Unit (GPU) acceleration of an iteration-based discrete vel...
doi: 10.1007/978-3-662-43880-0_1We present ideas and first results on a GPU accelerationof a non-lin...
AbstractCurrent trends in high performance computing (HPC) are advancing towards the use of graphics...
This paper discusses the implementation of one-factor and three-factor PDE models on GPUs. Both expl...
Abstract. Algebraic multigrid methods for large, sparse linear systems are a necessity in many compu...
We present graphics processing unit (GPU) data structures and algorithms to efficiently solve sparse...
We present graphics processing unit (GPU) data structures and algorithms to efficiently solve sparse...
This paper describes a numerical method for the parallel solution of the differential measure inclus...
Abstract—We have previously suggested mixed precision iterative solvers specifically tailored to the...
This research proposes an implementation of effective direct linear equation solver for mechanical m...
This research proposes an effective implementation of linear equation solver for an implicit integra...
It is the purpose of this paper to provide an acceleration of waveform relaxation (WR) methods for ...
In finite element software one has to solve a system of non-linear equations, which is commonly simp...
International audienceWe present a set of methods to implement an implicit Finite Element solver on ...
This paper describes an approach to numerically approximate the time evolution of multibody systems ...
This paper presents a Graphics Processing Unit (GPU) acceleration of an iteration-based discrete vel...
doi: 10.1007/978-3-662-43880-0_1We present ideas and first results on a GPU accelerationof a non-lin...
AbstractCurrent trends in high performance computing (HPC) are advancing towards the use of graphics...
This paper discusses the implementation of one-factor and three-factor PDE models on GPUs. Both expl...
Abstract. Algebraic multigrid methods for large, sparse linear systems are a necessity in many compu...
We present graphics processing unit (GPU) data structures and algorithms to efficiently solve sparse...
We present graphics processing unit (GPU) data structures and algorithms to efficiently solve sparse...
This paper describes a numerical method for the parallel solution of the differential measure inclus...
Abstract—We have previously suggested mixed precision iterative solvers specifically tailored to the...