The design of modern parallel machines leads to powerful machines, but with complex architectures and hierarchical topologies. As a result, communication overheads associated with hardware asymmetry and interconnection network increase. In order to achieve scalable performances on these machines, it is essential to reduce communication costs on parallel applications such as CP2K. From computational chemistry domain, CP2K is a real-world parallel application that performs atomistic and molecular simulations. A linear-scaling density functional theory implementation based on an efficient sparse linear algebra kernel allows CP2K to simulate a million of atoms. Since this kernel is communication bound, the hardware asymmetry and interconnection...
Parallel sparse matrix-matrix multiplication algorithms (PSpGEMM) spend most of their running time o...
This report describes the work undertaken under PRACE-1IP to support the European scientific communi...
CP2K is a powerful materials science and computational chemistry code and is widely used by research...
The design of modern parallel machines leads to powerful machines, but with complex architectures an...
CP2K is a freely available atomistic and molecular simulation code, able to study of a wide range of...
In parallel computing environments from multicore systems to cloud computers and supercomputers, dat...
Recent years have witnessed a tremendous surge of interest in accelerating sparse linear algebra app...
Recent years have witnessed a tremendous surge of interest in accelerating sparse linear algebra app...
Parallelizing sparse irregular application on distributed memory systems poses serious scalability c...
Paper presented at CUG 2010, EdinburghCP2K is a freely available and increasingly popular Density Fu...
Sparse matrix operations dominate the cost of many scientific applications. In parallel, the perform...
Sparse matrix operations dominate the cost of many scientific applications. In parallel, the perform...
Compared to the customary column-oriented approaches, block-oriented, distributed-memory sparse Chol...
Communication and topology aware process mapping is a powerful approach to reduce communication time...
We show teraflop performance of the fully featured ab initio molecular dynamics code CPMD on an IBM ...
Parallel sparse matrix-matrix multiplication algorithms (PSpGEMM) spend most of their running time o...
This report describes the work undertaken under PRACE-1IP to support the European scientific communi...
CP2K is a powerful materials science and computational chemistry code and is widely used by research...
The design of modern parallel machines leads to powerful machines, but with complex architectures an...
CP2K is a freely available atomistic and molecular simulation code, able to study of a wide range of...
In parallel computing environments from multicore systems to cloud computers and supercomputers, dat...
Recent years have witnessed a tremendous surge of interest in accelerating sparse linear algebra app...
Recent years have witnessed a tremendous surge of interest in accelerating sparse linear algebra app...
Parallelizing sparse irregular application on distributed memory systems poses serious scalability c...
Paper presented at CUG 2010, EdinburghCP2K is a freely available and increasingly popular Density Fu...
Sparse matrix operations dominate the cost of many scientific applications. In parallel, the perform...
Sparse matrix operations dominate the cost of many scientific applications. In parallel, the perform...
Compared to the customary column-oriented approaches, block-oriented, distributed-memory sparse Chol...
Communication and topology aware process mapping is a powerful approach to reduce communication time...
We show teraflop performance of the fully featured ab initio molecular dynamics code CPMD on an IBM ...
Parallel sparse matrix-matrix multiplication algorithms (PSpGEMM) spend most of their running time o...
This report describes the work undertaken under PRACE-1IP to support the European scientific communi...
CP2K is a powerful materials science and computational chemistry code and is widely used by research...