Abstract — Modern processors have multiple cores on a chip to overcome power consumption and heat dissipation issues. As more and more compute cores become available on a single node, it is expected that node-local communication will play an increasingly greater role in overall performance of parallel applications such as MPI applications. It is therefore crucial to optimize intra-node communication paths utilized by MPI libraries. In this paper, we propose a novel design of a kernel extension, called LiMIC2, for high-performance MPI intra-node communication over multi-core systems. LiMIC2 can minimize the communication overheads by implementing lightweight primitives and provide portability across different interconnects and flexibility fo...
This paper describes the basic concepts of our solution to improve the performance of Ethernet Commu...
Modern multi-core clusters are increasingly using GPUs to achieve higher performance and power effic...
This paper presents a new low-level communication subsystem called Nemesis. Nemesis has been designe...
The emergence of multicore processors raises the need to efficiently transfer large amounts of data ...
Summarization: Highly parallel systems are becoming mainstream in a wide range of sectors ranging fr...
Abstract—Modern high-speed interconnection networks are designed with capabilities to support commun...
Abstract—As the number of cores per node increases in modern clusters, intra-node communication effi...
MPI is one of the most widely used APIs for parallel supercomputing and appears to map well to a lar...
International audienceThe multiplication of cores in today's architectures raises the importance of ...
In exascale computing era, applications are executed at larger scale than ever before, whichresults ...
(eng) Running parallel applications on clusters with high-speed local networks requires fast communi...
Looking at the TOP 500 list of supercomputers we can see that different architectures and networking...
Abstract. This paper presents a method to efficiently place MPI pro-cesses on multicore machines. Si...
International audience—Power dissipation and energy consumption has become a major issue for high pe...
Parallel computing on clusters of workstations and personal computers has very high potential, since...
This paper describes the basic concepts of our solution to improve the performance of Ethernet Commu...
Modern multi-core clusters are increasingly using GPUs to achieve higher performance and power effic...
This paper presents a new low-level communication subsystem called Nemesis. Nemesis has been designe...
The emergence of multicore processors raises the need to efficiently transfer large amounts of data ...
Summarization: Highly parallel systems are becoming mainstream in a wide range of sectors ranging fr...
Abstract—Modern high-speed interconnection networks are designed with capabilities to support commun...
Abstract—As the number of cores per node increases in modern clusters, intra-node communication effi...
MPI is one of the most widely used APIs for parallel supercomputing and appears to map well to a lar...
International audienceThe multiplication of cores in today's architectures raises the importance of ...
In exascale computing era, applications are executed at larger scale than ever before, whichresults ...
(eng) Running parallel applications on clusters with high-speed local networks requires fast communi...
Looking at the TOP 500 list of supercomputers we can see that different architectures and networking...
Abstract. This paper presents a method to efficiently place MPI pro-cesses on multicore machines. Si...
International audience—Power dissipation and energy consumption has become a major issue for high pe...
Parallel computing on clusters of workstations and personal computers has very high potential, since...
This paper describes the basic concepts of our solution to improve the performance of Ethernet Commu...
Modern multi-core clusters are increasingly using GPUs to achieve higher performance and power effic...
This paper presents a new low-level communication subsystem called Nemesis. Nemesis has been designe...