To realize the full potential of a high-performance computing system with a reconfigurable interconnect, there is a need to design algorithms for computing a topology that will allow for a high-throughput load distribution, while simultaneously partitioning the computational task graph of the application for the computed topology. In this paper, we propose a new framework that exploits such reconfigurable interconnects to achieve these interdependent goals, i.e., to iteratively co-optimize the network topology configuration, application partitioning and network flow routing to maximize throughput for a given application. We also present a novel way of computing a high-throughput initial topology based on the structural properties of the app...
Exascale performance will be delivered by systems composed of millions of interconnected computing c...
Exascale performance will be delivered by systems composed of millions of interconnected computing c...
Exascale performance will be delivered by systems composed of millions of interconnected computing c...
To realize the full potential of a high-performance computing system with a reconfigurable interconn...
The 2012 IEEE International Parallel and Distributed Symposium (IPDPS), 21-25 May 2012, Shanghai, Ch...
The orchestration of communication of distributed memory parallel applications on a parallel compute...
The Performance of a parallel algorithm depends in part on how the interconnection topology of the t...
Several coarse-grain reconfigurable architectures proposed recently consist of a large number of pro...
Several coarse-grain reconfigurable architectures proposed recently consist of a large number of pro...
Several coarse-grain reconfigurable architectures proposed recently consist of a large number of pro...
Several coarse-grain reconfigurable architectures proposed recently consist of a large number of pro...
Several coarse-grain reconfigurable architectures proposed recently consist of a large number of pro...
Several coarse-grain reconfigurable architectures proposed recently consist of a large number of pro...
Several coarse-grain reconfigurable architectures proposed recently consist of a large number of pro...
IPDPS 2013: IEEE Workshops & PhD Forum (IPDPSW), Boston (MA), USA, 20-24 May 2013Traditionally, a pa...
Exascale performance will be delivered by systems composed of millions of interconnected computing c...
Exascale performance will be delivered by systems composed of millions of interconnected computing c...
Exascale performance will be delivered by systems composed of millions of interconnected computing c...
To realize the full potential of a high-performance computing system with a reconfigurable interconn...
The 2012 IEEE International Parallel and Distributed Symposium (IPDPS), 21-25 May 2012, Shanghai, Ch...
The orchestration of communication of distributed memory parallel applications on a parallel compute...
The Performance of a parallel algorithm depends in part on how the interconnection topology of the t...
Several coarse-grain reconfigurable architectures proposed recently consist of a large number of pro...
Several coarse-grain reconfigurable architectures proposed recently consist of a large number of pro...
Several coarse-grain reconfigurable architectures proposed recently consist of a large number of pro...
Several coarse-grain reconfigurable architectures proposed recently consist of a large number of pro...
Several coarse-grain reconfigurable architectures proposed recently consist of a large number of pro...
Several coarse-grain reconfigurable architectures proposed recently consist of a large number of pro...
Several coarse-grain reconfigurable architectures proposed recently consist of a large number of pro...
IPDPS 2013: IEEE Workshops & PhD Forum (IPDPSW), Boston (MA), USA, 20-24 May 2013Traditionally, a pa...
Exascale performance will be delivered by systems composed of millions of interconnected computing c...
Exascale performance will be delivered by systems composed of millions of interconnected computing c...
Exascale performance will be delivered by systems composed of millions of interconnected computing c...