Massively parallel computing systems are being built with thousands of nodes. The interconnection network plays a key role in the performance of such systems. However, the high number of components significantly increases the probability of failure. Additionally, failures in the interconnection network may isolate a large fraction of the machine. It is therefore critical to provide an efficient fault-tolerant mechanism to keep the system running, even in the presence of faults. This paper presents a new fault-tolerant routing methodology that does not degrade performance in the absence of faults and tolerates a reasonably large number of faults without disabling any healthy node. In order to avoid faults, for some source-destinati...
This paper presents a software-based approach to fault-tolerant routing in networks using wormhole o...
An online fault-tolerant routing algorithm for 2DMesh Networks-on-Chip is prese...
A technique to enhance multicomputer routers for fault-tolerant routing with modest increas...
ISBN 978-1-4244-7628-2. Future applications will require processors with many co...
In this paper, we propose an adaptive and deadlock-free routing algorithm to tolerate ir...
Exascale computing systems are being built with thousands of nodes. The high number of components of...
Massively parallel computing systems are being built with hundreds or thousands of components such a...
In this thesis, we present fault-tolerant routing policies based on concepts of adaptability and dea...
The intensive and continuous use of high-performance computers for executing computationally intensi...
The intensive and continuous use of high-performance computers for executing computationally intensi...
In this thesis, we present fault-tolerant routing policies based on concepts of adaptability and dea...
In this thesis, we present fault-tolerant routing policies based on concepts of adaptability and dea...
This paper presents a software based approach to fault-tolerant routing in networks using wormhole o...
Adaptive and fault-tolerant schemes for routing messages in a 2D torus interconnection netwo...
Currently, clusters of PCs are considered a cost-effective alternative to large parallel computers. ...