International audienceScalable shared memory multiprocessors are promising architectures to achieve teraflops computational power. As they contain a large number of processor and memory elements, such machines have a high probability of failure. In this paper, we investigate an approach based on backward error recovery to provide a highly available scalable shared memory architecture tolerating transient and permanent processor and memory failures
Soft errors are adding another dimension to the present day architecture design space. Different tec...
The always increasing performance demands of applications such as cryptography, scientific simulatio...
System reliability is an important issue in designing modern multiprocessor systems. This paper prop...
International audienceScalable shared memory multiprocessors are promising architectures to achieve ...
In this paper, we focus on the problem of recovering processor failures in shared memory multiproces...
International audienceDue to the increasing number of their components, Scalable Shared Memory Multi...
: Distributed Shared Memory (dsm) architectures are attractive to execute high performance parallel ...
: COMAs (Cache Only Memory Architectures) are an interesting class of large scale shared memory mult...
International audienceDistributed Shared Memory (DSM) architectures are attractive to execute high p...
The concept of backward recovery is now well established as a means of restoring a consistent state ...
Due to the increasing number of their components, Scalable Shared Memory Multiprocessors (SSMMs) hav...
This thesis focuses on the issue of reliability and fault tolerance in Distributed Shared Memory Mul...
L'augmentation continue de la puissance de calcul requise par les applications telles que la cryptog...
A network multicomputer is a multiprocessor in which the processors are connected by general-purpose...
Abstract—Reducing device dimensions, increasing transistor densities, and smaller timing windows, ex...
Soft errors are adding another dimension to the present day architecture design space. Different tec...
The always increasing performance demands of applications such as cryptography, scientific simulatio...
System reliability is an important issue in designing modern multiprocessor systems. This paper prop...
International audienceScalable shared memory multiprocessors are promising architectures to achieve ...
In this paper, we focus on the problem of recovering processor failures in shared memory multiproces...
International audienceDue to the increasing number of their components, Scalable Shared Memory Multi...
: Distributed Shared Memory (dsm) architectures are attractive to execute high performance parallel ...
: COMAs (Cache Only Memory Architectures) are an interesting class of large scale shared memory mult...
International audienceDistributed Shared Memory (DSM) architectures are attractive to execute high p...
The concept of backward recovery is now well established as a means of restoring a consistent state ...
Due to the increasing number of their components, Scalable Shared Memory Multiprocessors (SSMMs) hav...
This thesis focuses on the issue of reliability and fault tolerance in Distributed Shared Memory Mul...
L'augmentation continue de la puissance de calcul requise par les applications telles que la cryptog...
A network multicomputer is a multiprocessor in which the processors are connected by general-purpose...
Abstract—Reducing device dimensions, increasing transistor densities, and smaller timing windows, ex...
Soft errors are adding another dimension to the present day architecture design space. Different tec...
The always increasing performance demands of applications such as cryptography, scientific simulatio...
System reliability is an important issue in designing modern multiprocessor systems. This paper prop...