Computing grids consist of a large-scale, highly-distributed hardware architecture, often built in a hierarchical way, as cluster federations. At such scales, failures are no longer exceptions, but part of the normal behavior. When designing software for grids, developers have to take failures into account, in order to be able to provide a stable service. The fault-tolerance mechanisms need to be validated and evaluated. It is therefore crucial to make experiments at a large scale, with various volatility conditions, in order to measure the impact of failures on the whole system. This paper presents an experimental tool allowing the user to control the volatility conditions during a practical evaluation of fault-tolerant systems. The tool i...
International audienceDistributed computing infrastructures support system and network fault-toleran...
Building an infrastructure for exascale applications requires, in addition to many other key compone...
Dans un réseau constitué de plusieurs milliers d ordinateurs, l apparition de fautes est inévitable....
Computing grids consist of a large-scale, highly-distributed hardware architecture, often built in a...
Selected for publication in the post-conference bookComputing grids are large-scale, highly-distribu...
This work deals with high performance computing on large scale platforms like computing grids. Compu...
ap por t de r ech er ch e INSTITUT NATIONAL DE RECHERCHE EN INFORMATIQUE ET EN AUTOMATIQUE Using fai...
The paper introduces FTAPE (Fault Tolerance And Performance Evaluator), a tool that can be used to c...
One of the topics of paramount importance in the development of Cluster and Grid middleware is the i...
La tolérance et la gestion des fautes dans les grilles de données/calcul est d’une importance capita...
The construction of grid computing is one of the major research on networked computer systems. The m...
A l'ère de l’informatique omniprésente et à la demande, où les applications et les services sont dép...
Dependability evaluation involves the study of failures and errors. The destructive nature of a cras...
This paper describes FTAPE (Fault Tolerance And Performance Evaluator), a tool that can be used to c...
L'objectif principal de cette thèse est de développer des techniques d'analyse et mitigation capable...
International audienceDistributed computing infrastructures support system and network fault-toleran...
Building an infrastructure for exascale applications requires, in addition to many other key compone...
Dans un réseau constitué de plusieurs milliers d ordinateurs, l apparition de fautes est inévitable....
Computing grids consist of a large-scale, highly-distributed hardware architecture, often built in a...
Selected for publication in the post-conference bookComputing grids are large-scale, highly-distribu...
This work deals with high performance computing on large scale platforms like computing grids. Compu...
ap por t de r ech er ch e INSTITUT NATIONAL DE RECHERCHE EN INFORMATIQUE ET EN AUTOMATIQUE Using fai...
The paper introduces FTAPE (Fault Tolerance And Performance Evaluator), a tool that can be used to c...
One of the topics of paramount importance in the development of Cluster and Grid middleware is the i...
La tolérance et la gestion des fautes dans les grilles de données/calcul est d’une importance capita...
The construction of grid computing is one of the major research on networked computer systems. The m...
A l'ère de l’informatique omniprésente et à la demande, où les applications et les services sont dép...
Dependability evaluation involves the study of failures and errors. The destructive nature of a cras...
This paper describes FTAPE (Fault Tolerance And Performance Evaluator), a tool that can be used to c...
L'objectif principal de cette thèse est de développer des techniques d'analyse et mitigation capable...
International audienceDistributed computing infrastructures support system and network fault-toleran...
Building an infrastructure for exascale applications requires, in addition to many other key compone...
Dans un réseau constitué de plusieurs milliers d ordinateurs, l apparition de fautes est inévitable....