A supercomputer is a repairable system with a large number of compute nodes interconnected to work in harmony and achieve superior computational performance. The reliability of such a complex system depends on an effective maintenance strategy that involves both emergency and preventive maintenance. This thesis analyzes the maintenance records of four supercomputers operating at The National Institute of Computational Science, located at Oak Ridge National Laboratory. We propose using the generalized proportional intensities model (GPIM) to model the maintenance interrupts, because it captures both reliability and maintenance parameters and accommodates both emergency and preventive maintenance. We use this model to obtain...
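To make the idea of a proportional intensities model concrete, here is a minimal sketch of one form that appears in the repairable-systems literature. It is an illustration under stated assumptions, not the thesis's fitted model: the power-law baseline, the constant scaling factors `rho` (applied per preventive action) and `sigma` (applied per emergency repair), and all function and parameter names are hypothetical choices made for this example.

```python
from bisect import bisect_right


def power_law_baseline(t, alpha, beta):
    """Assumed baseline intensity alpha * beta * t**(beta - 1), for t > 0."""
    return alpha * beta * t ** (beta - 1)


def gpim_intensity(t, pm_times, cm_times, alpha, beta, rho, sigma):
    """Failure intensity at time t under an assumed GPIM-style form.

    The baseline is multiplied by rho once for every preventive
    maintenance action and by sigma once for every emergency
    (corrective) repair that has occurred at or before time t.
    """
    n_pm = bisect_right(sorted(pm_times), t)  # preventive actions so far
    n_cm = bisect_right(sorted(cm_times), t)  # emergency repairs so far
    return power_law_baseline(t, alpha, beta) * rho ** n_pm * sigma ** n_cm
```

In this sketch a value of `rho` below 1 means each preventive action lowers the failure intensity, while a value of `sigma` above 1 means each emergency repair leaves the system more failure-prone. In practice such parameters would be estimated from the maintenance records, for example by maximum likelihood.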
This paper deals with a predictive maintenance policy for a continuously deter...
Unplanned downtimes caused by system failures incur high costs for many complex systems such as wind...
Most modern systems are equipped with very complex, expensive, and high technology components whose ...
Grantor: University of Toronto. This thesis focuses on modeling and optimization of maintena...
The paper presents the explicit expressions for measuring reliability parameter of systems, Graphs ...
This paper describes how multiple preventative maintenance (PM) activities can be modeled in the ava...
Stochastic models are developed for the reliability analysis of repairable systems, based upon the n...
The modeling and design of a fault-tolerant multiprocessor system is addressed in this dissertation....
The growing importance of maintenance in the evolving industrial scenario and ...
The Lincoln College Centre for Computing and Biometrics (CCB) administers and maintains much electro...
The use of mathematical modeling for the purpose of analyzing and optimizing the performance of repa...
The objective of the proposed research is to develop statistical algorithms for controlling failure ...
In this paper, we consider the maintenance scheduling of a group of M identical machines, the perfor...
In this paper, a modelling approach is presented to assess the performance of ...
A typical maintenance organisation has responsibility in keeping the production facility running at ...