In modern data centers, storage system failures are major contributors to downtimes and maintenance costs. Predicting these failures by collecting measurements from disks and analyzing them with machine learning techniques can effectively reduce their impact, enabling timely maintenance. While there is a vast literature on this subject, most approaches attempt to predict hard disk failures using either classic machine learning solutions, such as Random Forests (RFs) or deep Recurrent Neural Networks (RNNs). In this work, we address hard disk failure prediction using Temporal Convolutional Networks (TCNs), a novel type of deep neural network for time series analysis. Using a real-world dataset, we show that TCNs outperform both RFs and RNNs....
Due to the rapidly increasing storage consumption worldwide, as well as the expectation of continuou...
Over the last few decades, reliability analysis has gained more and more attention as it can be bene...
The hard drive is one of the important components of a computing system, and its failure can lead to...
In modern data centers, storage system failures are major contributors to downtimes and maintenance ...
Various studies have attempted to predict individual disk failures based on the values of the SMART ...
Many modern hardware systems are equipped with sensors which record time-series diagnostic data. Th...
The increase in the production of digital information every second has raised the demand for storage...
Hard disk drive failures are one of the most common causes of service downtime in data centers. Pred...
With the increase in intelligence applications and services, like real-time video surveillance syste...
Digital data storage systems such as hard drives can suffer breakdowns that cause the loss of stored...
Time series data often involves big size environment that lead to high dimensionality problem. Many ...
International audienceIn order to reduce the consequences of hard drives failures which can lead to ...
We compare machine learning methods applied to a difficult real-world problem: predicting com-puter ...
Today, cloud systems provide many key services to development and production environments; reliable ...
From genomic sequencing to weather forecasting, high-performance computing systems (HPCs) have prof...
Due to the rapidly increasing storage consumption worldwide, as well as the expectation of continuou...
Over the last few decades, reliability analysis has gained more and more attention as it can be bene...
The hard drive is one of the important components of a computing system, and its failure can lead to...
In modern data centers, storage system failures are major contributors to downtimes and maintenance ...
Various studies have attempted to predict individual disk failures based on the values of the SMART ...
Many modern hardware systems are equipped with sensors which record time-series diagnostic data. Th...
The increase in the production of digital information every second has raised the demand for storage...
Hard disk drive failures are one of the most common causes of service downtime in data centers. Pred...
With the increase in intelligence applications and services, like real-time video surveillance syste...
Digital data storage systems such as hard drives can suffer breakdowns that cause the loss of stored...
Time series data often involves big size environment that lead to high dimensionality problem. Many ...
International audienceIn order to reduce the consequences of hard drives failures which can lead to ...
We compare machine learning methods applied to a difficult real-world problem: predicting com-puter ...
Today, cloud systems provide many key services to development and production environments; reliable ...
From genomic sequencing to weather forecasting, high-performance computing systems (HPCs) have prof...
Due to the rapidly increasing storage consumption worldwide, as well as the expectation of continuou...
Over the last few decades, reliability analysis has gained more and more attention as it can be bene...
The hard drive is one of the important components of a computing system, and its failure can lead to...