We found a match
Your institution may have rights to this item. Sign in to continue.
- Title
Predicting Hard Disk Failure by Means of Automatized Labeling and Machine Learning Approach.
- Authors
Gargiulo, Federico; Duellmann, Dirk; Arpaia, Pasquale; Schiano Lo Moriello, Rosario
- Abstract
Today, cloud systems provide many key services to development and production environments; reliable storage services are crucial for a multitude of applications ranging from commercial manufacturing, distribution and sales up to scientific research, which is often at the forefront of computing resource demands. In large-scale computer centers, the storage system requires particular attention and investment; usually, a large number of diverse storage devices need to be deployed in order to match the varying performance and volume requirements of changing user applications. As of today, magnetic drives still play a dominant role in terms of deployed storage volume and of service outages due to device failure. In this paper, we study methods to facilitate automated proactive disk replacement. We propose a method to identify disks with media failures in a production environment and describe an application of supervised machine learning to predict disk failures. In particular, a proper stage to automatically label (healthy/at-risk) the disks during the training and validation stage is presented along with tuning strategy to optimize the hyperparameters of the associated machine learning classifier. The approach is trained and validated against a large set of 65,000 hard drives in the CERN computer center, and the achieved results are discussed.
- Subjects
EUROPEAN Organization for Nuclear Research; HARD disks; SUPERVISED learning; MACHINE learning; OPTICAL disks; CLOUD storage; COMPUTATION laboratories; ON-demand computing
- Publication
Applied Sciences (2076-3417), 2021, Vol 11, Issue 18, p8293
- ISSN
2076-3417
- Publication type
Article
- DOI
10.3390/app11188293