LGDec 22, 2020

The Life and Death of SSDs and HDDs: Similarities, Differences, and Prediction Models

arXiv:2012.12373v14 citations
AI Analysis

This research provides insights into the failure mechanisms of HDDs and SSDs, aiding data center operators in predicting and mitigating storage device failures, potentially reducing downtime.

This study analyzes six years of field data from 100,000 HDDs and 30,000 SSDs to characterize failure conditions and root causes. It found that HDD failures are better distinguished by head positioning time rather than age, while SSDs exhibit high infant mortality, and developed accurate machine learning models for failure prediction.

Data center downtime typically centers around IT equipment failure. Storage devices are the most frequently failing components in data centers. We present a comparative study of hard disk drives (HDDs) and solid state drives (SSDs) that constitute the typical storage in data centers. Using a six-year field data of 100,000 HDDs of different models from the same manufacturer from the BackBlaze dataset and a six-year field data of 30,000 SSDs of three models from a Google data center, we characterize the workload conditions that lead to failures and illustrate that their root causes differ from common expectation but remain difficult to discern. For the case of HDDs we observe that young and old drives do not present many differences in their failures. Instead, failures may be distinguished by discriminating drives based on the time spent for head positioning. For SSDs, we observe high levels of infant mortality and characterize the differences between infant and non-infant failures. We develop several machine learning failure prediction models that are shown to be surprisingly accurate, achieving high recall and low false positive rates. These models are used beyond simple prediction as they aid us to untangle the complex interaction of workload characteristics that lead to failures and identify failure root causes from monitored symptoms.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes