
It is estimated that over 90% of all new information produced in the world is being stored on magnetic media, most of it on hard disk drives. Despite their importance, there is relatively little published work on the failure patterns of disk drives, and the key factors that affect their lifetime. Most available data are either based on extrapolation from accelerated aging experiments or from relatively modest-sized field studies. Moreover, larger population studies rarely have the infrastructure in place to collect health signals from components in operation, which is critical information for detailed failure analysis. We present data collected from detailed observations of a large disk drive population in a production Internet services deployment. The population observed is many times larger than that of previous studies. In addition to presenting failure statistics, we analyze the correlation between failures and several parameters generally believed to impact longevity. Our analysis identifies several parameters from the drive's self-monitoring facility (SMART) that correlate highly with failures. Despite this high correlation, we conclude that models based on SMART parameters alone are unlikely to be useful for predicting individual drive failures. Surprisingly, we found that temperature and activity levels were much less correlated with drive failures than previously reported.


Google study of more than 100,000 tested HDDs, aimed at finding the causes of failures.
