Reliability Characterization and Failure Prediction of 3D TLC SSDs in Large-Scale Storage Systems

被引:0
|
作者
Olmez, Serkay [1 ]
机构
[1] Seagate Technol, Seagate Res Grp, Longmont, CO 80503 USA
关键词
Reliability; Three-dimensional displays; Predictive models; Flash memories; Software reliability; Random forests; Hardware; Solid state drive (SSD); reliability; machine learning; prediction methods; data storage system;
D O I
10.1109/TDMR.2021.3077848
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
3D triple-level cell (TLC) NAND flash based solid state drive (SSD) is gradually becoming the dominant storage media in large-scale storage systems due to high storage density and low cost-per-bit. It ranks one of the top replaced hardware components in systems and their enormous amount also indirectly increases the failure probability, resulting in irreversible data loss disaster and service unavailability. This paper for the first time investigates system-level 3D TLC SSDs to characterize reliability and sub-health status based on field Self-Monitoring, Analysis and Reporting Technology (SMART) data, and predict impending failure proactively. We explore real-world datasets and derive some findings for each selected attribute in predetermined categories, contributing to the following feature selection and enhancing the interpretability of prediction models. Moreover, various machine learning models are trained to predict failures ahead of time, and experimental results show that random forest model can achieve 0.636 f(1)-score and 0.662 MCC for a 7-day prediction horizon, and 42.5% true positive rate (TPR) with 0.00% false positive rate (FPR). Different time window sizes, training set fractions and ratios of negative to positive are analyzed as well.
引用
收藏
页码:267 / 272
页数:6
相关论文
共 50 条
  • [1] Reliability of SSDs in Enterprise Storage Systems: A Large-Scale Field Study
    Maneas, Stathis
    Mahdaviani, Kaveh
    Emami, Tim
    Schroeder, Bianca
    [J]. ACM TRANSACTIONS ON STORAGE, 2021, 17 (01)
  • [2] Operational Characteristics of SSDs in Enterprise Storage Systems: A Large-Scale Field Study
    Maneas, Stathis
    Mandaviani, Kaveh
    Emami, Tim
    Schroeder, Bianca
    [J]. PROCEEDINGS OF THE 20TH USENIX CONFERENCE ON FILE AND STORAGE TECHNOLOGIES, FAST 2022, 2022, : 165 - 180
  • [3] Tools for Predicting the Reliability of Large-Scale Storage Systems
    Hall, Robert J.
    [J]. ACM TRANSACTIONS ON STORAGE, 2016, 12 (04)
  • [4] Proactive Drive Failure Prediction for Large Scale Storage Systems
    Zhu, Bingpeng
    Wang, Gang
    Liu, Xiaoguang
    Hu, Dianming
    Lin, Sheng
    Ma, Jingwei
    [J]. 2013 IEEE 29TH SYMPOSIUM ON MASS STORAGE SYSTEMS AND TECHNOLOGIES (MSST), 2013,
  • [5] Failure probability of goaf in large-scale based on simulation of FLAC(3D)
    Shang Zhen-hua
    Tang Shao-hui
    Jiao Wen-yu
    Liu Chang
    [J]. ROCK AND SOIL MECHANICS, 2014, 35 (10) : 3000 - 3006
  • [6] Occ3D: A Large-Scale 3D Occupancy Prediction Benchmark for Autonomous Driving
    Tian, Xiaoyu
    Jiang, Tao
    Yun, Longfei
    Mao, Yucheng
    Yang, Huitong
    Wang, Yue
    Wang, Yilun
    Zhao, Hang
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [7] Work-in-Process: Smart Migration for Reliability Enhancement of 3D TLC NAND Flash Storage Systems
    Du, Yazhi
    Gu, Jihua
    Xiao, Zhongzhe
    Huang, Min
    [J]. PROCEEDINGS OF THE 2020 INTERNATIONAL CONFERENCE ON COMPILERS, ARCHITECTURE, AND SYNTHESIS FOR EMBEDDED SYSTEMS (CASES), 2020, : 4 - 5
  • [8] 3D Stochastic Geometry Model for Large-Scale Molecular Communication Systems
    Deng, Yansha
    Noel, Adam
    Guo, Weisi
    Nallanathan, Arumugam
    Elkashlan, Maged
    [J]. 2016 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2016,
  • [9] Scalable 3D representation for 3D video in a large-scale space
    Kitahara, I
    Ohta, Y
    [J]. PRESENCE-VIRTUAL AND AUGMENTED REALITY, 2004, 13 (02): : 164 - 177
  • [10] 3D Laser Omnimapping for 3D Reconstruction of Large-Scale Scenes
    Hu, Shaoxing
    Zhang, Aiwu
    [J]. 2009 JOINT URBAN REMOTE SENSING EVENT, VOLS 1-3, 2009, : 688 - +