HMM Optimized Modeling of SSD Storage for I/O MapReduce Workloads

被引:0
|
作者
Alsayoud, Fatimah [1 ]
Miri, Ali [1 ]
机构
[1] Ryerson Univ, Dept Comp Sci, Toronto, ON, Canada
关键词
Flash resource management; R/W ratio; IO patterns; Hidden Markov Model; Storage policies; MapReduce Workloads;
D O I
10.1109/iemcon.2019.8936243
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Flash-based SSD draws a considerable interest in big data platforms due to its performance and reliability. However, it still has limited usage as a result of its high cost and limited capacity. Control SSD provisioning on big data platforms reduce storage cost and guarantees performance. The workload is an essential SSD provisioning sources, thus analyzing the characteristics of the workloads would help optimize SSD management design. There is a significant correlation between the workload's IO patterns and the SSD cost and performance. Big data platforms with multi-stage architecture bring challenges into modeling IO patterns where each stage has it is unique IO patterns. Also, big data platforms run on a distributed environment where the workloads are interacting with local and remote storage during the execution. The designed HMM-based IO patterns model considers IO patterns for MapReduce workloads at different stages and different SSD locations. In this paper, we proposed a platform-level SSD, cost-efficiency controller. The controller is responsible for maximizing the SSD lifespan on the Hadoop platform through two phases. First, modeling MapReduce workload's IO patterns by employing the Hidden Markov Model (HMM). Then, defining platform-level SSD allocation policies. The designed allocation policies reduce SSD utilization and improve SSD lifespan on Hadoop by up to %40 compared to static allocation policies.
引用
收藏
页码:177 / 183
页数:7
相关论文
共 50 条
  • [11] I/O characterization and performance evaluation of large-scale storage architectures for heterogeneous workloads
    Kogiou, Olga
    Devarajan, Hariharan
    Wang, Chen
    Yu, Weikuan
    Mohror, Kathryn
    2023 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING WORKSHOPS, CLUSTER WORKSHOPS, 2023, : 44 - 45
  • [12] A Partners Assisted Virtual Machine Live Storage Migration for Intensive Disk I/O Workloads
    Jin, Xing
    Wang, Hongbo
    Wang, Jianjian
    Cheng, Shiduan
    Li, Jihan
    2013 IEEE 15TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2013 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (HPCC_EUC), 2013, : 1693 - 1698
  • [13] Implications of I/O for gang scheduled workloads
    Lee, W
    Frank, M
    Lee, V
    Mackenzie, K
    Rudolph, L
    JOB SCHEDULING STRATEGIES FOR PARALLEL PROCESSING, 1997, 1291 : 215 - 237
  • [14] IOscope: A Flexible I/O Tracer for Workloads' I/O Pattern Characterization
    Saif, Abdulqawi
    Nussbaum, Lucas
    Song, Ye-Qiong
    HIGH PERFORMANCE COMPUTING, ISC HIGH PERFORMANCE 2018, 2018, 11203 : 103 - 116
  • [15] I/O Performance Modeling of Virtualized Storage Systems
    Noorshams, Qais
    Rostami, Kiana
    Kounev, Samuel
    Tuma, Petr
    Reussner, Ralf
    2013 IEEE 21ST INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS & SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS 2013), 2013, : 121 - +
  • [16] A Modeling Language for MapReduce Programing in a Storage System Perspective
    Yuxin Jing
    Hanpin Wang
    Yu Huang
    Lei Zhang
    Yongzhi Cao
    Journal of Signal Processing Systems, 2018, 90 : 1133 - 1150
  • [17] A Modeling Language for MapReduce Programing in a Storage System Perspective
    Jing, Yuxin
    Wang, Hanpin
    Huang, Yu
    Zhang, Lei
    Cao, Yongzhi
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, 90 (8-9): : 1133 - 1150
  • [18] Modeling pre-Exascale AMR Parallel I/O Workloads via Proxy Applications
    Godoy, William F.
    Delozier, Jenna
    Watson, Gregory R.
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2022), 2022, : 952 - 961
  • [19] Research on I/O cost of MapReduce join
    Song, Jie
    Li, Tian-Tian
    Zhu, Zhi-Liang
    Bao, Yu-Bin
    Yu, Ge
    Ruan Jian Xue Bao/Journal of Software, 2015, 26 (06): : 1438 - 1456
  • [20] Iris: An optimized I/O stack for low-latency storage devices
    Papagiannis, Anastasios
    Saloustros, Giorgos
    Marazakis, Manolis
    Bilas, Angelos
    OPERATING SYSTEMS REVIEW, 2016, 50 (02) : 3 - 11