HMM Optimized Modeling of SSD Storage for I/O MapReduce Workloads

被引:0
|
作者
Alsayoud, Fatimah [1 ]
Miri, Ali [1 ]
机构
[1] Ryerson Univ, Dept Comp Sci, Toronto, ON, Canada
关键词
Flash resource management; R/W ratio; IO patterns; Hidden Markov Model; Storage policies; MapReduce Workloads;
D O I
10.1109/iemcon.2019.8936243
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Flash-based SSD draws a considerable interest in big data platforms due to its performance and reliability. However, it still has limited usage as a result of its high cost and limited capacity. Control SSD provisioning on big data platforms reduce storage cost and guarantees performance. The workload is an essential SSD provisioning sources, thus analyzing the characteristics of the workloads would help optimize SSD management design. There is a significant correlation between the workload's IO patterns and the SSD cost and performance. Big data platforms with multi-stage architecture bring challenges into modeling IO patterns where each stage has it is unique IO patterns. Also, big data platforms run on a distributed environment where the workloads are interacting with local and remote storage during the execution. The designed HMM-based IO patterns model considers IO patterns for MapReduce workloads at different stages and different SSD locations. In this paper, we proposed a platform-level SSD, cost-efficiency controller. The controller is responsible for maximizing the SSD lifespan on the Hadoop platform through two phases. First, modeling MapReduce workload's IO patterns by employing the Hidden Markov Model (HMM). Then, defining platform-level SSD allocation policies. The designed allocation policies reduce SSD utilization and improve SSD lifespan on Hadoop by up to %40 compared to static allocation policies.
引用
收藏
页码:177 / 183
页数:7
相关论文
共 50 条
  • [1] An Analytical Approach to Evaluation of SSD Effects under MapReduce Workloads
    Ahn, Sungyong
    Park, Sangkyu
    JOURNAL OF SEMICONDUCTOR TECHNOLOGY AND SCIENCE, 2015, 15 (05) : 511 - 518
  • [2] Understanding Vertical Scalability of I/O Virtualization for MapReduce Workloads: Challenges and Opportunities
    Nicolae, Bogdan
    EURO-PAR 2013: PARALLEL PROCESSING WORKSHOPS, 2014, 8374 : 3 - 12
  • [3] Characterization of I/O Behaviors in Cloud Storage Workloads
    Zou, Qiang
    Zhu, Yifeng
    Chen, Jianxi
    Deng, Yuhui
    Qin, Xiao
    IEEE TRANSACTIONS ON COMPUTERS, 2023, 72 (10) : 2726 - 2739
  • [4] Is Performance of Object Storage Predictable for Serverless I/O Workloads? A Comparative Study
    Eizaguirre, German T.
    Sanchez-Artigas, Marc
    2023 IEEE 31ST INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS, ICNP, 2023,
  • [5] GREM: Dynamic SSD Resource Allocation In Virtualized Storage Systems With Heterogeneous IO Workloads
    Yang, Zhengyu
    Tai, Jianzhe
    Bhimani, Janki
    Wang, Jiayin
    Mi, Ningfang
    Sheng, Bo
    2016 IEEE 35TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2016,
  • [6] Re-evaluating Designs for Multi-Tenant OLTP Workloads on SSD-based I/O Subsystems
    Zhang, Ning
    Tatemura, Junichi
    Patel, Jignesh M.
    Hacigumus, Hakan
    SIGMOD'14: PROCEEDINGS OF THE 2014 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2014, : 1383 - 1394
  • [7] Optimized I/O Determinism for Emerging NVM-based NVMe SSD in an Enterprise System
    Kim, Seonbong
    Yang, Joon-Sung
    2018 55TH ACM/ESDA/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2018,
  • [8] Evaluating Reliability of SSD-Based I/O Caches in Enterprise Storage Systems
    Ahmadian, Saba
    Taheri, Farhad
    Asadi, Hossein
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2021, 9 (04) : 1914 - 1929
  • [9] Storage SLA Guarantee with Novel SSD I/O Scheduler in Virtualized Data Centers
    Park, Hyunchan
    Yoo, Seehwan
    Hong, Cheol-Ho
    Yoo, Chuck
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, 27 (08) : 2422 - 2434
  • [10] Analyzing Enterprise Storage Workloads With Graph Modeling and Clustering
    Zhou, Yang
    Liu, Ling
    Seshadri, Sangeetha
    Chiu, Lawrence
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2016, 34 (03) : 551 - 574