Predicting file lifetimes for data placement in multi-tiered storage systems for HPC

Authors
Thomas, Luis [1]
Gougeaud, Sebastien [2]
Rubini, Stephane [3]
Deniel, Philippe [2]
Boukhobza, Jalil [1]
Affiliations
[1] ENSTA Bretagne, Lab STICC, CNRS, UMR 6285, Brest, France
[2] CEA, Bruyeres Le Chatel, France
[3] Univ Brest, Lab STICC, CNRS, UMR 6285, Brest, France
Keywords
Data placement; Multi-Tier Storage; File lifetime; Convolutional Neural Network; Machine Learning; High Performance Computing; Heterogeneous Storage; Storage Hierarchy
DOI
10.1145/3439839.3458733
Chinese Library Classification number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
The emergence of Exascale machines in HPC is expected to put more pressure on existing storage systems, not only in terms of capacity but also in terms of bandwidth and latency. With a limited budget, relying only on storage class memory is not realistic, which leads to the use of a heterogeneous tiered storage hierarchy. To make the most efficient use of the high performance tier in this hierarchy, user data must be placed on the right tier at the right time. In this paper, we assume a 2-tier storage hierarchy with a high performance tier and a high capacity archival tier. Files are placed on the high performance tier at creation time and moved to the capacity tier once their lifetime expires (that is, once they are no longer accessed). The main contribution of this paper is the design of a file lifetime prediction model that relies solely on the file's path and is built on a Convolutional Neural Network. Results show that our solution strikes a good trade-off between accuracy and underestimation. Our model reaches an accuracy close to that of previous work (around 98.60% compared to 98.84%) while reducing underestimations by almost 10x, to 2.21% (compared to 21.86%). This reduction is crucial because it avoids moving files to the capacity tier while they are still in use.
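The placement rule and the underestimation measure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `FileRecord` type, the `tier_at` and `underestimation_rate` helpers, and the example paths are hypothetical, the CNN predictor itself is not modeled (its output is simply supplied as `predicted_lifetime`), and counting an underestimation whenever the predicted lifetime is shorter than the observed one is an assumption, since the abstract does not give the exact metric definition.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class FileRecord:
    path: str                  # hypothetical file path, the model's only input
    created_at: float          # creation time, in seconds
    predicted_lifetime: float  # seconds; stands in for the predictor's output
    actual_lifetime: float     # seconds between creation and last access


def tier_at(record: FileRecord, now: float) -> str:
    """Return the tier a file occupies at time `now` under the 2-tier policy.

    A file stays on the performance tier until its predicted lifetime
    expires, then it is moved to the capacity tier.
    """
    if now - record.created_at < record.predicted_lifetime:
        return "performance"
    return "capacity"


def underestimation_rate(records: List[FileRecord]) -> float:
    """Fraction of files whose predicted lifetime is shorter than the actual one.

    Assumed definition: such files would be moved to the capacity tier
    while they are still being accessed.
    """
    if not records:
        return 0.0
    under = sum(1 for r in records if r.predicted_lifetime < r.actual_lifetime)
    return under / len(records)


# Illustrative data only; not taken from the paper.
records = [
    FileRecord("/scratch/job1/out.dat", 0.0, 100.0, 50.0),  # overestimated
    FileRecord("/scratch/job2/tmp.chk", 0.0, 10.0, 50.0),   # underestimated
]
print(tier_at(records[1], now=20.0))
print(underestimation_rate(records))
```

In this sketch the second file is moved to the capacity tier at 10 seconds even though it is accessed until 50 seconds, which is the kind of misplacement that a lower underestimation rate avoids.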
Pages: 99-107
Number of pages: 9