Predicting file lifetimes for data placement in multi-tiered storage systems for HPC

Authors
Thomas, Luis [1]
Gougeaud, Sebastien [2]
Rubini, Stephane [3]
Deniel, Philippe [2]
Boukhobza, Jalil [1]
Affiliations
[1] ENSTA Bretagne, Lab STICC, CNRS, UMR 6285, Brest, France
[2] CEA, Bruyeres Le Chatel, France
[3] Univ Brest, Lab STICC, CNRS, UMR 6285, Brest, France
Keywords
Data placement; Multi-Tier Storage; File lifetime; Convolutional Neural Network; Machine Learning; High Performance Computing; Heterogeneous Storage; Storage Hierarchy
DOI
10.1145/3439839.3458733
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
The emergence of Exascale machines in HPC is expected to put more pressure on existing storage systems, not only in terms of capacity but also in terms of bandwidth and latency. With a limited budget, it is not realistic to rely on storage class memory alone, which leads to the use of a heterogeneous tiered storage hierarchy. To make the most efficient use of the high performance tier in this hierarchy, user data must be placed on the right tier at the right time. In this paper, we assume a 2-tier storage hierarchy with a high performance tier and a high capacity archival tier. Files are placed on the high performance tier at creation time and moved to the capacity tier once their lifetime expires (that is, once they are no longer accessed). The main contribution of this paper is the design of a model that predicts a file's lifetime based solely on its path, using a Convolutional Neural Network. Results show that our solution strikes a good trade-off between accuracy and under-estimation. Our model reaches an accuracy close to that of previous work (around 98.60% compared to 98.84%) while reducing underestimations by almost 10x, to 2.21% (compared to 21.86%). The reduction in underestimations is crucial because it avoids misplacing files in the capacity tier while they are still in use.
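The placement policy and the notion of underestimation described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the `FileRecord` type, its fields, and the functions `tier_at` and `underestimation_rate` are hypothetical names introduced here, lifetimes are assumed to be measured in seconds from creation, and the predicted lifetime is taken as a given input rather than produced by a Convolutional Neural Network. The definition of underestimation used below (predicted lifetime shorter than the actual lifetime) is an assumption drawn from the abstract's wording.

```python
from dataclasses import dataclass


@dataclass
class FileRecord:
    """Hypothetical record for one file; all names are illustrative."""
    path: str
    created_at: float          # creation time, in seconds
    predicted_lifetime: float  # seconds, output of some predictor (assumed given)
    actual_lifetime: float     # seconds between creation and last access


def tier_at(record: FileRecord, now: float) -> str:
    """Return the tier a file sits on at time `now` under the 2-tier policy.

    Files start on the performance tier and move to the capacity tier
    once their predicted lifetime has expired.
    """
    expired = now >= record.created_at + record.predicted_lifetime
    return "capacity" if expired else "performance"


def underestimation_rate(records: list[FileRecord]) -> float:
    """Fraction of files whose predicted lifetime is shorter than the actual one.

    Such files would be moved to the capacity tier while still in use.
    """
    if not records:
        return 0.0
    under = sum(1 for r in records if r.predicted_lifetime < r.actual_lifetime)
    return under / len(records)
```

Under these assumptions, an underestimated file is exactly one for which `tier_at` returns `"capacity"` at some point before its last access, which is the misplacement the abstract identifies as costly.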
Pages: 99-107
Number of pages: 9