Detecting I/O Access Patterns of HPC Workloads at Runtime

被引:8
|
作者
Bez, Jean Luca [1 ]
Boito, Francieli Zanon [2 ]
Nou, Ramon [3 ]
Miranda, Alberto [3 ]
Cortes, Toni [3 ,4 ]
Navaux, Philippe O. A. [1 ]
机构
[1] Univ Fed Rio Grande do Sul, Inst Informat, Porto Alegre, RS, Brazil
[2] Univ Grenoble Alpes, INRIA, CNRS, Grenoble INP,LIG, F-38000 Grenoble, France
[3] BSC, Barcelona, Spain
[4] Univ Politecn Cataluna, Barcelona, Spain
基金
欧盟地平线“2020”;
关键词
high-performance computing; parallel I/O; access pattern detection; I/O forwarding; classification;
D O I
10.1109/SBAC-PAD.2019.00025
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we seek to guide optimization and tuning strategies by identifying the application's I/O access pattern. We evaluate three machine learning techniques to automatically detect the I/O access pattern of HPC applications at runtime: decision trees, random forests, and neural networks. We focus on the detection using metrics from file-level accesses as seen by the clients, I/O nodes, and parallel file system servers. We evaluated these detection strategies in a case study in which the accurate detection of the current access pattern is fundamental to adjust a parameter of an I/O scheduling algorithm. We demonstrate that such approaches correctly classify the access pattern, regarding file layout and spatiality of accesses - into the most common ones used by the community and by I/O benchmarking tools to test new I/O optimization - with up to 99% precision. Furthermore, when applied to our study case, it guides a tuning mechanism to achieve 99% of the performance of an Oracle solution.
引用
收藏
页码:80 / 87
页数:8
相关论文
共 50 条
  • [1] Extracting and characterizing I/O behavior of HPC workloads
    Devarajan, Hariharan
    Mohror, Kathryn
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER 2022), 2022, : 243 - 255
  • [2] An I/O Analysis of HPC Workloads on CephFS and Lustre
    Chiusole, Alberto
    Cozzini, Stefano
    van der Ster, Daniel
    Lamanna, Massimo
    Giuliani, Graziano
    [J]. HIGH PERFORMANCE COMPUTING: ISC HIGH PERFORMANCE 2019 INTERNATIONAL WORKSHOPS, 2020, 11887 : 300 - 316
  • [3] Replicating HPC I/O Workloads With Proxy Applications
    Dickson, James
    Wright, Steven
    Maheswaran, Satheesh
    Herdman, Andy
    Miller, Mark C.
    Jarvis, Stephen
    [J]. PROCEEDINGS OF PDSW-DISCS 2016 - 1ST JOINT INTERNATIONAL WORKSHOP ON PARALLEL DATA STORAGE AND DATA INTENSIVE SCALABLE COMPUTING SYSTEMS, 2016, : 13 - 18
  • [4] IOPin: Runtime Profiling of Parallel I/O in HPC Systems
    Kim, Seong Jo
    Son, Seung Woo
    Liao, Wei-keng
    Kandemir, Mahmut
    Thakur, Rajeev
    Choudhary, Alok
    [J]. 2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 18 - 23
  • [5] I/O Access Patterns in HPC Applications: A 360-Degree Survey
    Bez, Jean Luca
    Byna, Suren
    Ibrahim, Shadi
    [J]. ACM COMPUTING SURVEYS, 2024, 56 (02)
  • [6] Parallel I/O Evaluation Techniques and Emerging HPC Workloads: A Perspective
    Neuwirth, Sarah
    Paul, Arnab K.
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER 2021), 2021, : 671 - 679
  • [7] Characterizing I/O Workloads of HPC Applications Through Online Analysis
    Dong, Wenrui
    Liu, Guangming
    Yu, Jie
    Zuo, You
    [J]. 2015 IEEE 34TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2015,
  • [8] Towards I/O analysis of HPC systems and a generic architecture to collect access patterns
    Wiedemann, Marc C.
    Kunkel, Julian M.
    Zimmer, Michaela
    Ludwig, Thomas
    Resch, Michael
    Boenisch, Thomas
    Wang, Xuan
    Chut, Andriy
    Aguilera, Alvaro
    Nagel, Wolfgang E.
    Kluge, Michael
    Mickler, Holger
    [J]. COMPUTER SCIENCE-RESEARCH AND DEVELOPMENT, 2013, 28 (2-3): : 241 - 251
  • [9] Characterizing Machine Learning I/O Workloads on Leadership Scale HPC Systems
    Paul, Arnab K.
    Karimi, Ahmad Maroof
    Wang, Feiyi
    [J]. 29TH INTERNATIONAL SYMPOSIUM ON THE MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS 2021), 2021, : 198 - 205
  • [10] Does Varying BeeGFS Configuration Affect the I/O Performance of HPC Workloads?
    Borkar, Arnav
    Tony, Joel
    Vamsi, Hari K. N.
    Barman, Tushar
    Bhisikar, Yash
    Sreenath, T. M.
    Paul, Arnab K.
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING WORKSHOPS, CLUSTER WORKSHOPS, 2023, : 5 - 7