I/O Characterization and Performance Evaluation of BeeGFS for Deep Learning

被引:50
|
作者
Chowdhury, Fahim [1 ]
Zhu, Yue [1 ]
Heer, Todd [2 ]
Paredes, Saul [1 ]
Moody, Adam [2 ]
Goldstone, Robin [2 ]
Mohror, Kathryn [2 ]
Yu, Weikuan [1 ]
机构
[1] Florida State Univ, Tallahassee, FL 32306 USA
[2] Lawrence Livermore Natl Lab, Livermore, CA USA
来源
PROCEEDINGS OF THE 48TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP 2019) | 2019年
基金
美国国家科学基金会;
关键词
D O I
10.1145/3337821.3337902
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Parallel File Systems (PFSs) are frequently deployed on leadership High Performance Computing (HPC) systems to ensure efficient I/O, persistent storage and scalable performance. Emerging Deep Learning (DL) applications incur new I/O and storage requirements to HPC systems with batched input of small random files. This mandates PFSs to have commensurate features that can meet the needs of DL applications. BeeGFS is a recently emerging PFS that has grabbed the attention of the research and industry world because of its performance, scalability and ease of use. While emphasizing a systematic performance analysis of BeeGFS, in this paper, we present the architectural and system features of BeeGFS, and perform an experimental evaluation using cutting-edge I/O, Metadata and DL application benchmarks. Particularly, we have utilized AlexNet and ResNet-50 models for the classification of ImageNet dataset using the Livermore Big Artificial Neural Network Toolkit (LBANN), and ImageNet data reader pipeline atop TensorFlow and Horovod. Through extensive performance characterization of BeeGFS, our study provides a useful documentation on how to leverage BeeGFS for the emerging DL applications.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Performance Analysis and Characterization of Training Deep Learning Models on Mobile Device
    Liu, Jie
    Liu, Jiawen
    Du, Wan
    Li, Dong
    2019 IEEE 25TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2019, : 506 - 515
  • [42] Performance Evaluation of Collective Write Algorithms in MPI I/O
    Chaarawi, Mohamad
    Chandok, Suneet
    Gabriel, Edgar
    COMPUTATIONAL SCIENCE - ICCS 2009, PART I, 2009, 5544 : 185 - 194
  • [43] Parasitic extraction and performance evaluation of a high I/O package
    Subramanian, R
    Swaminathan, M
    Behar, M
    2ND ELECTRONICS PACKAGING TECHNOLOGY CONFERENCE, PROCEEDINGS, 1998, : 99 - 106
  • [44] PERFORMANCE EVALUATION OF MULTIPLE-DISK I/O SYSTEMS
    REDDY, ALN
    BANERJEE, P
    PROCEEDINGS OF THE 1989 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, VOL 1: ARCHITECTURE, 1989, : I315 - I318
  • [45] Evaluation of I/O Performance Regulating Function with a Virtual Machine
    Nagao, Takashi
    Tanabe, Nasanori
    Yokoyama, Kazutoshi
    Taniguchi, Hideo
    ADVANCES IN NETWORKED-BASED INFORMATION SYSTEMS, NBIS-2019, 2020, 1036 : 641 - 649
  • [46] An I/O Performance Evaluation of Varying CephFS Striping Patterns
    Biswas, Debasmita
    Neuwirth, Sarah
    Paul, Arnab K.
    Butt, Ali R.
    2023 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING WORKSHOPS, CLUSTER WORKSHOPS, 2023, : 25 - 31
  • [47] Performance modeling and evaluation of MPI-I/O on a cluster
    Barro, J
    Touriño, J
    Doallo, R
    Gulias, VM
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2002, 18 (05) : 825 - 836
  • [48] Considering I/O Processing in CloudSim for Performance and Energy Evaluation
    Ouarnoughi, Hamza
    Boukhobza, Jalil
    Singhoff, Frank
    Rubini, Stephane
    Kassis, Erwann
    HIGH PERFORMANCE COMPUTING, ISC HIGH PERFORMANCE 2016 INTERNATIONAL WORKSHOPS, 2016, 9945 : 591 - 603
  • [49] PERFORMANCE EVALUATION OF A PARALLEL I/O SUBSYSTEM FOR HYPERCUBE MULTICOMPUTERS
    GHOSH, J
    GOVEAS, KD
    DRAPER, JT
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1993, 17 (1-2) : 90 - 106
  • [50] Educational Evaluation of Piano Performance by the Deep Learning Neural Network Model
    Liao, Yuanyuan
    MOBILE INFORMATION SYSTEMS, 2022, 2022