TensorFlow Doing HPC An Evaluation of TensorFlow Performance in HPC Applications

被引:10
|
作者
Chien, Steven W. D. [1 ]
Markidis, Stefano [1 ]
Olshevsky, Vyacheslav [1 ]
Bulatov, Yaroslav [2 ]
Laure, Erwin [1 ]
Vetter, Jeffrey S. [3 ]
机构
[1] KTH Royal Inst Technol, Stockholm, Sweden
[2] South Pk Commons, San Francisco, CA USA
[3] Oak Ridge Natl Lab, Oak Ridge, TN USA
关键词
TensorFlow; Emerging Programming Environments; Parallel Computing; Heterogeneous Supercomputers; HPC Applications;
D O I
10.1109/IPDPSW.2019.00092
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
TensorFlow is a popular emerging open-source programming framework supporting the execution of distributed applications on heterogeneous hardware. While TensorFlow has been initially designed for developing Machine Learning (ML) applications, in fact TensorFlow aims at supporting the development of a much broader range of application kinds that are outside the ML domain and can possibly include HPC applications. However, very few experiments have been conducted to evaluate TensorFlow performance when running HPC workloads on supercomputers. This work addresses this lack by designing four traditional HPC benchmark applications: STREAM, matrix-matrix multiply, Conjugate Gradient (CG) solver and Fast Fourier Transform (FFT). We analyze their performance on two supercomputers with accelerators and evaluate the potential of TensorFlow for developing HPC applications. Our tests show that TensorFlow can fully take advantage of high performance networks and accelerators on supercomputers. Running our Tensor-Flow STREAM benchmark, we obtain over 50% of theoretical communication bandwidth on our testing platform. We find an approximately 2x, 1.7x and 1.8x performance improvement when increasing the number of GPUs from two to four in the matrix-matrix multiply, CG and FFT applications respectively. All our performance results demonstrate that TensorFlow has high potential of emerging also as HPC programming framework for heterogeneous supercomputers.
引用
收藏
页码:509 / 518
页数:10
相关论文
共 50 条
  • [1] Performance analysis of different distribution of Python']Python and TensorFlow to efficiently utilize CPU on HPC Cluster
    Gupta, Krishan Gopal
    Maity, Samrit Kumar
    Das, Abhishek
    Wandhekar, Sanjay
    [J]. INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND ENERGY TECHNOLOGIES (ICECET 2021), 2021, : 1969 - 1974
  • [2] TensorFlow on state-of-the-art HPC clusters: a machine learning use case
    Ramirez-Gargallo, Guillem
    Garcia-Gasulla, Marta
    Mantovani, Filippo
    [J]. 2019 19TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2019, : 526 - 533
  • [3] Performance Evaluation of Hypervisors for HPC Applications
    Beserra, David
    Oliveira, Felipe
    Araujo, Jean
    Fernandes, Felipe
    Araujo, Alberto
    Endo, Patricia
    Maciel, Paulo
    Moreno, Edward David
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, : 846 - 851
  • [4] Evaluation of Performance Degradation in HPC Applications with VM Consolidation
    Hashimoto, Yuya
    Aida, Kento
    [J]. 2012 THIRD INTERNATIONAL CONFERENCE ON NETWORKING AND COMPUTING (ICNC 2012), 2012, : 273 - 277
  • [5] Performance modeling of HPC applications
    Snavely, A
    Lee, C
    Carrington, L
    Wolter, N
    Labarta, J
    Gimenez, J
    Jones, P
    [J]. PARALLEL COMPUTING: SOFTWARE TECHNOLOGY, ALGORITHMS, ARCHITECTURES AND APPLICATIONS, 2004, 13 : 777 - 784
  • [6] Performance Evaluation of Containers for HPC
    Ruiz, Cristian
    Jeanvoine, Emmanuel
    Nussbaum, Lucas
    [J]. EURO-PAR 2015: PARALLEL PROCESSING WORKSHOPS, 2015, 9523 : 813 - 824
  • [7] Fault Injection for TensorFlow Applications
    Narayanan, Niranjhana
    Chen, Zitao
    Fang, Bo
    Li, Guanpeng
    Pattabiraman, Karthik
    DeBardeleben, Nathan
    [J]. IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2023, 20 (04) : 2677 - 2695
  • [8] Performance and Energy Efficiency Evaluation for HPC Applications in Heterogeneous Architectures
    Kloh, Vinicius
    Yokoyama, Daniel
    Yokoyama, Andre
    Silva, Gabrieli
    Ferro, Mariza
    Schulze, Bruno
    [J]. 2018 SYMPOSIUM ON HIGH PERFORMANCE COMPUTING SYSTEMS (WSCAD 2018), 2018, : 162 - 169
  • [9] TensorFlow: A Vegetable Classification System and Its Performance Evaluation
    Ruedeeniraman, Natwadee
    Ikeda, Makoto
    Barolli, Leonard
    [J]. INNOVATIVE MOBILE AND INTERNET SERVICES IN UBIQUITOUS COMPUTING, IMIS-2019, 2020, 994 : 132 - 141
  • [10] Performance analysis of HPC applications in the cloud
    Exposito, Roberto R.
    Taboada, Guillermo L.
    Ramos, Sabela
    Tourino, Juan
    Doallo, Ramon
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2013, 29 (01): : 218 - 229