HPC Software for Massive Analysis of the Parallel Efficiency of Applications

被引:2
|
作者
Shvets, Pavel [1 ]
Voevodin, Vadim [1 ]
Zhumatiy, Sergey [1 ]
机构
[1] Lomonosov Moscow State Univ, Res Comp Ctr, Leninskie Gory 1,Bld 4, Moscow, Russia
基金
俄罗斯科学基金会;
关键词
Supercomputing; Efficiency analysis; Parallel program; Massive analysis; Application performance;
D O I
10.1007/978-3-030-28163-2_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Efficiency is a major weakness in modern supercomputers. Low efficiency of user applications is one of the main reasons for that. There are many software tools for analyzing and improving the performance of parallel applications. However, supercomputer users often do not have sufficient knowledge and skills to apply these tools correctly in their specific case. Moreover, users often do not know that their applications work inefficiently. The main goal of our project is to help any HPC user to detect performance flaws in their applications and find out how to deal with them. To this end, we plan to develop an open-source software solution that performs automatic massive analysis of all jobs running on a supercomputer to identify those with efficiency issues and helps users to conduct a detailed analysis of an individual program (using existing software tools) to identify and eliminate the root causes of the loss of efficiency.
引用
收藏
页码:3 / 18
页数:16
相关论文
共 50 条
  • [41] Towards a HPC-oriented parallel implementation of a learning algorithm for bioinformatics applications
    D'Angelo, Gianni
    Rampone, Salvatore
    BMC BIOINFORMATICS, 2014, 15
  • [42] Third international workshop on software engineering for high performance computing (HPC) applications
    Carver, Jeffrey C.
    29th International Conference on Software Engineering: ICSE 2007 Companion Volume, Proceedings, 2007, : 147 - 147
  • [43] Identifying Representative Regions of Parallel HPC Applications: a Cross-architectural Evaluation
    Ferreron, Alexandra
    Jagtap, Radhika
    Rusitoru, Roxana
    PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION, 2016, : 223 - 224
  • [44] Leveraging Comprehensive Data Analysis to Inform Parallel HPC Workloads
    Dwyer, Matthew
    Kaff, Nicole
    Cohen, Jacob
    Frauenhoffer, Michael
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 3960 - 3967
  • [45] CRC-based Memory Reliability for Task-parallel HPC Applications
    Subasi, Omer
    Unsal, Osman
    Labarta, Jesus
    Yalcin, Gulay
    Cristal, Adrian
    2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2016), 2016, : 1101 - 1112
  • [46] BeeSwarm: Enabling Parallel Scaling Performance Measurement in Continuous Integration for HPC Applications
    Tronge, Jake
    Chen, Jieyang
    Grubel, Patricia
    Randles, Tim
    Davis, Rusty
    Wofford, Quincy
    Anaya, Steven
    Guan, Qiang
    2021 36TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING ASE 2021, 2021, : 1136 - 1140
  • [47] A Parallel Algorithm for Energy Efficiency Maximization in Massive MIMO Networks
    Yang, Yang
    Pesavento, Marius
    2016 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2016,
  • [48] Massive Parallel Algorithms for Software GNSS Signal Simulation using GPU
    Bartunkova, Iva
    Eissfeller, Bernd
    PROCEEDINGS OF THE 25TH INTERNATIONAL TECHNICAL MEETING OF THE SATELLITE DIVISION OF THE INSTITUTE OF NAVIGATION (ION GNSS 2012), 2012, : 118 - 126
  • [49] A Secured Large Heterogeneous HPC Cluster System using Massive Parallel Programming Model with Accelerated GPUs
    Alsubhi, Khalid
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (05) : 370 - 376
  • [50] Analysis and Simulation of HPC Applications in Virtualized Data Centers
    Takouna, Ibrahim
    Dawoud, Wesam
    Meinel, Christoph
    2012 IEEE INTERNATIONAL CONFERENCE ON GREEN COMPUTING AND COMMUNICATIONS, CONFERENCE ON INTERNET OF THINGS, AND CONFERENCE ON CYBER, PHYSICAL AND SOCIAL COMPUTING (GREENCOM 2012), 2012, : 498 - 507