Computing infrastructure construction and optimization for high-performance computing and artificial intelligence

被引:5
|
作者
Su, Yun [1 ]
Zhou, Jipeng [1 ]
Ying, Jiangyong [1 ]
Zhou, Mingyao [1 ]
Zhou, Bin [1 ,2 ]
机构
[1] Huawei Technol Co Ltd, Shenzhen, Guangdong, Peoples R China
[2] Shandong Univ, Jinan, Shandong, Peoples R China
关键词
High-performance computing; Artificial intelligence; AI Processor; AI Computing Center; DEEP NEURAL-NETWORKS;
D O I
10.1007/s42514-021-00080-x
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The emergence of supercomputers has brought rapid development to human life and scientific research. Today, the new wave of artificial intelligence (AI) not only brings convenience to people's lives, but also changes the engineering and scientific high-performance computation. AI technologies provide more efficient and accurate computing methods for many fields. These ongoing changes pose new challenges to the design of computing infrastructures, which will be addressed in this survey in details. This survey first describes the distinguished progress of combining AI and high-performance computing (HPC) in scientific computation, analyzes several typical scenarios, and summarizes the characteristics of the corresponding requirements of computing resources. On this basis, this survey further lists four general methods for integrating AI computing with conventional HPC, as well as their key features and application scenarios. Finally, this survey introduces the design strategy of the Peng Cheng Cloud Brain II Supercomputing Center in improving AI computing capability and cluster communication efficiency, which helped it won the first place in the IO500 and AIPerf rankings.
引用
收藏
页码:331 / 343
页数:13
相关论文
共 50 条
  • [1] Computing infrastructure construction and optimization for high-performance computing and artificial intelligence
    Yun Su
    Jipeng Zhou
    Jiangyong Ying
    Mingyao Zhou
    Bin Zhou
    [J]. CCF Transactions on High Performance Computing, 2021, 3 : 331 - 343
  • [2] High-performance computing and artificial intelligence
    Ludwig, Thomas
    [J]. Informatik-Spektrum, 2023, 46 (03) : 129 - 130
  • [3] Accelerators for Artificial Intelligence and High-Performance Computing
    Milojicic, Dejan
    [J]. COMPUTER, 2020, 53 (02) : 14 - 22
  • [4] High-Performance Computing and Artificial Intelligence for Geosciences
    Wang, Yuzhu
    Jiang, Jinrong
    Wang, Yangang
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (13):
  • [5] Artificial Intelligence for Scientific Discovery at High-Performance Computing Scales
    Lee, Kin Long Kelvin
    Kumar, Nalini
    [J]. COMPUTER, 2023, 56 (04) : 116 - 122
  • [6] A secure communications infrastructure for high-performance distributed computing
    Foster, I
    Karonis, NT
    Kesselman, C
    Koenig, G
    Tuecke, S
    [J]. SIXTH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, PROCEEDINGS, 1997, : 125 - 136
  • [7] A continuous benchmarking infrastructure for high-performance computing applications
    Alt, Christoph
    Lanser, Martin
    Plewinski, Jonas
    Janki, Atin
    Klawonn, Axel
    Koestler, Harald
    Selzer, Michael
    Ruede, Ulrich
    [J]. INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2024, 39 (04) : 501 - 523
  • [8] Predicting Heterogeneity and Serverless Principles of Converged High-Performance Computing, Artificial Intelligence, and Workflows
    Bruel, Pedro
    Chalamalasetti, Sai Rahul
    Dhakal, Aditya
    Frachtenberg, Eitan
    Hogade, Ninad
    Enriquez, Rolando Pablo Hong
    Mishra, Alok
    Milojicic, Dejan
    Prakash, Pavana
    Rattihalli, Gourav
    [J]. COMPUTER, 2024, 57 (01) : 136 - 144
  • [9] TEPUI: High-Performance Computing Infrastructure for Beamlines at LNLS/Sirius
    Furusato, Fernando S.
    Sarmento, Matheus F.
    Aranha, Gustavo H. O.
    Zago, Luciano G.
    Miqueles, Eduardo X.
    [J]. HIGH PERFORMANCE COMPUTING, CARLA 2021, 2022, 1540 : 3 - 18
  • [10] TEPUI: High-Performance Computing Infrastructure for Beamlines at LNLS/Sirius
    Gitler, Isidoro
    Hernández, Carlos Jaime Barrios
    Meneses, Esteban
    Nesmachnow, Sergio
    Tchernykh, Andrei
    [J]. Communications in Computer and Information Science, 2022, 1540 CCIS