Efficient Parallel Multigrid Method on Intel Xeon Phi Clusters

Authors
Nakajima, Kengo [1]
Gerofi, Balazs [2]
Ishikawa, Yutaka [2]
Horikoshi, Masashi [3]
Affiliations
[1] Univ Tokyo, Tokyo, Japan
[2] RIKEN, R-CCS, Kobe, Hyogo, Japan
[3] Intel Corp, Tokyo, Japan
Source
PROCEEDINGS OF INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING IN ASIA-PACIFIC REGION WORKSHOPS (HPC ASIA 2021 WORKSHOPS) | 2020
Keywords
parallel iterative solvers; multigrid; SELL-C-sigma; light weight kernel
DOI
10.1145/3440722.3440882
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The parallel multigrid method is expected to play an important role in scientific computing on exa-scale supercomputer systems for solving large-scale linear equations with sparse matrices. Because solving sparse linear systems is a strongly memory-bound process, an efficient storage format for coefficient matrices is crucial. In previous work, the authors applied the sliced ELL format to parallel conjugate gradient solvers with multigrid preconditioning (MGCG) for an application to 3D groundwater flow through heterogeneous porous media (pGW3D-FVM), and obtained excellent performance on large-scale multicore/manycore clusters. In the present work, the authors introduce SELL-C-sigma to the MGCG solver and evaluate its performance with various types of OpenMP/MPI hybrid parallel programming models on the Oakforest-PACS (OFP) system at JCAHPC, using up to 1,024 nodes of Intel Xeon Phi. Because SELL-C-sigma is well suited to wide-SIMD architectures such as Xeon Phi, performance improved by more than 20% over sliced ELL. This is one of the first examples of SELL-C-sigma applied to the forward/backward substitutions in the ILU-type smoother of a multigrid solver. Furthermore, the effect of IHK/McKernel was investigated, and it yielded an 11% improvement on 1,024 nodes.
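For readers unfamiliar with the storage format named in the abstract, the following is a minimal sketch of the general SELL-C-sigma idea, not the authors' implementation. Rows are sorted by nonzero count within windows of sigma rows, grouped into chunks of C rows, padded to the longest row in each chunk, and stored column by column so that one SIMD instruction can process C rows at once. The function names `to_sell_c_sigma` and `spmv`, the list-of-pairs input layout, and the use of column index 0 for padding are all assumptions made for illustration. The sketch shows only a sparse matrix-vector product; the forward/backward substitutions discussed in the abstract additionally involve data dependencies between rows, which are not modeled here.

```python
def to_sell_c_sigma(rows, C, sigma):
    """Convert a sparse matrix to a SELL-C-sigma-like layout (illustrative).

    rows  : list with one entry per matrix row; each entry is a list of
            (column_index, value) pairs (hypothetical input layout).
    C     : chunk height, typically the SIMD width.
    sigma : sorting window, typically a multiple of C.
    """
    n = len(rows)
    # Sort rows by descending nonzero count inside each window of sigma rows.
    perm = []
    for start in range(0, n, sigma):
        window = list(range(start, min(start + sigma, n)))
        window.sort(key=lambda i: -len(rows[i]))
        perm.extend(window)

    chunks = []
    for start in range(0, n, C):
        ids = perm[start:start + C]
        width = max(len(rows[i]) for i in ids)
        # Column-major storage: the j-th entry of every row in the chunk is
        # contiguous. Short rows are padded with (column 0, value 0.0).
        cols = [[rows[i][j][0] if j < len(rows[i]) else 0 for i in ids]
                for j in range(width)]
        vals = [[rows[i][j][1] if j < len(rows[i]) else 0.0 for i in ids]
                for j in range(width)]
        chunks.append((ids, cols, vals))
    return chunks


def spmv(chunks, x, n):
    """Compute y = A x from the chunked layout; y is in original row order."""
    y = [0.0] * n
    for ids, cols, vals in chunks:
        for col_j, val_j in zip(cols, vals):
            # This inner loop over the rows of a chunk is the part that maps
            # onto SIMD lanes on a wide-vector processor.
            for lane, i in enumerate(ids):
                y[i] += val_j[lane] * x[col_j[lane]]
    return y
```

Sorting within a limited window keeps rows of similar length in the same chunk, which reduces padding, while rows are not moved far from their original position.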
Pages: 46-49
Number of pages: 4