Efficient Parallel Multigrid Method on Intel Xeon Phi Clusters

被引：0

作者：

Nakajima, Kengo ^{[1
]}

Gerofi, Balazs ^{[2
]}

Ishikawa, Yutaka ^{[2
]}

Horikoshi, Masashi ^{[3
]}

机构：

[1] Univ Tokyo, Tokyo, Japan

[2] RIKEN, R CCS, Kobe, Hyogo, Japan

[3] Intel Corp, Tokyo, Japan

来源：

PROCEEDINGS OF INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING IN ASIA-PACIFIC REGION WORKSHOPS (HPC ASIA 2021 WORKSHOPS) | 2020年

关键词：

parallel iterative solvers; multigrid; SELL-C-sigma; light weight kernel;

D O I：

10.1145/3440722.3440882

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

The parallel multigrid method is expected to play an important role in scientific computing on exa-scale supercomputer systems for solving large-scale linear equations with sparse matrices. Because solving sparse linear systems is a very memory-bound process, efficient method for storage of coefficient matrices is a crucial issue. In the previous works, authors implemented sliced ELL method to parallel conjugate gradient solvers with multigrid preconditioning (MGCG) for the application on 3D groundwater flow through heterogeneous porous media (pGW3D-FVM), and excellent performance has been obtained on large-scale multicore/manycore clusters. In the present work, authors introduced SELL-C-sigma to the MGCG solver, and evaluated the performance of the solver with various types of OpenMP/MPI hybrid parallel programing models on the Oakforest-PACS (OFP) system at JCAHPC using up to 1,024 nodes of Intel Xeon Phi. Because SELL-C-sigma is suitable for wide-SIMD architecture, such as Xeon Phi, improvement of the performance over the sliced ELL was more than 20%. This is one of the first examples of SELL-C-sigma applied to forward/backward substitutions in ILU-type smoother of multigrid solver. Furthermore, effects of IHK/McKernel has been investigated, and it achieved 11% improvement on 1,024 nodes.

引用

页码：46 / 49

页数：4

共 50 条

[21] Modeling Parallel Processing of Databases on the Central Processor Intel Xeon Phi KNL
Rekachinsky, A., I
Chulkevich, R. A.
Kostenetskiy, P. S.
2018 41ST INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2018, : 1605 - 1610
[22] Parallel evolutionary approaches for game playing and verification using Intel Xeon Phi
Rodriguez, Sebastian
Parodi, Facundo
Nesmachnow, Sergio
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2019, 133 : 258 - 271
[23] Evaluation of Rodinia Codes on Intel Xeon Phi
Misra, Goldi
Kurkure, Nisha
Das, Abhishek
Valmiki, Manjunatha
Das, Shweta
Gupta, Abhinav
FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, MODELLING AND SIMULATION (ISMS 2013), 2013, : 415 - 419
[24] Lattice QCD on Intel® Xeon Phi™ Coprocessors
Joo, Balint
Kalamkar, Dhiraj D.
Vaidyanathan, Karthikeyan
Smelyanskiy, Mikhail
Pamnany, Kiran
Lee, Victor W.
Dubey, Pradeep
Watson, William, III
SUPERCOMPUTING (ISC 2013), 2013, 7905 : 40 - 54
[25] Porting to the Intel Xeon Phi: Opportunities and Challenges
Rosales, C.
2013 EXTREME SCALING WORKSHOP (XSW 2013), 2014, : 1 - 7
[26] Improving Communication Performance and Scalability of Native Applications on Intel® Xeon Phi™ Coprocessor Clusters
Vaidyanathan, Karthikeyan
Pamnany, Kiran
Kalamkar, Dhiraj D.
Heinecke, Alexander
Smelyanskiy, Mikhail
Park, Jongsoo
Kim, Daehyun
Shet, Aniruddha G.
Kaul, Bharat
Joo, Balint
Dubey, Pradeep
2014 IEEE 28TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, 2014,
[27] Exploring SIMD for Molecular Dynamics, Using Intel®Xeon®Processors and Intel®Xeon Phi™ Coprocessors
Pennycook, S. J.
Hughes, C. J.
Smelyanskiy, M.
Jarvis, S. A.
IEEE 27TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2013), 2013, : 1085 - 1097
[28] Behavior of MDynaMix on Intel Xeon Phi Coprocessor
Valmiki, Manjunatha
Kurkure, Nisha
Das, Shweta
Dinde, Prashant
Deepu, C., V
Misra, Goldi
Sinha, Pradeep
2013 FIRST INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, MODELLING AND SIMULATION (AIMS 2013), 2013, : 387 - 392
[29] Optimizing Performance of ROMS on Intel Xeon Phi
Bhaskaran, Gopal
Gaurav, Pratyush
INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2015 COMPUTATIONAL SCIENCE AT THE GATES OF NATURE, 2015, 51 : 2854 - 2858
[30] Biosequence Analysis using Intel® Xeon Phi
Sinha, Pradeep
Misra, Goldi
Vikraman, Deepu
Das, Abhishek
Desai, Shraddha
Pawar, Sucheta
Shewale, Kalyani
UKSIM-AMSS SEVENTH EUROPEAN MODELLING SYMPOSIUM ON COMPUTER MODELLING AND SIMULATION (EMS 2013), 2013, : 497 - 499

← 1 2 3 4 5 →