Efficient Parallel Multigrid Method on Intel Xeon Phi Clusters

被引:0
|
作者
Nakajima, Kengo [1 ]
Gerofi, Balazs [2 ]
Ishikawa, Yutaka [2 ]
Horikoshi, Masashi [3 ]
机构
[1] Univ Tokyo, Tokyo, Japan
[2] RIKEN, R CCS, Kobe, Hyogo, Japan
[3] Intel Corp, Tokyo, Japan
来源
PROCEEDINGS OF INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING IN ASIA-PACIFIC REGION WORKSHOPS (HPC ASIA 2021 WORKSHOPS) | 2020年
关键词
parallel iterative solvers; multigrid; SELL-C-sigma; light weight kernel;
D O I
10.1145/3440722.3440882
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The parallel multigrid method is expected to play an important role in scientific computing on exa-scale supercomputer systems for solving large-scale linear equations with sparse matrices. Because solving sparse linear systems is a very memory-bound process, efficient method for storage of coefficient matrices is a crucial issue. In the previous works, authors implemented sliced ELL method to parallel conjugate gradient solvers with multigrid preconditioning (MGCG) for the application on 3D groundwater flow through heterogeneous porous media (pGW3D-FVM), and excellent performance has been obtained on large-scale multicore/manycore clusters. In the present work, authors introduced SELL-C-sigma to the MGCG solver, and evaluated the performance of the solver with various types of OpenMP/MPI hybrid parallel programing models on the Oakforest-PACS (OFP) system at JCAHPC using up to 1,024 nodes of Intel Xeon Phi. Because SELL-C-sigma is suitable for wide-SIMD architecture, such as Xeon Phi, improvement of the performance over the sliced ELL was more than 20%. This is one of the first examples of SELL-C-sigma applied to forward/backward substitutions in ILU-type smoother of multigrid solver. Furthermore, effects of IHK/McKernel has been investigated, and it achieved 11% improvement on 1,024 nodes.
引用
收藏
页码:46 / 49
页数:4
相关论文
共 50 条
  • [41] Understanding Data Analytics Workloads on Intel®Xeon Phi™
    Xie, Biwei
    Liu, Xu
    Mckee, Sally A.
    Zhan, Jianfeng
    Jia, Zhen
    Wang, Lei
    Zhang, Lixin
    PROCEEDINGS OF 2016 IEEE 18TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS; IEEE 14TH INTERNATIONAL CONFERENCE ON SMART CITY; IEEE 2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2016, : 206 - 215
  • [42] Performance Evaluation of an OpenCL Implementation of the Lattice Boltzmann Method on the Intel Xeon Phi
    Obrecht, Christian
    Tourancheau, Bernard
    Kuznik, Frederic
    PARALLEL PROCESSING LETTERS, 2015, 25 (03)
  • [43] Intel Xeon Phi Coprocessor High Performance Programming
    More, Andres
    JOURNAL OF COMPUTER SCIENCE & TECHNOLOGY, 2013, 13 (02): : 105 - 106
  • [44] Offload Compiler Runtime for the Intel® Xeon Phi™ Coprocessor
    Newburn, Chris J.
    Deodhar, Rajiv
    Dmitriev, Serguei
    Murty, Ravi
    Narayanaswamy, Ravi
    Wiegert, John
    Chinchilla, Francisco
    McGuire, Russell
    SUPERCOMPUTING (ISC 2013), 2013, 7905 : 239 - 254
  • [45] A survey on evaluating and optimizing performance of Intel Xeon Phi
    Mittal, Sparsh
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (19):
  • [46] Intel® Xeon Phi™ coprocessor (codename Knights Corner)
    Chrysos, George
    2012 IEEE HOT CHIPS 24 SYMPOSIUM (HCS), 2012,
  • [47] Implementing Central Force Optimization on the Intel Xeon Phi
    Charest, Thomas
    Green, Robert C.
    2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2020), 2020, : 502 - 511
  • [48] Optimizing the MapReduce Framework on Intel Xeon Phi Coprocessor
    Lu, Mian
    Zhang, Lei
    Huynh Phung Huynh
    Ong, Zhongliang
    Liang, Yun
    He, Bingsheng
    Goh, Rick Siow Mong
    Richard Huynh
    2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [49] Effective Barrier Synchronization on Intel Xeon Phi Coprocessor
    Rodchenko, Andrey
    Nisbet, Andy
    Pop, Antoniu
    Lujan, Mikel
    EURO-PAR 2015: PARALLEL PROCESSING, 2015, 9233 : 588 - 600
  • [50] Analysis of Optimization opportunities for Intel® Xeon Phi™ and Intel® Xeon Scalable Processors™ Environments of a Numerical Method for the Biphasic Flow of Fluids in Porous Media
    Teixeira, Thiago
    Cabral, Frederico L.
    Osthoff, Carla
    Souto, Roberto P.
    Borges, Marcio R.
    2018 SYMPOSIUM ON HIGH PERFORMANCE COMPUTING SYSTEMS (WSCAD 2018), 2018, : 237 - 242