Toward Parallel Modeling of Solidification Based on the Generalized Finite Difference Method Using Intel Xeon Phi

被引:5
|
作者
Szustak, Lukasz [1 ]
Halbiniak, Kamil [1 ]
Kulawik, Adam [1 ]
Wrobel, Joanna [1 ]
Gepner, Pawel [2 ]
机构
[1] Czestochowa Tech Univ, Dabrowskiego 69, PL-42201 Czestochowa, Poland
[2] Intel Corp, Swindon, Wilts, England
来源
PARALLEL PROCESSING AND APPLIED MATHEMATICS, PPAM 2015, PT I | 2016年 / 9573卷
关键词
Intel Xeon Phi; Numerical model of solidification; Application porting; Optimization of data movements;
D O I
10.1007/978-3-319-32149-3_39
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Modern heterogeneous computing platforms have become powerful HPC solutions, which could be applied for a wide range of applications. In particular, the hybrid platforms equipped with Intel Xeon Phi coprocessors offers performance advantages over conventional homogeneous solutions based on CPUs, while supporting practically the same parallel programming model. However, there is still an open issue how scientific applications can utilize efficiently the hybrid platforms equipped with Intel coprocessors. In this paper we propose a method for porting a real-life scientific application to computing platforms with Intel Xeon Phi. We focus on the parallel implementation of a numerical model of solidification, which is based on the generalized finite difference method. We develop a sequence of steps that are necessary for porting this application to platforms with accelerators, assuming no significant modifications of the code. The proposed method considers not only efficient data transfers that allow for overlapping computations with data movements, but also takes into account an adequate utilization of cores/threads and vector units. The developed approach allows us to execute the whole application 3.45 times faster than the original parallel version running on two CPUs.
引用
收藏
页码:411 / 422
页数:12
相关论文
共 50 条
  • [1] Efficient Parallel Multigrid Method on Intel Xeon Phi Clusters
    Nakajima, Kengo
    Gerofi, Balazs
    Ishikawa, Yutaka
    Horikoshi, Masashi
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING IN ASIA-PACIFIC REGION WORKSHOPS (HPC ASIA 2021 WORKSHOPS), 2020, : 46 - 49
  • [2] Empirical Mode Decomposition for Modeling of Parallel Applications on Intel Xeon Phi Processors
    Lawson, Gary
    Sosonkina, Masha
    Ezer, Tal
    Shen, Yuzhong
    2017 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2017, : 1000 - 1008
  • [3] Modeling Parallel Processing of Databases on the Central Processor Intel Xeon Phi KNL
    Rekachinsky, A., I
    Chulkevich, R. A.
    Kostenetskiy, P. S.
    2018 41ST INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2018, : 1605 - 1610
  • [4] Numerical modeling of casting solidification using generalized finite difference method
    Mochnacki, Bohdan
    Majchrzak, Ewa
    THERMEC 2009, PTS 1-4, 2010, 638-642 : 2676 - 2681
  • [5] Parallel evolutionary approaches for game playing and verification using Intel Xeon Phi
    Rodriguez, Sebastian
    Parodi, Facundo
    Nesmachnow, Sergio
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2019, 133 : 258 - 271
  • [6] Directive-based Auto-tuning for the Finite Difference Method on the Xeon Phi
    Katagiri, Takahiro
    Ohshima, Satoshi
    Matsumoto, Masaharu
    2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, 2015, : 1221 - 1230
  • [7] Tuning up TVD HOPMOC method on Intel MIC Xeon Phi Architectures with Intel Parallel Studio Tools
    Cabral, Frederico L.
    Osthoff, Carla
    Costa, Gabriel P.
    Brandao, Diego
    Kischinhevsky, Mauricio
    Gonzaga de Oliveira, Sanderson L.
    2017 INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING WORKSHOPS (SBAC-PADW), 2017, : 19 - 24
  • [8] Optimized Parallel Label Propagation based Community Detection on the Intel® Xeon Phi™ Architecture
    Khlopotine, Andrei B.
    Sathanur, Arun V.
    Jandhyala, Vikram
    2015 27TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD), 2015, : 9 - 16
  • [9] Improvement of workload balancing using parallel loop self-scheduling on Intel Xeon Phi
    Chao-Tung Yang
    Chao-Wei Huang
    Shuo-Tsung Chen
    The Journal of Supercomputing, 2017, 73 : 4981 - 5005
  • [10] Improvement of workload balancing using parallel loop self-scheduling on Intel Xeon Phi
    Yang, Chao-Tung
    Huang, Chao-Wei
    Chen, Shuo-Tsung
    JOURNAL OF SUPERCOMPUTING, 2017, 73 (11): : 4981 - 5005