Toward Parallel Modeling of Solidification Based on the Generalized Finite Difference Method Using Intel Xeon Phi

被引：5

作者：

Szustak, Lukasz ^{[1
]}

Halbiniak, Kamil ^{[1
]}

Kulawik, Adam ^{[1
]}

Wrobel, Joanna ^{[1
]}

Gepner, Pawel ^{[2
]}

机构：

[1] Czestochowa Tech Univ, Dabrowskiego 69, PL-42201 Czestochowa, Poland

[2] Intel Corp, Swindon, Wilts, England

来源：

PARALLEL PROCESSING AND APPLIED MATHEMATICS, PPAM 2015, PT I | 2016年 / 9573卷

关键词：

Intel Xeon Phi; Numerical model of solidification; Application porting; Optimization of data movements;

D O I：

10.1007/978-3-319-32149-3_39

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Modern heterogeneous computing platforms have become powerful HPC solutions, which could be applied for a wide range of applications. In particular, the hybrid platforms equipped with Intel Xeon Phi coprocessors offers performance advantages over conventional homogeneous solutions based on CPUs, while supporting practically the same parallel programming model. However, there is still an open issue how scientific applications can utilize efficiently the hybrid platforms equipped with Intel coprocessors. In this paper we propose a method for porting a real-life scientific application to computing platforms with Intel Xeon Phi. We focus on the parallel implementation of a numerical model of solidification, which is based on the generalized finite difference method. We develop a sequence of steps that are necessary for porting this application to platforms with accelerators, assuming no significant modifications of the code. The proposed method considers not only efficient data transfers that allow for overlapping computations with data movements, but also takes into account an adequate utilization of cores/threads and vector units. The developed approach allows us to execute the whole application 3.45 times faster than the original parallel version running on two CPUs.

引用

页码：411 / 422

页数：12

共 50 条

[1] Efficient Parallel Multigrid Method on Intel Xeon Phi Clusters
Nakajima, Kengo
Gerofi, Balazs
Ishikawa, Yutaka
Horikoshi, Masashi
PROCEEDINGS OF INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING IN ASIA-PACIFIC REGION WORKSHOPS (HPC ASIA 2021 WORKSHOPS), 2020, : 46 - 49
[2] Empirical Mode Decomposition for Modeling of Parallel Applications on Intel Xeon Phi Processors
Lawson, Gary
Sosonkina, Masha
Ezer, Tal
Shen, Yuzhong
2017 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2017, : 1000 - 1008
[3] Modeling Parallel Processing of Databases on the Central Processor Intel Xeon Phi KNL
Rekachinsky, A., I
Chulkevich, R. A.
Kostenetskiy, P. S.
2018 41ST INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2018, : 1605 - 1610
[4] Numerical modeling of casting solidification using generalized finite difference method
Mochnacki, Bohdan
Majchrzak, Ewa
THERMEC 2009, PTS 1-4, 2010, 638-642 : 2676 - 2681
[5] Parallel evolutionary approaches for game playing and verification using Intel Xeon Phi
Rodriguez, Sebastian
Parodi, Facundo
Nesmachnow, Sergio
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2019, 133 : 258 - 271
[6] Directive-based Auto-tuning for the Finite Difference Method on the Xeon Phi
Katagiri, Takahiro
Ohshima, Satoshi
Matsumoto, Masaharu
2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, 2015, : 1221 - 1230
[7] Tuning up TVD HOPMOC method on Intel MIC Xeon Phi Architectures with Intel Parallel Studio Tools
Cabral, Frederico L.
Osthoff, Carla
Costa, Gabriel P.
Brandao, Diego
Kischinhevsky, Mauricio
Gonzaga de Oliveira, Sanderson L.
2017 INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING WORKSHOPS (SBAC-PADW), 2017, : 19 - 24
[8] Optimized Parallel Label Propagation based Community Detection on the Intel® Xeon Phi™ Architecture
Khlopotine, Andrei B.
Sathanur, Arun V.
Jandhyala, Vikram
2015 27TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD), 2015, : 9 - 16
[9] Improvement of workload balancing using parallel loop self-scheduling on Intel Xeon Phi
Chao-Tung Yang
Chao-Wei Huang
Shuo-Tsung Chen
The Journal of Supercomputing, 2017, 73 : 4981 - 5005
[10] Improvement of workload balancing using parallel loop self-scheduling on Intel Xeon Phi
Yang, Chao-Tung
Huang, Chao-Wei
Chen, Shuo-Tsung
JOURNAL OF SUPERCOMPUTING, 2017, 73 (11): : 4981 - 5005

← 1 2 3 4 5 →