Understanding the Performance of Stencil Computations on Intel's Xeon Phi

被引：0

作者：

Peraza, Joshua ^{[1
]}

Tiwari, Ananta ^{[2
]}

Laurenzano, Michael ^{[2
]}

Carrington, Laura ^{[2
]}

Ward, William A. ^{[3
]}

Campbell, Roy ^{[3
]}

机构：

[1] Univ Calif San Diego, San Diego, CA 92103 USA

[2] San Diego Supercomp Ctr, La Jolla, CA 92093 USA

[3] United States Dept Def, High Performance Comp Modernizat Program, La Jolla, CA 92093 USA

来源：

2013 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER) | 2013年

关键词：

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Accelerators are becoming prevalent in high performance computing as a way of achieving increased computational capacity within a smaller power budget. Effectively utilizing the raw compute capacity made available by these systems, however, remains a challenge because it can require a substantial investment of programmer time to port and optimize code to effectively use novel accelerator hardware. In this paper we present a methodology for isolating and modeling the performance of common performance-critical patterns of code (so-called idioms) and other relevant behavioral characteristics from large scale HPC applications which are likely to perform favorably on Intel Xeon Phi. The benefits of the methodology are twofold: (1) it directs programmer efforts toward the regions of code most likely to benefit from porting to the Xeon Phi and (2) provides speedup estimates for porting those regions of code. We then apply the methodology to the stencil idiom, showing performance improvements of up to a factor of 4.7x on stencil-based benchmark codes.

引用

页数：5

共 50 条

[41] Performance Evaluation of an OpenCL Implementation of the Lattice Boltzmann Method on the Intel Xeon Phi
Obrecht, Christian
Tourancheau, Bernard
Kuznik, Frederic
PARALLEL PROCESSING LETTERS, 2015, 25 (03)
[42] Offloading strategies for Stencil kernels on the KNC Xeon Phi architecture: Accuracy versus performance
Hernandez, Mario
Cebrian, Juan M.
Cecilia, Jose M.
Garcia, Jose M.
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2020, 34 (02): : 199 - 207
[43] Code modernization strategies to 3-D Stencil-based applications on Intel Xeon Phi: KNC and KNL
Cebrian, Juan M.
Cecilia, Jose M.
Hernandez, Mario
Garcia, Jose M.
COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2017, 74 (10) : 2557 - 2571
[44] Offload Compiler Runtime for the Intel® Xeon Phi™ Coprocessor
Newburn, Chris J.
Deodhar, Rajiv
Dmitriev, Serguei
Murty, Ravi
Narayanaswamy, Ravi
Wiegert, John
Chinchilla, Francisco
McGuire, Russell
SUPERCOMPUTING (ISC 2013), 2013, 7905 : 239 - 254
[45] Intel® Xeon Phi™ coprocessor (codename Knights Corner)
Chrysos, George
2012 IEEE HOT CHIPS 24 SYMPOSIUM (HCS), 2012,
[46] Implementing Central Force Optimization on the Intel Xeon Phi
Charest, Thomas
Green, Robert C.
2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2020), 2020, : 502 - 511
[47] Optimizing the MapReduce Framework on Intel Xeon Phi Coprocessor
Lu, Mian
Zhang, Lei
Huynh Phung Huynh
Ong, Zhongliang
Liang, Yun
He, Bingsheng
Goh, Rick Siow Mong
Richard Huynh
2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
[48] Effective Barrier Synchronization on Intel Xeon Phi Coprocessor
Rodchenko, Andrey
Nisbet, Andy
Pop, Antoniu
Lujan, Mikel
EURO-PAR 2015: PARALLEL PROCESSING, 2015, 9233 : 588 - 600
[49] Bent Functions Synthesis on Intel Xeon Phi Coprocessor
Hrbacek, Radek
MATHEMATICAL AND ENGINEERING METHODS IN COMPUTER SCIENCE, MEMICS 2014, 2014, 8934 : 88 - 99
[50] Retargeting of the Open Community Runtime to Intel Xeon Phi
Dokulil, Jiri
Benkner, Siegfried
INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2015 COMPUTATIONAL SCIENCE AT THE GATES OF NATURE, 2015, 51 : 1453 - 1462

← 1 2 3 4 5 →