Understanding the Performance of Stencil Computations on Intel's Xeon Phi

被引:0
|
作者
Peraza, Joshua [1 ]
Tiwari, Ananta [2 ]
Laurenzano, Michael [2 ]
Carrington, Laura [2 ]
Ward, William A. [3 ]
Campbell, Roy [3 ]
机构
[1] Univ Calif San Diego, San Diego, CA 92103 USA
[2] San Diego Supercomp Ctr, La Jolla, CA 92093 USA
[3] United States Dept Def, High Performance Comp Modernizat Program, La Jolla, CA 92093 USA
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Accelerators are becoming prevalent in high performance computing as a way of achieving increased computational capacity within a smaller power budget. Effectively utilizing the raw compute capacity made available by these systems, however, remains a challenge because it can require a substantial investment of programmer time to port and optimize code to effectively use novel accelerator hardware. In this paper we present a methodology for isolating and modeling the performance of common performance-critical patterns of code (so-called idioms) and other relevant behavioral characteristics from large scale HPC applications which are likely to perform favorably on Intel Xeon Phi. The benefits of the methodology are twofold: (1) it directs programmer efforts toward the regions of code most likely to benefit from porting to the Xeon Phi and (2) provides speedup estimates for porting those regions of code. We then apply the methodology to the stencil idiom, showing performance improvements of up to a factor of 4.7x on stencil-based benchmark codes.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] Performance Evaluation of an OpenCL Implementation of the Lattice Boltzmann Method on the Intel Xeon Phi
    Obrecht, Christian
    Tourancheau, Bernard
    Kuznik, Frederic
    PARALLEL PROCESSING LETTERS, 2015, 25 (03)
  • [42] Offloading strategies for Stencil kernels on the KNC Xeon Phi architecture: Accuracy versus performance
    Hernandez, Mario
    Cebrian, Juan M.
    Cecilia, Jose M.
    Garcia, Jose M.
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2020, 34 (02): : 199 - 207
  • [43] Code modernization strategies to 3-D Stencil-based applications on Intel Xeon Phi: KNC and KNL
    Cebrian, Juan M.
    Cecilia, Jose M.
    Hernandez, Mario
    Garcia, Jose M.
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2017, 74 (10) : 2557 - 2571
  • [44] Offload Compiler Runtime for the Intel® Xeon Phi™ Coprocessor
    Newburn, Chris J.
    Deodhar, Rajiv
    Dmitriev, Serguei
    Murty, Ravi
    Narayanaswamy, Ravi
    Wiegert, John
    Chinchilla, Francisco
    McGuire, Russell
    SUPERCOMPUTING (ISC 2013), 2013, 7905 : 239 - 254
  • [45] Intel® Xeon Phi™ coprocessor (codename Knights Corner)
    Chrysos, George
    2012 IEEE HOT CHIPS 24 SYMPOSIUM (HCS), 2012,
  • [46] Implementing Central Force Optimization on the Intel Xeon Phi
    Charest, Thomas
    Green, Robert C.
    2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2020), 2020, : 502 - 511
  • [47] Optimizing the MapReduce Framework on Intel Xeon Phi Coprocessor
    Lu, Mian
    Zhang, Lei
    Huynh Phung Huynh
    Ong, Zhongliang
    Liang, Yun
    He, Bingsheng
    Goh, Rick Siow Mong
    Richard Huynh
    2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [48] Effective Barrier Synchronization on Intel Xeon Phi Coprocessor
    Rodchenko, Andrey
    Nisbet, Andy
    Pop, Antoniu
    Lujan, Mikel
    EURO-PAR 2015: PARALLEL PROCESSING, 2015, 9233 : 588 - 600
  • [49] Bent Functions Synthesis on Intel Xeon Phi Coprocessor
    Hrbacek, Radek
    MATHEMATICAL AND ENGINEERING METHODS IN COMPUTER SCIENCE, MEMICS 2014, 2014, 8934 : 88 - 99
  • [50] Retargeting of the Open Community Runtime to Intel Xeon Phi
    Dokulil, Jiri
    Benkner, Siegfried
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2015 COMPUTATIONAL SCIENCE AT THE GATES OF NATURE, 2015, 51 : 1453 - 1462