Understanding the Performance of Stencil Computations on Intel's Xeon Phi

Cited by: 0
Authors
Peraza, Joshua [1]
Tiwari, Ananta [2]
Laurenzano, Michael [2]
Carrington, Laura [2]
Ward, William A. [3]
Campbell, Roy [3]
Affiliations
[1] Univ Calif San Diego, San Diego, CA 92103 USA
[2] San Diego Supercomp Ctr, La Jolla, CA 92093 USA
[3] United States Dept Def, High Performance Comp Modernizat Program, La Jolla, CA 92093 USA
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Accelerators are becoming prevalent in high performance computing as a way of achieving increased computational capacity within a smaller power budget. Using the raw compute capacity of these systems effectively, however, remains a challenge because porting and optimizing code for novel accelerator hardware can require a substantial investment of programmer time. In this paper we present a methodology for isolating and modeling the performance of common performance-critical patterns of code (so-called idioms) and other relevant behavioral characteristics from large-scale HPC applications that are likely to perform favorably on the Intel Xeon Phi. The benefits of the methodology are twofold: (1) it directs programmer effort toward the regions of code most likely to benefit from porting to the Xeon Phi, and (2) it provides speedup estimates for porting those regions of code. We then apply the methodology to the stencil idiom, showing performance improvements of up to 4.7x on stencil-based benchmark codes.
Pages: 5