Lattice QCD on Intel® Xeon Phi™ Coprocessors

被引:0
|
作者
Joo, Balint [1 ]
Kalamkar, Dhiraj D. [2 ]
Vaidyanathan, Karthikeyan [2 ]
Smelyanskiy, Mikhail [3 ]
Pamnany, Kiran [2 ]
Lee, Victor W. [3 ]
Dubey, Pradeep [3 ]
Watson, William, III [1 ]
机构
[1] Thomas Jefferson Natl Accelerator Facil, Newport News, VA 23606 USA
[2] Intel Corp, Parallel Comp Lab, Bangalore, Karnataka, India
[3] Intel Corp, Parallel Comp Lab, Santa Clara, CA USA
来源
SUPERCOMPUTING (ISC 2013) | 2013年 / 7905卷
关键词
SOLVERS;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Lattice Quantum Chromodynamics (LQCD) is currently the only known model independent, non perturbative computational method for calculations in the theory of the strong interactions, and is of importance in studies of nuclear and high energy physics. LQCD codes use large fractions of supercomputing cycles worldwide and are often amongst the first to be ported to new high performance computing architectures. The recently released Intel Xeon Phi architecture from Intel Corporation features parallelism at the level of many x86-based cores, multiple threads per core, and vector processing units. In this contribution, we describe our experiences with optimizing a key LQCD kernel for the Xeon Phi architecture. On a single node, using single precision, our Dslash kernel sustains a performance of up to 320 GFLOPS, while our Conjugate Gradients solver sustains up to 237 GFLOPS. Furthermore we demonstrate a fully ' native' multi-node LQCD implementation running entirely on KNC nodes with minimum involvement of the host CPU. Our multi-node implementation of the solver has been strong scaled to 3.9 TFLOPS on 32 KNCs.
引用
收藏
页码:40 / 54
页数:15
相关论文
共 50 条
  • [21] Utilizing Multiple Xeon Phi Coprocessors on One Compute Node
    Dong, Xinnan
    Chai, Jun
    Yang, Jing
    Wen, Mei
    Wu, Nan
    Cai, Xing
    Zhang, Chunyuan
    Chen, Zhaoyun
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2014, PT II, 2014, 8631 : 68 - 81
  • [22] Porting to the Intel Xeon Phi: Opportunities and Challenges
    Rosales, C.
    [J]. 2013 EXTREME SCALING WORKSHOP (XSW 2013), 2014, : 1 - 7
  • [23] Tera-Scale 1D FFT with Low-Communication Algorithm and Intel® Xeon Phi™ Coprocessors
    Park, Jongsoo
    Bikshandi, Ganesh
    Vaidyanathan, Karthikeyan
    Tang, Ping Tak Peter
    Dubey, Pradeep
    Kim, Daehyun
    [J]. 2013 INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SC), 2013,
  • [24] Biosequence Analysis using Intel® Xeon Phi
    Sinha, Pradeep
    Misra, Goldi
    Vikraman, Deepu
    Das, Abhishek
    Desai, Shraddha
    Pawar, Sucheta
    Shewale, Kalyani
    [J]. UKSIM-AMSS SEVENTH EUROPEAN MODELLING SYMPOSIUM ON COMPUTER MODELLING AND SIMULATION (EMS 2013), 2013, : 497 - 499
  • [25] Behavior of MDynaMix on Intel Xeon Phi Coprocessor
    Valmiki, Manjunatha
    Kurkure, Nisha
    Das, Shweta
    Dinde, Prashant
    Deepu, C., V
    Misra, Goldi
    Sinha, Pradeep
    [J]. 2013 FIRST INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, MODELLING AND SIMULATION (AIMS 2013), 2013, : 387 - 392
  • [26] Optimizing Performance of ROMS on Intel Xeon Phi
    Bhaskaran, Gopal
    Gaurav, Pratyush
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2015 COMPUTATIONAL SCIENCE AT THE GATES OF NATURE, 2015, 51 : 2854 - 2858
  • [27] Fast solution of electromagnetic scattering problems using Xeon Phi coprocessors
    J. L. Campon
    L. Landesa
    [J]. The Journal of Supercomputing, 2019, 75 : 370 - 383
  • [28] High-level support for hybrid parallel execution of C plus plus applications targeting Intel® Xeon Phi™ coprocessors
    Dokulil, Jiri
    Bajrovic, Enes
    Benkner, Siegfried
    Pllana, Sabri
    Sandrieser, Martin
    Bachmayer, Beverly
    [J]. 2013 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, 2013, 18 : 2508 - 2511
  • [29] Fast solution of electromagnetic scattering problems using Xeon Phi coprocessors
    Campon, J. L.
    Landesa, L.
    [J]. JOURNAL OF SUPERCOMPUTING, 2019, 75 (01): : 370 - 383
  • [30] Performance Evaluation of R with Intel Xeon Phi Coprocessor
    El-Khamra, Yaakoub
    Gaffney, Niall
    Walling, David
    Wernert, Eric
    Xu, Weijia
    Zhang, Hui
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,