Benchmarking Data Analysis and Machine Learning Applications on the Intel KNL Many-Core Processor

被引:0
|
作者
Byun, Chansup [1 ]
Kepner, Jeremy [1 ]
Arcand, William [1 ]
Bestor, David [1 ]
Bergeron, Bill [1 ]
Gadepally, Vijay [1 ]
Houle, Michael [1 ]
Hubbell, Matthew [1 ]
Jones, Michael [1 ]
Klein, Anna [1 ]
Michaleas, Peter [1 ]
Milechin, Lauren [1 ]
Mullen, Julie [1 ]
Prout, Andrew [1 ]
Rosa, Antonio [1 ]
Samsi, Siddharth [1 ]
Yee, Charles [1 ]
Reuther, Albert [1 ]
机构
[1] MIT, Lincoln Lab, Supercomp Ctr, 244 Wood St, Lexington, MA 02173 USA
关键词
Benchmark; MATLAB; Octave; DGEMM; throughput; performance; machine learning; Caffe; Haswell; Knights Landing;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Knights Landing (KNL) is the code name for the second-generation Intel Xeon Phi product family. KNL has generated significant interest in the data analysis and machine learning communities because its new many-core architecture targets both of these workloads. The KNL many-core vector processor design enables it to exploit much higher levels of parallelism. At the Lincoln Laboratory Supercomputing Center (LLSC), the majority of users are running data analysis applications such as MATLAB and Octave. More recently, machine learning applications, such as the UC Berkeley Caffe deep learning framework, have become increasingly important to LLSC users. Thus, the performance of these applications on KNL systems is of high interest to LLSC users and the broader data analysis and machine learning communities. Our data analysis benchmarks of these application on the Intel KNL processor indicate that single-core double-precision generalized matrix multiply (DGEMM) performance on KNL systems has improved by similar to 3.5x compared to prior Intel Xeon technologies. Our data analysis applications also achieved similar to 60% of the theoretical peak performance. Also a performance comparison of a machine learning application, Caffe, between the two different Intel CPUs, Xeon E5 v3 and Xeon Phi 7210, demonstrated a 2.7x improvement on a KNL node.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Many-Core Scheduling of Data Parallel Applications using SMT Solvers
    Tendulkar, Pranav
    Poplavko, Peter
    Galanommatis, Ioannis
    Maler, Oded
    2014 17TH EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN (DSD), 2014, : 615 - 622
  • [42] A Study of Euclidean Distance Matrix Computation on Intel Many-Core Processors
    Rechkalov, Timofey
    Zymbler, Mikhail
    PARALLEL COMPUTATIONAL TECHNOLOGIES, PCT 2018, 2018, 910 : 200 - 215
  • [43] Temperature-aware Thread Assignment of Many-core Processor
    Xuan, SheXiao
    Yang, Y.
    PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2015), 2015, : 332 - 336
  • [44] Characterizing and Optimizing Transformer Inference on ARM Many-core Processor
    Jiang, Jiazhi
    Du, Jiangsu
    Huang, Dan
    Li, Dongsheng
    Zheng, Jiang
    Lu, Yutong
    51ST INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2022, 2022,
  • [45] On the Use of a Many-core Processor for Computational Fluid Dynamics Simulations
    Raase, Sebastian
    Nordstrom, Tomas
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2015 COMPUTATIONAL SCIENCE AT THE GATES OF NATURE, 2015, 51 : 1403 - 1412
  • [46] Advanced Power Devices for Many-core Processor Power Supplies
    Briere, Michael A.
    2010 INTERNATIONAL ELECTRON DEVICES MEETING - TECHNICAL DIGEST, 2010,
  • [47] Parallel AES Encryption Engines for Many-Core Processor Arrays
    Liu, Bin
    Baas, Bevan M.
    IEEE TRANSACTIONS ON COMPUTERS, 2013, 62 (03) : 536 - 547
  • [48] Efficient Workload Balance Technology on Many-core Crypto Processor
    Dai Zibin
    Yin Anqi
    Qu Tongzhou
    Nan Longmei
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2019, 41 (02) : 369 - 376
  • [49] IXPUG Workshop: Many-Core Computing on Intel Processors: Applications, Performance and Best-Practice Solutions
    Steinke, Thomas
    Pennycook, Simon J.
    Suarez, Estela
    Martin, David E.
    HIGH PERFORMANCE COMPUTING, ISC HIGH PERFORMANCE 2018, 2018, 11203 : 456 - 461
  • [50] Accelerating DES and AES Algorithms for a Heterogeneous Many-core Processor
    Xing, Biao
    Wang, DanDan
    Yang, Yongquan
    Wei, Zhiqiang
    Wu, Jiajing
    He, Cuihua
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2021, 49 (03) : 463 - 486