On the energy consumption of Load/Store AVX instructions

Cited by: 5
Authors
Jakobs, Thomas [1]
Ruenger, Gudula [1]
Affiliations
[1] Tech Univ Chemnitz, Dept Comp Sci, D-09111 Chemnitz, Germany
Keywords
Energy consumption; Power consumption; AVX instruction set; Gaussian elimination
DOI
10.15439/2018F28
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The energy efficiency of program executions has been an active research field in recent years, and the influence of different programming styles on energy consumption is part of that research effort. In this article, we concentrate on SIMD programming and study the effect of vectorization on performance as well as on power and energy consumption. In particular, SIMD programs using AVX instructions are considered, with a focus on the AVX load and store instructions. Several semantically similar but distinct load and store instructions are selected and used to build different program versions of the same algorithm. Gaussian elimination has been chosen as the example application because it operates on arrays whose length varies in each factorization step. Five different SIMD program versions of Gaussian elimination have been implemented, each of which uses different load and store instructions. Performance, power, and energy measurements for all program versions are provided for the Intel Sandy Bridge, Haswell, and Skylake architectures, and the results are discussed and analyzed.
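To make the setting described in the abstract concrete, the following is a minimal sketch in C of forward elimination whose row update uses one possible pair of AVX load and store intrinsics. It is not the authors' code: the abstract does not name the five variants, so the choice of `_mm256_loadu_pd` and `_mm256_storeu_pd` is an assumption, and the function names `row_update` and `gauss_eliminate`, the row-major layout, and the omission of pivoting are hypothetical choices made for illustration. The AVX path is compiled only when the compiler defines `__AVX__` (for example with `-mavx`); otherwise the scalar loop does all the work.

```c
#include <stddef.h>
#if defined(__AVX__)
#include <immintrin.h>
#endif

/* Hypothetical helper: row_i[j] -= factor * row_k[j] for j in [start, n).
 * The start index changes in every elimination step, so the first
 * element touched is generally not 32-byte aligned. */
static void row_update(double *row_i, const double *row_k,
                       double factor, size_t start, size_t n)
{
    size_t j = start;
#if defined(__AVX__)
    const __m256d f = _mm256_set1_pd(factor);
    /* Process four doubles per iteration with unaligned load/store.
     * Assumption: this is only one of several possible variants; the
     * aligned forms (_mm256_load_pd / _mm256_store_pd) would require
     * 32-byte aligned addresses, which start = k + 1 does not ensure. */
    for (; j + 4 <= n; j += 4) {
        __m256d rk = _mm256_loadu_pd(row_k + j);
        __m256d ri = _mm256_loadu_pd(row_i + j);
        ri = _mm256_sub_pd(ri, _mm256_mul_pd(f, rk));
        _mm256_storeu_pd(row_i + j, ri);
    }
#endif
    /* Scalar remainder (or the whole range when AVX is unavailable). */
    for (; j < n; ++j) {
        row_i[j] -= factor * row_k[j];
    }
}

/* Hypothetical driver: in-place forward elimination without pivoting
 * on an n x n row-major matrix. In step k, each row update covers the
 * n - k - 1 trailing columns, so the vector length shrinks per step. */
static void gauss_eliminate(double *a, size_t n)
{
    for (size_t k = 0; k + 1 < n; ++k) {
        for (size_t i = k + 1; i < n; ++i) {
            double factor = a[i * n + k] / a[k * n + k];
            a[i * n + k] = 0.0;
            row_update(a + i * n, a + k * n, factor, k + 1, n);
        }
    }
}
```

Calling `gauss_eliminate(a, n)` leaves the upper triangular factor in `a`. Swapping the two intrinsics inside `row_update` for other load and store forms is the kind of variation the abstract refers to; which forms the paper actually compares is not stated in this record.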
Pages: 319-327
Number of pages: 9