Parallel SHA-256 on SW26010 many-core processor for hashing of multiple messages

被引:0
|
作者
Ziheng Wang
Xiaoshe Dong
Yan Kang
Heng Chen
机构
[1] Xi’an Jiaotong University,School of Computer Science and Technology
来源
关键词
SW26010; SHA-256; Multiple messages; Instruction level;
D O I
暂无
中图分类号
学科分类号
摘要
To explore whether new parallelism techniques can provide additional performance improvements in cryptographic hash functions, we conducted our study with the SW26010, which is a special-architecture processor on Sunway TaihuLight, one of the world’s fastest supercomputers. Secure Hash Algorithms (SHAs) are significant for secure transmission, with SHA-256 remaining a safe and most efficient SHA design. We propose SW-SHA-256, a parallel SHA-256 implementation for hashing of multiple messages on the SW26010. Our work explores the parallel schemes at the instruction and thread levels. At the instruction level, we use vector registers to load multiple messages to complete hashing simultaneously. Assembly-level optimization methods such as dual issue are used, and the pipeline is distinct from that of a general-purpose processor. At the thread level, the optimized DMA transmission strategy and double buffer technique are used to reduce the cost from memory to cache. As a result, we obtain 5.87 cycles per byte in a single core which is 8.18X speed up faster than the C code in OpenSSLv3.0.0. Moreover, our implementation achieves a throughput of 60.21 GB/s on a SW26010 processor and is highly scalable.
引用
收藏
页码:2332 / 2355
页数:23
相关论文
共 50 条
  • [41] Parallel AES Encryption Engines for Many-Core Processor Arrays
    Liu, Bin
    Baas, Bevan M.
    IEEE TRANSACTIONS ON COMPUTERS, 2013, 62 (03) : 536 - 547
  • [42] Publisher Correction: xMath2.0: a high-performance extended math library for SW26010-Pro many-core processor
    Fangfang Liu
    Wenjing Ma
    Yuwen Zhao
    Daokun Chen
    Yi Hu
    Qinglin Lu
    WanWang Yin
    Xinhui Yuan
    Lijuan Jiang
    Hao Yan
    Min Li
    Hongsen Wang
    Xinyu Wang
    Chao Yang
    CCF Transactions on High Performance Computing, 2023, 5 : 97 - 97
  • [43] Research and Optimization of the Winograd-Based Convolutional Algorithm on ShenWei-26010 Many-Core Processor
    Wu Z.
    Jin X.
    An H.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (04): : 955 - 972
  • [44] SunwayImg: A Parallel Image Processing Library for the Sunway Many-Core Processor
    Liu, Rui
    Liu, Yi
    Zhao, Meiting
    Song, Kaida
    Qian, Depei
    IEEE ACCESS, 2019, 7 : 128555 - 128569
  • [45] A Parallel Bloom Filter String Searching Algorithm on a Many-core Processor
    Ong, WenMei
    Baskaran, Vishnu Monn
    Chong, Poh Kit
    Ettikan, K. K.
    Ong, Keh Kok Y.
    2013 IEEE CONFERENCE ON OPEN SYSTEMS (ICOS), 2013, : 1 - +
  • [46] A Many-Core Co-Processor for Embedded Parallel Computing on FPGA
    Jose, Wilson
    Neto, Horacio
    Vestias, Mario
    2015 EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN (DSD), 2015, : 539 - 542
  • [47] Efficient parallel HEVC intra-prediction on many-core processor
    Yan, C.
    Zhang, Y.
    Dai, F.
    Zhang, J.
    Li, L.
    Dai, Q.
    ELECTRONICS LETTERS, 2014, 50 (11) : 805 - U53
  • [48] Optimizing massively parallel sparse matrix computing on ARM many-core processor
    Zheng, Jiang
    Jiang, Jiazhi
    Du, Jiangsu
    Huang, Dan
    Lu, Yutong
    PARALLEL COMPUTING, 2023, 117
  • [49] GPU ACCELERATED PARALLEL BRANCH PREDICTION FOR MULTI/MANY-CORE PROCESSOR SIMULATION
    He, Liqiang
    Zhang, Guangyong
    Jiang, Jingdong
    INTERNATIONAL JOURNAL OF NUMERICAL ANALYSIS AND MODELING, 2012, 9 (02) : 193 - 207
  • [50] New system software for parallel programming models on the Intel SCC many-core processor
    Clauss, Carsten
    Lankes, Stefan
    Reble, Pablo
    Bemmerl, Thomas
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2015, 27 (09): : 2235 - 2259