A hybrid CPU/GPU Scheme for Optimizing ChaCha20 Stream Cipher

被引:7
|
作者
Wang, Ziheng [1 ]
Chen, Heng [1 ]
Cai, Weiling [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Comp Sci & Technol, Xian, Peoples R China
关键词
ChaCha20; GPU; MPI; AES; FPGA;
D O I
10.1109/ISPA-BDCloud-SocialCom-SustainCom52081.2021.00161
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The secure transmission of large-scale data has attracted more and more attention. In the widely recognized security protocol TLSv1.3, the only algorithms that support large-scale data en-/decryption are ChaCha20 and the Advanced Encryption Standard (AES). Although AES has a higher usage rate, ChaCha20 still has the advantage of speed and security on many platforms, and has a better performance against postquantum attacks. However, for a CPU/GPU platform, compared to the AES algorithm, no work has fully described the application scheme of ChaCha. This paper proposes an optimization scheme to optimize the performance of the ChaCha20 algorithm on a CPU/GPU platform. On a CPU platform, we provide a parallelization implementation that is better than that of OpenSSL. On a single GPU, our implementation of ChaCha20 achieves peak throughput of 211.41GB/s, which is better than any previous implementation of ChaCha20 and AES algorithms on GPU. More importantly, we are the first to detail the optimization of ChaCha on GPU. When considering the interconnection between CPU and GPU, we use the 87.76% peak bidirectional bandwidth of a PCIe channel. Finally, we also provide a scheme for the application of ChaCha20 on a CPU/GPU platform.
引用
收藏
页码:1171 / 1178
页数:8
相关论文
共 19 条
  • [1] Performance of Parallel ChaCha20 Stream Cipher
    Velea, Radu
    Gurzau, Florina
    Margarit, Laurentiu
    Bica, Ion
    Patriciu, Victor-Valeriu
    [J]. 2016 IEEE 11TH INTERNATIONAL SYMPOSIUM ON APPLIED COMPUTATIONAL INTELLIGENCE AND INFORMATICS (SACI), 2016, : 391 - 395
  • [2] Implementation and optimization of ChaCha20 stream cipher on sunway taihuLight supercomputer
    Cai, Weilin
    Chen, Heng
    Wang, Ziheng
    Zhang, Xingjun
    [J]. JOURNAL OF SUPERCOMPUTING, 2022, 78 (03): : 4199 - 4216
  • [3] Implementation and optimization of ChaCha20 stream cipher on sunway taihuLight supercomputer
    Weilin Cai
    Heng Chen
    Ziheng Wang
    Xingjun Zhang
    [J]. The Journal of Supercomputing, 2022, 78 : 4199 - 4216
  • [4] High-Performance Hardware Structure of ChaCha20 Stream Cipher Based on Sparse Parallel Prefix Adder
    Rashidi, Bahram
    [J]. INTERNATIONAL JOURNAL OF CIRCUIT THEORY AND APPLICATIONS, 2024,
  • [5] Event-based cryptography for automation networks of cyber-physical systems using the stream cipher ChaCha20
    Lima, Publio M.
    da Silva, Carlos K. P.
    de Farias, Claudio M.
    Carvalho, Lilian K.
    Moreira, Marcos, V
    [J]. IFAC PAPERSONLINE, 2022, 55 (28): : 58 - 65
  • [6] Extended-Chacha20 Stream Cipher With Enhanced Quarter Round Function
    Kebande, Victor R.
    [J]. IEEE ACCESS, 2023, 11 : 114220 - 114237
  • [7] A hybrid CPU/GPU approach for optimizing sorting throughput
    Gowanlock, Michael
    Karsin, Ben
    [J]. PARALLEL COMPUTING, 2019, 85 : 45 - 55
  • [8] Optimizing tensor contraction expressions for hybrid CPU-GPU execution
    Wenjing Ma
    Sriram Krishnamoorthy
    Oreste Villa
    Karol Kowalski
    Gagan Agrawal
    [J]. Cluster Computing, 2013, 16 : 131 - 155
  • [9] Optimizing tensor contraction expressions for hybrid CPU-GPU execution
    Ma, Wenjing
    Krishnamoorthy, Sriram
    Villa, Oreste
    Kowalski, Karol
    Agrawal, Gagan
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2013, 16 (01): : 131 - 155
  • [10] CPU-GPU hybrid computing for feature extraction from video stream
    Lee, Sungju
    Kim, Heegon
    Park, Daihee
    Chung, Yongwha
    Jeong, Taikyeong
    [J]. IEICE ELECTRONICS EXPRESS, 2014, 11 (22):