Parallel Implementations of ARX-Based Block Ciphers on Graphic Processing Units

被引:5
|
作者
An, SangWoo [1 ]
Kim, YoungBeom [2 ]
Kwon, Hyeokdong [3 ]
Seo, Hwajeong [3 ]
Seo, Seog Chung [2 ]
机构
[1] Kookmin Univ, Dept Financial Informat Secur, Seoul 02707, South Korea
[2] Kookmin Univ, Dept Informat Secur Cryptol & Math, Seoul 02707, South Korea
[3] Hansung Univ, Div IT Convergence Engn, Seoul 02876, South Korea
基金
新加坡国家研究基金会;
关键词
CHAM; LEA; HIGHT; Graphic Processing Unit (GPU); CUDA; Counter (CTR) mode; parallel processing;
D O I
10.3390/math8111894
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
With the development of information and communication technology, various types of Internet of Things (IoT) devices have widely been used for convenient services. Many users with their IoT devices request various services to servers. Thus, the amount of users' personal information that servers need to protect has dramatically increased. To quickly and safely protect users' personal information, it is necessary to optimize the speed of the encryption process. Since it is difficult to provide the basic services of the server while encrypting a large amount of data in the existing CPU, several parallel optimization methods using Graphics Processing Units (GPUs) have been considered. In this paper, we propose several optimization techniques using GPU for efficient implementation of lightweight block cipher algorithms on the server-side. As the target algorithm, we select high security and light weight (HIGHT), Lightweight Encryption Algorithm (LEA), and revised CHAM, which are Add-Rotate-Xor (ARX)-based block ciphers, because they are used widely on IoT devices. We utilize the features of the counter (CTR) operation mode to reduce unnecessary memory copying and operations in the GPU environment. Besides, we optimize the memory usage by making full use of GPU's on-chip memory such as registers and shared memory and implement the core function of each target algorithm with inline PTX assembly codes for maximizing the performance. With the application of our optimization methods and handcrafted PTX codes, we achieve excellent encryption throughput of 468, 2593, and 3063 Gbps for HIGHT, LEA, and revised CHAM on RTX 2070 NVIDIA GPU, respectively. In addition, we present optimized implementations of Counter Mode Based Deterministic Random Bit Generator (CTR_DRBG), which is one of the widely used deterministic random bit generators to provide a large amount of random data to the connected IoT devices. We apply several optimization techniques for maximizing the performance of CTR_DRBG, and we achieve 52.2, 24.8, and 34.2 times of performance improvement compared with CTR_DRBG implementation on CPU-side when HIGHT-64/128, LEA-128/128, and CHAM-128/128 are used as underlying block cipher algorithm of CTR_DRBG, respectively.
引用
收藏
页码:1 / 25
页数:25
相关论文
共 50 条
  • [1] Compact Implementations of ARX-Based Block Ciphers on IoT Processors
    Seo, Hwajeong
    Jeong, Ilwoong
    Lee, Jungkeun
    Kim, Woo-Hwan
    [J]. ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2018, 17 (03)
  • [2] Cryptanalysis of Selected ARX-Based Block Ciphers
    Gundaram, Praveen Kumar
    [J]. VIETNAM JOURNAL OF COMPUTER SCIENCE, 2024,
  • [3] Fast implementations of ARX-based lightweight block ciphers (SPARX, CHAM) on 32-bit processor
    Seok, Byoungjin
    Lee, Changhoon
    [J]. INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2019, 15 (09)
  • [4] Efficient Parallel Implementation of CTR Mode of ARX-Based Block Ciphers on ARMv8 Microcontrollers
    Song, JinGyo
    Seo, Seog Chung
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (06):
  • [5] Efficient Implementation of ARX-Based Block Ciphers on 8-Bit AVR Microcontrollers
    Kim, YoungBeom
    Kwon, Hyeokdong
    An, SangWoo
    Seo, Hwajeong
    Seo, Seog Chung
    [J]. MATHEMATICS, 2020, 8 (10) : 1 - 22
  • [6] Efficient Computation of Boomerang Connection Probability for ARX-Based Block Ciphers with Application to SPECK and LEA
    Kim, Dongyeong
    Kwon, Dawoon
    Song, Junghwan
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2020, E103A (04) : 677 - 685
  • [7] Open Problems of Proving Security of ARX-based Ciphers to Differential Cryptanalysis
    Rodinko, Mariia
    Oliynykov, Roman
    [J]. 2017 4TH INTERNATIONAL SCIENTIFIC-PRACTICAL CONFERENCE PROBLEMS OF INFOCOMMUNICATIONS-SCIENCE AND TECHNOLOGY (PIC S&T), 2017, : 228 - 231
  • [8] Secure and Fast Implementation of ARX-Based Block Ciphers Using ASIMD Instructions in ARMv8 Platforms
    Song, Jingyo
    Seo, Seog Chung
    [J]. IEEE ACCESS, 2020, 8 : 193138 - 193153
  • [9] Highly Efficient Implementation of Block Ciphers on Graphic Processing Units for Massively Large Data
    An, SangWoo
    Seo, Seog Chung
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (11):
  • [10] DABC: A dynamic ARX-based lightweight block cipher with high diffusion
    Chen, Wen
    Li, Lang
    Guo, Ying
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2023, 17 (01): : 165 - 184