ConvKyber: Unleashing the Power of AI Accelerators for Faster Kyber with Novel Iteration-based Approaches

被引:0
|
作者
Zhou T. [1 ]
Zheng F. [2 ]
Fan G. [3 ]
Wan L. [2 ]
Tang W. [1 ]
Song Y. [3 ]
Bian Y. [4 ]
Lin J. [1 ,5 ]
机构
[1] School of Cyber Security, University of Science and Technology of China, Heifei
[2] School of CryptologyUniversity of Chinese Academy of Sciences, Beijing
[3] Ant Group, Hangzhou
[4] School of Computer Science and TechnologyUniversity of Chinese Academy of Sciences, Beijing
[5] Beijing Research Institute, University of Science and Technology of China, Beijing
基金
中国国家自然科学基金;
关键词
GPUs; Kyber; Lattice-based Cryptography; Tensor Core;
D O I
10.46586/tches.v2024.i2.25-63
中图分类号
学科分类号
摘要
The remarkable performance capabilities of AI accelerators offer promising opportunities for accelerating cryptographic algorithms, particularly in the context of lattice-based cryptography. However, current approaches to leveraging AI accelerators often remain at a rudimentary level of implementation, overlooking the intricate internal mechanisms of these devices. Consequently, a significant number of computational resources is underutilized. In this paper, we present a comprehensive exploration of NVIDIA Tensor Cores and introduce a novel framework tailored specifically for Kyber. Firstly, we propose two innovative approaches that efficiently break down Kyber’s NTT into iterative matrix multiplications, resulting in approximately a 75% reduction in costs compared to the state-of-the-art scanning-based methods. Secondly, by reversing the internal mechanisms, we precisely manipulate the internal resources of Tensor Cores using assembly-level code instead of inefficient standard interfaces, eliminating memory accesses and redundant function calls. Finally, building upon our highly optimized NTT, we provide a complete implementation for all parameter sets of Kyber. Our implementation surpasses the state-of-the-art Tensor Core based work, achieving remarkable speed-ups of 1.93x, 1.65x, 1.22x and 3.55x for polyvec_ntt, KeyGen, Enc and Dec in Kyber-1024, respectively. Even when considering execution latency, our throughput-oriented full Kyber implementation maintains an acceptable execution latency. For instance, the execution latency ranges from 1.02 to 5.68 milliseconds for Kyber-1024 on R3080 when achieving the peak throughput. © 2024, Ruhr-University of Bochum. All rights reserved.
引用
收藏
页码:25 / 63
页数:38
相关论文
共 26 条
  • [21] Enhancing Power Production: A Novel Hydrogen-Based Scheme With Comprehensive Analysis and AI-Optimized Criteria
    Mohammadzadeh, Vahid
    Saboohi, Zoheir
    Ommi, Fathollah
    Gholamian, Ehsan
    ENERGY STORAGE, 2024, 6 (05)
  • [22] Novel Metamaterial and AI-based Multi-Objective Optimization of Coil Parameters for Efficient Wireless Power Transfer
    Adepoju, Webster O.
    Bhattacharya, Indranil
    Bima, Muhammad E.
    Banik, Trapa
    2021 IEEE VEHICLE POWER AND PROPULSION CONFERENCE (VPPC), 2021,
  • [23] A Novel Framework for Applying Multiobjective GA and PSO Based Approaches for Simultaneous Area, Delay, and Power Optimization in High Level Synthesis of Datapaths
    Ram, D. S. Harish
    Bhuvaneswari, M. C.
    Prabhu, Shanthi S.
    VLSI DESIGN, 2012,
  • [24] Nuclear Power Plant accident identification system with "don't know" response capability: Novel deep learning-based approaches
    Cabral Pinheiro, Victor Henrique
    dos Santos, Marcelo Carvalho
    Moreira do Desterro, Filipe Santana
    Schirru, Roberto
    do Nascimento Abreu Pereira, Claudio Marcio
    ANNALS OF NUCLEAR ENERGY, 2020, 137
  • [25] Two novel proposed discrete wavelet transform and filter based approaches for short-circuit faults detection in power transmission lines
    Fathabadi, Hassan
    APPLIED SOFT COMPUTING, 2015, 36 : 375 - 382
  • [26] A novel mutation-proof, next-generation vaccine to fight against upcoming SARS-CoV-2 variants and subvariants, designed through AI enabled approaches and tools, along with the machine learning based immune simulation: A vaccine breakthrough
    Bhattacharya, Manojit
    Alshammari, Abdulrahman
    Alharbi, Metab
    Dhama, Kuldeep
    Lee, Sang -Soo
    Chakraborty, Chiranjib
    INTERNATIONAL JOURNAL OF BIOLOGICAL MACROMOLECULES, 2023, 242