Primer: Fast Private Transformer Inference on Encrypted Data

被引:1
|
作者
Zheng, Mengxin [1 ]
Lou, Qian [2 ]
Jiang, Lei [1 ]
机构
[1] Indiana Univ, Bloomington, IN 47405 USA
[2] Univ Cent Florida, Orlando, FL 32816 USA
来源
2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC | 2023年
关键词
Fully Homomorphic Encryption; Multi-party Computation; Transformer; Cryptographic Protocol; Private Inference;
D O I
10.1109/DAC56929.2023.10247719
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is increasingly important to enable privacy-preserving inference for cloud services based on Transformers. Post-quantum cryptographic techniques, e.g., fully homomorphic encryption (FHE), and multi-party computation (MPC), are popular methods to support private Transformer inference. However, existing works still suffer from prohibitively computational and communicational overhead. In this work, we present, Primer, to enable a fast and accurate Transformer over encrypted data for natural language processing tasks. In particular, Primer is constructed by a hybrid cryptographic protocol optimized for attention-based Transformer models, as well as techniques including computation merge and tokens-first ciphertext packing. Comprehensive experiments on encrypted language modeling show that Primer achieves state-of-the-art accuracy and reduces the inference latency by 90.6% similar to 97.5% over previous methods.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Fast homomorphic SVM inference on encrypted data
    Al Badawi, Ahmad
    Chen, Ling
    Vig, Saru
    Neural Computing and Applications, 2022, 34 (18) : 15555 - 15573
  • [2] Fast homomorphic SVM inference on encrypted data
    Al Badawi, Ahmad
    Chen, Ling
    Vig, Saru
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (18): : 15555 - 15573
  • [3] Fast homomorphic SVM inference on encrypted data
    Ahmad Al Badawi
    Ling Chen
    Saru Vig
    Neural Computing and Applications, 2022, 34 : 15555 - 15573
  • [4] Fast Permutation Architecture on Encrypted Data for Secure Neural Network Inference
    Hu, Xiao
    Tian, Jing
    Wang, Zhongfeng
    APCCAS 2020: PROCEEDINGS OF THE 2020 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS (APCCAS 2020), 2020, : 141 - 144
  • [5] Fast and private multi-dimensional range search over encrypted data
    Kermanshahi, Shabnam Kasra
    Steinfeld, Ron
    Yi, Xun
    Liu, Joseph K.
    Nepal, Surya
    Lou, Junwei
    INFORMATION SCIENCES, 2024, 652
  • [6] Private queries on encrypted genomic data
    Cetin, Gizem S.
    Chen, Hao
    Laine, Kim
    Lauter, Kristin
    Rindal, Peter
    Xia, Yuhou
    BMC MEDICAL GENOMICS, 2017, 10
  • [7] Private queries on encrypted genomic data
    Gizem S. Çetin
    Hao Chen
    Kim Laine
    Kristin Lauter
    Peter Rindal
    Yuhou Xia
    BMC Medical Genomics, 10
  • [8] Private Computation on Encrypted Genomic Data
    Lauter, Kristin
    Lopez-Alt, Adriana
    Naehrig, Michael
    PROGRESS IN CRYPTOLOGY - LATINCRYPT 2014, 2015, 8895 : 3 - 27
  • [9] Towards Fast and Scalable Private Inference
    Mo, Jianqiao
    Garimella, Karthik
    Neda, Negar
    Ebel, Austin
    Reagen, Brandon
    PROCEEDINGS OF THE 20TH ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS 2023, CF 2023, 2023, : 322 - 328
  • [10] Private AI: Machine Learning on Encrypted Data
    Lauter, Kristin
    RECENT ADVANCES IN INDUSTRIAL AND APPLIED MATHEMATICS, 2022, : 97 - 113