Primer: Fast Private Transformer Inference on Encrypted Data

被引:1
|
作者
Zheng, Mengxin [1 ]
Lou, Qian [2 ]
Jiang, Lei [1 ]
机构
[1] Indiana Univ, Bloomington, IN 47405 USA
[2] Univ Cent Florida, Orlando, FL 32816 USA
来源
2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC | 2023年
关键词
Fully Homomorphic Encryption; Multi-party Computation; Transformer; Cryptographic Protocol; Private Inference;
D O I
10.1109/DAC56929.2023.10247719
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is increasingly important to enable privacy-preserving inference for cloud services based on Transformers. Post-quantum cryptographic techniques, e.g., fully homomorphic encryption (FHE), and multi-party computation (MPC), are popular methods to support private Transformer inference. However, existing works still suffer from prohibitively computational and communicational overhead. In this work, we present, Primer, to enable a fast and accurate Transformer over encrypted data for natural language processing tasks. In particular, Primer is constructed by a hybrid cryptographic protocol optimized for attention-based Transformer models, as well as techniques including computation merge and tokens-first ciphertext packing. Comprehensive experiments on encrypted language modeling show that Primer achieves state-of-the-art accuracy and reduces the inference latency by 90.6% similar to 97.5% over previous methods.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] An Encrypted Field Locating Algorithm for Private Protocol Data Based on Data Reconstruction and Moment Eigenvector
    Li, Qing
    Ju, Yonghui
    Zhao, Chang
    He, Xintai
    IEEE ACCESS, 2021, 9 : 42947 - 42958
  • [42] Fast phylogenetic inference from typing data
    João A. Carriço
    Maxime Crochemore
    Alexandre P. Francisco
    Solon P. Pissis
    Bruno Ribeiro-Gonçalves
    Cátia Vaz
    Algorithms for Molecular Biology, 13
  • [43] Fast phylogenetic inference from typing data
    Carrico, Joao A.
    Crochemore, Maxime
    Francisco, Alexandre P.
    Pissis, Solon P.
    Ribeiro-Goncalves, Bruno
    Vaz, Catia
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2018, 13
  • [44] Fast approximate inference for multivariate longitudinal data
    Hughes, David M.
    Garcia-Finana, Marta
    Wand, Matt P.
    BIOSTATISTICS, 2022, 24 (01) : 177 - 192
  • [45] PriML: An Electro-Optical Accelerator for Private Machine Learning on Encrypted Data
    Zheng, Mengxin
    Chen, Fan
    Jiang, Lei
    Lou, Qian
    2023 24TH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN, ISQED, 2023, : 476 - 482
  • [46] Fast keyword search over encrypted data with short ciphertext in clouds
    Tseng, Yi-Fan
    Fan, Chun-, I
    Liu, Zi-Cheng
    JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2022, 70
  • [47] Fast Privacy-Preserving Keyword Search on Encrypted Outsourced Data
    Wodi, Bryan H.
    Leung, Carson K.
    Cuzzocrea, Alfredo
    Ourav, S.
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019,
  • [48] Fast Multi-keywords Search over Encrypted Cloud Data
    Hong, Cheng
    Li, Yifu
    Zhang, Min
    Feng, Dengguo
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2016, PT I, 2016, 10041 : 433 - 446
  • [49] TRQED: Secure and Fast Tree-Based Private Range Queries over Encrypted Cloud
    Yang, Wei
    Xu, Yang
    Nie, Yiwen
    Shen, Yao
    Huang, Liusheng
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2018), PT II, 2018, 10828 : 130 - 146
  • [50] Glyph: Fast and Accurately Training Deep Neural Networks on Encrypted Data
    Lou, Qian
    Feng, Bo
    Fox, Geoffrey C.
    Jiang, Lei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33