Primer: Fast Private Transformer Inference on Encrypted Data

被引:1
|
作者
Zheng, Mengxin [1 ]
Lou, Qian [2 ]
Jiang, Lei [1 ]
机构
[1] Indiana Univ, Bloomington, IN 47405 USA
[2] Univ Cent Florida, Orlando, FL 32816 USA
来源
2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC | 2023年
关键词
Fully Homomorphic Encryption; Multi-party Computation; Transformer; Cryptographic Protocol; Private Inference;
D O I
10.1109/DAC56929.2023.10247719
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is increasingly important to enable privacy-preserving inference for cloud services based on Transformers. Post-quantum cryptographic techniques, e.g., fully homomorphic encryption (FHE), and multi-party computation (MPC), are popular methods to support private Transformer inference. However, existing works still suffer from prohibitively computational and communicational overhead. In this work, we present, Primer, to enable a fast and accurate Transformer over encrypted data for natural language processing tasks. In particular, Primer is constructed by a hybrid cryptographic protocol optimized for attention-based Transformer models, as well as techniques including computation merge and tokens-first ciphertext packing. Comprehensive experiments on encrypted language modeling show that Primer achieves state-of-the-art accuracy and reduces the inference latency by 90.6% similar to 97.5% over previous methods.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] SecureTLM: Private inference for transformer-based large model with MPC
    Chen, Yuntian
    Meng, Xianjia
    Shi, Zhiying
    Ning, Zhiyuan
    Lin, Jingzhi
    INFORMATION SCIENCES, 2024, 667
  • [32] SentinelLMs: Encrypted Input Adaptation and Fine-Tuning of Language Models for Private and Secure Inference
    Mishra, Abhijit
    Li, Mingda
    Deo, Soham
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 19, 2024, : 21403 - 21411
  • [33] AutoReP: Automatic ReLU Replacement for Fast Private Network Inference
    Peng, Hongwu
    Huang, Shaoyi
    Zhou, Tong
    Luo, Yukui
    Wang, Chenghong
    Wang, Zigeng
    Zhao, Jiahui
    Xie, Xi
    Li, Ang
    Geng, Tony
    Mahmood, Kaleel
    Wen, Wujie
    Xu, Xiaolin
    Ding, Caiwen
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 5155 - 5165
  • [34] Cloud based private data analytic using secure computation over encrypted data
    Zaraket, Christiana
    Hariss, Khalil
    Chamoun, Maroun
    Nicolas, Tony
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (08) : 4931 - 4942
  • [35] Authorized Private Keyword Search over Encrypted Data in Cloud Computing
    Li, Ming
    Yu, Shucheng
    Cao, Ning
    Lou, Wenjing
    31ST INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2011), 2011, : 383 - 392
  • [36] Characterization of MPC-based Private Inference for Transformer-based Models
    Wang, Yongqin
    Edward, G.
    Xiong, Wenjie
    Lefaudeux, Benjamin
    Knott, Brian
    Annavaram, Murali
    Lee, Hsien-Hsin S.
    2022 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS 2022), 2022, : 187 - 197
  • [37] SHE: A Fast and Accurate Deep Neural Network for Encrypted Data
    Lou, Qian
    Jiang, Lei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [38] APPCLASSIFIER: Automated App Inference on Encrypted Traffic via Meta Data Analysis
    Xiang, Chong
    Chen, Qingrong
    Xue, Minhui
    Zhu, Haojin
    2018 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2018,
  • [39] Edge-Assisted CNN Inference over Encrypted Data for Internet of Things
    Tian, Yifan
    Yuan, Jiawei
    Yu, Shucheng
    Hou, Yantian
    Song, Houbing
    SECURITY AND PRIVACY IN COMMUNICATION NETWORKS, SECURECOMM, PT I, 2019, 304 : 85 - 104
  • [40] Differentially-Private Data Aggregation over Encrypted Location Data for Range Counting Query
    Sasada, Taisho
    Kaaniche, Nesrine
    Laurent, Maryline
    Taenaka, Yuzo
    Kadobayashi, Youki
    38TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING, ICOIN 2024, 2024, : 409 - 414