Primer: Fast Private Transformer Inference on Encrypted Data

Cited by: 1
Authors
Zheng, Mengxin [1]
Lou, Qian [2]
Jiang, Lei [1]
Affiliations
[1] Indiana Univ, Bloomington, IN 47405 USA
[2] Univ Cent Florida, Orlando, FL 32816 USA
Source
2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC | 2023
Keywords
Fully Homomorphic Encryption; Multi-party Computation; Transformer; Cryptographic Protocol; Private Inference;
DOI
10.1109/DAC56929.2023.10247719
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
It is increasingly important to enable privacy-preserving inference for cloud services based on Transformers. Post-quantum cryptographic techniques, e.g., fully homomorphic encryption (FHE) and multi-party computation (MPC), are popular methods to support private Transformer inference. However, existing works still suffer from prohibitive computational and communication overhead. In this work, we present Primer, which enables fast and accurate Transformer inference over encrypted data for natural language processing tasks. In particular, Primer is built on a hybrid cryptographic protocol optimized for attention-based Transformer models, together with techniques including computation merge and tokens-first ciphertext packing. Comprehensive experiments on encrypted language modeling show that Primer achieves state-of-the-art accuracy and reduces the inference latency by 90.6% to 97.5% over previous methods.
Pages: 6