BOLT: Privacy-Preserving, Accurate and Efficient Inference for Transformers

Cited by: 2
Authors
Pang, Qi [1]
Zhu, Jinhao [2]
Moellering, Helen M. [3]
Zheng, Wenting [1]
Schneider, Thomas [3]
Affiliations
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Univ Calif Berkeley, Berkeley, CA USA
[3] Tech Univ Darmstadt, Darmstadt, Germany
Funding
European Union Horizon 2020
Keywords
secure multi-party computation; homomorphic encryption; secure machine learning inference; transformer
DOI
10.1109/SP54263.2024.00130
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The advent of transformers has brought about significant advancements in traditional machine learning tasks. However, their pervasive deployment has raised concerns about the potential leakage of sensitive information during inference. Existing approaches using secure multiparty computation (MPC) face limitations when applied to transformers due to the extensive model size and resource-intensive matrix-matrix multiplications. In this paper, we present BOLT, a privacy-preserving inference framework for transformer models that supports efficient matrix multiplications and nonlinear computations. Combined with our novel machine learning optimizations, BOLT reduces the communication cost by 10.91x. Our evaluation on diverse datasets demonstrates that BOLT maintains comparable accuracy to floating-point models and achieves 4.8-9.5x faster inference across various network settings compared to the state-of-the-art system.
Pages: 4753-4771
Number of pages: 19