BOLT: Privacy-Preserving, Accurate and Efficient Inference for Transformers

被引:2
|
作者
Pang, Qi [1 ]
Zhu, Jinhao [2 ]
Moellering, Helen M. [3 ]
Zheng, Wenting [1 ]
Schneider, Thomas [3 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Univ Calif Berkeley, Berkeley, CA USA
[3] Tech Univ Darmstadt, Darmstadt, Germany
基金
欧盟地平线“2020”;
关键词
secure multi-party computation; homomorphic encryption; secure machine learning inference; transformer;
D O I
10.1109/SP54263.2024.00130
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The advent of transformers has brought about significant advancements in traditional machine learning tasks. However, their pervasive deployment has raised concerns about the potential leakage of sensitive information during inference. Existing approaches using secure multiparty computation (MPC) face limitations when applied to transformers due to the extensive model size and resource-intensive matrix-matrix multiplications. In this paper, we present BOLT, a privacy-preserving inference framework for transformer models that supports efficient matrix multiplications and nonlinear computations. Combined with our novel machine learning optimizations, BOLT reduces the communication cost by 10.91x. Our evaluation on diverse datasets demonstrates that BOLT maintains comparable accuracy to floating-point models and achieves 4.8-9.5x faster inference across various network settings compared to the state-of-the-art system.
引用
收藏
页码:4753 / 4771
页数:19
相关论文
共 50 条
  • [1] Accurate and efficient privacy-preserving string matching
    Sirintra Vaiwsri
    Thilina Ranbaduge
    Peter Christen
    International Journal of Data Science and Analytics, 2022, 14 : 191 - 215
  • [2] Accurate and efficient privacy-preserving string matching
    Vaiwsri, Sirintra
    Ranbaduge, Thilina
    Christen, Peter
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2022, 14 (02) : 191 - 215
  • [3] Efficient Privacy-Preserving Inference Outsourcing for Convolutional Neural Networks
    Yang, Xuanang
    Chen, Jing
    He, Kun
    Bai, Hao
    Wu, Cong
    Du, Ruiying
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 4815 - 4829
  • [4] EPIDL: Towards efficient and privacy-preserving inference in deep learning
    Nie, Chenfei
    Zhou, Zhipeng
    Dong, Mianxiong
    Ota, Kaoru
    Li, Qiang
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (14):
  • [5] PPCNN: An efficient privacy-preserving CNN training and inference framework
    Zhao, Fan
    Li, Zhi
    Wang, Hao
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (12) : 10988 - 11018
  • [6] A Privacy-preserving Framework for Rank Inference
    Gao, Yunpeng
    Yan, Tong
    Zhang, Nan
    2017 1ST IEEE SYMPOSIUM ON PRIVACY-AWARE COMPUTING (PAC), 2017, : 180 - 181
  • [7] Privacy-Preserving Deep Learning and Inference
    Riazi, M. Sadegh
    Koushanfar, Farinaz
    2018 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD) DIGEST OF TECHNICAL PAPERS, 2018,
  • [8] Privacy-Preserving Inference in Crowdsourcing Systems
    Xiang, Liyao
    Li, Baochun
    Li, Bo
    2017 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY (CNS), 2017, : 1 - 9
  • [9] SecFormer: Fast and Accurate Privacy-Preserving Inference for Transformer Models via SMPC
    Luo, Jinglong
    Zhang, Yehong
    Zhang, Zhuo
    Zhang, Jiaqi
    Mu, Xin
    Wang, Hui
    Yu, Yue
    Xu, Zenglin
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 13333 - 13348
  • [10] Efficient Privacy-Preserving Federated Learning Against Inference Attacks for IoT
    Miao, Yifeng
    Chen, Siguang
    2023 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC, 2023,