SGBoost: An Efficient and Privacy-Preserving Vertical Federated Tree Boosting Framework

Cited by: 8
Authors
Zhao, Jiaqi [1]
Zhu, Hui [1]
Xu, Wei [1]
Wang, Fengwei [1]
Lu, Rongxing [2]
Li, Hui [1]
Affiliations
[1] Xidian Univ, Sch Cyber Engn, Xian 710126, Shaanxi, Peoples R China
[2] Univ New Brunswick, Fac Comp Sci, Fredericton, NB E3B 5A3, Canada
Funding
National Natural Science Foundation of China
Keywords
Vertical federated learning; tree boosting; privacy-preserving; efficiency; QUERY;
DOI
10.1109/TIFS.2022.3232955
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
To balance data privacy and availability, Google introduced the concept of federated learning, which constructs global machine learning models over multiple participants while keeping their raw data local. However, the parameters exchanged in traditional federated learning may still reveal information about the data. Moreover, training data are usually partitioned vertically in real-world scenarios, which complicates model construction. To tackle these problems, we propose SGBoost, an efficient and privacy-preserving vertical federated tree boosting framework in which multiple participants can collaboratively train and query a model without staying online all the time. Specifically, we first design secure bucket sharing and best split finding algorithms, with which the global tree model can be constructed over vertically partitioned data while preserving the privacy of the training data. We then design an oblivious query algorithm that uses the trained model without leaking any query data or results. Moreover, SGBoost does not require multi-round interactions between participants, which significantly improves system efficiency. Detailed security analysis shows that SGBoost protects the privacy of raw data, weights, buckets, and split information. Extensive experiments demonstrate that SGBoost achieves accuracy comparable to centralized training while remaining efficient.
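As a rough illustration of what "best split finding" over buckets means in tree boosting, the sketch below scans per-bucket gradient and Hessian sums and picks the boundary with the highest gain. It is a plaintext sketch only: it assumes the standard XGBoost-style gain formula, which the abstract does not confirm SGBoost uses, and it omits the secure bucket sharing and all cryptographic protection the framework provides. The function name `best_split` and the parameters `lam` and `gamma` are hypothetical.

```python
from typing import List, Tuple


def best_split(grad_buckets: List[float], hess_buckets: List[float],
               lam: float = 1.0, gamma: float = 0.0) -> Tuple[int, float]:
    """Return (bucket index, gain) of the best split over per-bucket sums.

    A split at index k sends buckets 0..k to the left child and the
    remaining buckets to the right child. Returns (-1, 0.0) when no
    candidate split has positive gain.

    Assumption: XGBoost-style gain with L2 regularisation `lam` and
    per-leaf penalty `gamma`; this is NOT taken from the SGBoost paper.
    """
    total_g = sum(grad_buckets)
    total_h = sum(hess_buckets)
    parent_score = total_g * total_g / (total_h + lam)

    best_k, best_gain = -1, 0.0
    g_left = h_left = 0.0
    # The last bucket is excluded: splitting there leaves the right child empty.
    for k in range(len(grad_buckets) - 1):
        g_left += grad_buckets[k]
        h_left += hess_buckets[k]
        g_right = total_g - g_left
        h_right = total_h - h_left
        gain = 0.5 * (g_left ** 2 / (h_left + lam)
                      + g_right ** 2 / (h_right + lam)
                      - parent_score) - gamma
        if gain > best_gain:
            best_k, best_gain = k, gain
    return best_k, best_gain


if __name__ == "__main__":
    # Four buckets of one feature; the gradients change sign between buckets 1 and 2.
    print(best_split([-2.0, -2.0, 2.0, 2.0], [1.0, 1.0, 1.0, 1.0]))
```

In a vertical setting each participant holds different features, so a scan of this kind would run once per feature; how SGBoost shares the bucket sums securely is described in the paper itself, not here.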
Pages: 1022-1036
Page count: 15