SGBoost: An Efficient and Privacy-Preserving Vertical Federated Tree Boosting Framework

Cited by: 8

Authors:

Zhao, Jiaqi [1]

Zhu, Hui [1]

Xu, Wei [1]

Wang, Fengwei [1]

Lu, Rongxing [2]

Li, Hui [1]

Affiliations:

[1] Xidian Univ, Sch Cyber Engn, Xian 710126, Shaanxi, Peoples R China

[2] Univ New Brunswick, Fac Comp Sci, Fredericton, NB E3B 5A3, Canada

Source:

IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY | 2023 / Vol. 18

Funding:

National Natural Science Foundation of China;

Keywords:

Vertical federated learning; tree boosting; privacy-preserving; efficiency; QUERY;

DOI:

10.1109/TIFS.2022.3232955

Chinese Library Classification:

TP301 [Theory, Methods];

Discipline classification code:

081202;

Abstract:

Aiming at balancing data privacy and availability, Google introduces the concept of federated learning, which can construct global machine learning models over multiple participants while keeping their raw data localized. However, the exchanged parameters in traditional federated learning may still reveal the data information. Meanwhile, the training data are usually partitioned vertically in real-world scenes, which causes difficulties in model construction. To tackle these problems, in this paper, we propose an efficient and privacy-preserving vertical federated tree boosting framework, namely SGBoost, where multiple participants can collaboratively perform model training and query without staying online all the time. Specifically, we first design secure bucket sharing and best split finding algorithms, with which the global tree model can be constructed over vertically partitioned data; meanwhile, the privacy of training data can be well guaranteed. Then, we design an oblivious query algorithm to utilize the trained model without leaking any query data or results. Moreover, SGBoost does not require multi-round interactions between participants, significantly improving the system efficiency. Detailed security analysis shows that SGBoost can well guarantee the privacy of raw data, weights, buckets, and split information. Extensive experiments demonstrate that SGBoost can achieve high accuracy comparable to centralized training and efficient performance.
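To illustrate what bucket-based best split finding computes, the following is a minimal plaintext sketch in Python. It is not the SGBoost protocol: the abstract does not give the gain formula or the secure bucket sharing steps, so this sketch assumes the standard second-order (XGBoost-style) split gain and works on unprotected per-bucket gradient and Hessian sums. The function name `best_split`, its parameters, and the regularization constant `lam` are hypothetical. In a vertical setting each feature's buckets would be held by a different participant.

```python
from typing import List, Tuple


def best_split(
    grad_buckets: List[List[float]],
    hess_buckets: List[List[float]],
    lam: float = 1.0,
) -> Tuple[int, int, float]:
    """Return (feature_index, bucket_index, gain) of the best split.

    grad_buckets[f][b] and hess_buckets[f][b] are the sums of first- and
    second-order gradients of the samples falling in bucket b of feature f.
    Assumption: XGBoost-style gain, computed in plaintext (no privacy).
    A split after bucket b sends buckets 0..b left and the rest right.
    """
    best = (-1, -1, 0.0)
    for f, (g_b, h_b) in enumerate(zip(grad_buckets, hess_buckets)):
        g_total, h_total = sum(g_b), sum(h_b)
        parent_score = g_total ** 2 / (h_total + lam)
        g_left = h_left = 0.0
        # The last bucket is skipped: splitting after it leaves the right side empty.
        for b in range(len(g_b) - 1):
            g_left += g_b[b]
            h_left += h_b[b]
            g_right = g_total - g_left
            h_right = h_total - h_left
            gain = 0.5 * (
                g_left ** 2 / (h_left + lam)
                + g_right ** 2 / (h_right + lam)
                - parent_score
            )
            if gain > best[2]:
                best = (f, b, gain)
    return best


# Two features, four buckets each (made-up numbers for illustration only).
grads = [[-2.0, -2.0, 2.0, 2.0], [1.0, -1.0, 1.0, -1.0]]
hess = [[1.0, 1.0, 1.0, 1.0], [1.0, 1.0, 1.0, 1.0]]
feature, bucket, gain = best_split(grads, hess)
```

In this toy input the first feature separates negative from positive gradients cleanly between its second and third buckets, so the search selects that split. A privacy-preserving framework would have to obtain the same decision without revealing the bucket sums or the chosen split to the other participants.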


Pages: 1022 - 1036

Page count: 15

50 items in total

[1] VFLR: An Efficient and Privacy-Preserving Vertical Federated Framework for Logistic Regression
Zhao, Jiaqi
Zhu, Hui
Wang, Fengwei
Lu, Rongxing
Wang, Ermei
Li, Linfeng
Li, Hui
IEEE TRANSACTIONS ON CLOUD COMPUTING, 2023, 11 (04) : 3326 - 3340
[2] Privacy-preserving gradient boosting tree: Vertical federated learning for collaborative bearing fault diagnosis
Xia, Liqiao
Zheng, Pai
Li, Jinjie
Tang, Wangchujun
Zhang, Xiangying
IET COLLABORATIVE INTELLIGENT MANUFACTURING, 2022, 4 (03) : 208 - 219
[3] A Privacy-preserving Data Alignment Framework for Vertical Federated Learning
Gao, Ying
Xie, Yuxin
Deng, Huanghao
Zhu, Zukun
Zhang, Yiyu
Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2024, 46 (08) : 3419 - 3427
[4] OpenVFL: A Vertical Federated Learning Framework With Stronger Privacy-Preserving
Yang, Yunbo
Chen, Xiang
Pan, Yuhao
Shen, Jiachen
Cao, Zhenfu
Dong, Xiaolei
Li, Xiaoguo
Sun, Jianfei
Yang, Guomin
Deng, Robert
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 9670 - 9681
[5] ELXGB: An Efficient and Privacy-Preserving XGBoost for Vertical Federated Learning
Xu, Wei
Zhu, Hui
Zheng, Yandong
Wang, Fengwei
Zhao, Jiaqi
Liu, Zhe
Li, Hui
IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (03) : 878 - 892
[6] PEPFL: A framework for a practical and efficient privacy-preserving federated learning
Yange Chen
Baocang Wang
Hang Jiang
Pu Duan
Yuan Ping
Zhiyong Hong
Digital Communications and Networks, 2024, 10 (02) : 355 - 368
[7] Secure Dataset Condensation for Privacy-Preserving and Efficient Vertical Federated Learning
Gao, Dashan
Wu, Canhui
Zhang, Xiaojin
Yao, Xin
Yang, Qiang
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT I, ECML PKDD 2024, 2024, 14941 : 212 - 229
[8] Improving Privacy-Preserving Vertical Federated Learning by Efficient Communication with ADMM
Xie, Chulin
Chen, Pin-Yu
Li, Qinbin
Nourian, Arash
Zhang, Ce
Li, Bo
IEEE CONFERENCE ON SAFE AND TRUSTWORTHY MACHINE LEARNING, SATML 2024, 2024 : 443 - 471
[9] Hercules: Boosting the Performance of Privacy-Preserving Federated Learning
Xu, Guowen
Han, Xingshuo
Xu, Shengmin
Zhang, Tianwei
Li, Hongwei
Huang, Xinyi
Deng, Robert H.
IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2023, 20 (05) : 4418 - 4433
[10] Efficient and Privacy-Preserving Outsourcing of Gradient Boosting Decision Tree Inference
Yuan, Shuai
Li, Hongwei
Qian, Xinyuan
Hao, Meng
Zhai, Yixiao
Xu, Guowen
IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (05) : 2334 - 2348
