FedAds: A Benchmark for Privacy-Preserving CVR Estimation with Vertical Federated Learning

被引:3
|
作者
Wei, Penghui [1 ]
Dou, Hongjian [1 ]
Liu, Shaoguo [1 ]
Tang, Rongjun [2 ]
Liu, Li [3 ]
Wang, Liang [1 ]
Zheng, Bo [1 ]
机构
[1] Alibaba Grp, Hangzhou, Peoples R China
[2] Chinese Univ Hong Kong, Shenzhen, Peoples R China
[3] Hong Kong Univ Sci & Technol Guangzhou, Thrust Artificial Intelligence, Guangzhou, Peoples R China
关键词
Ad Ranking; Vertical Federated Learning; Deep Generative Model;
D O I
10.1145/3539618.3591909
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Conversion rate (CVR) estimation aims to predict the probability of conversion event after a user has clicked an ad. Typically, online publisher has user browsing interests and click feedbacks, while demand-side advertising platform collects users' post-click behaviors such as dwell time and conversion decisions. To estimate CVR accurately and protect data privacy better, vertical federated learning (vFL) is a natural solution to combine two sides' advantages for training models, without exchanging raw data. Both CVR estimation and applied vFL algorithms have attracted increasing research attentions. However, standardized and systematical evaluations are missing: due to the lack of standardized datasets, existing studies adopt public datasets to simulate a vFL setting via hand-crafted feature partition, which brings challenges to fair comparison. We introduce FedAds, the first benchmark for CVR estimation with vFL, to facilitate standardized and systematical evaluations for vFL algorithms. It contains a large-scale real world dataset collected from Alibaba's advertising platform, as well as systematical evaluations for both effectiveness and privacy aspects of various vFL algorithms. Besides, we also explore to incorporate unaligned data in vFL to improve effectiveness, and develop perturbation operations to protect privacy well. We hope that future research work in vFL and CVR estimation benefits from the FedAds benchmark.
引用
收藏
页码:3037 / 3046
页数:10
相关论文
共 50 条
  • [1] Privacy-preserving data fusion for traffic state estimation: A vertical federated learning approach
    Wang, Qiqing
    Yang, Kaidi
    [J]. Transportation Research Part C: Emerging Technologies, 2024, 168
  • [2] ELXGB: An Efficient and Privacy-Preserving XGBoost for Vertical Federated Learning
    Xu, Wei
    Zhu, Hui
    Zheng, Yandong
    Wang, Fengwei
    Zhao, Jiaqi
    Liu, Zhe
    Li, Hui
    [J]. IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (03) : 878 - 892
  • [3] Falcon: A Privacy-Preserving and Interpretable Vertical Federated Learning System
    Wu, Yuncheng
    Xing, Naili
    Chen, Gang
    Dinh, Tien Tuan Anh
    Luo, Zhaojing
    Ooi, Beng Chin
    Xiao, Xiaokui
    Zhang, Meihui
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 16 (10): : 2471 - 2484
  • [4] OpenVFL: A Vertical Federated Learning Framework With Stronger Privacy-Preserving
    Yang, Yunbo
    Chen, Xiang
    Pan, Yuhao
    Shen, Jiachen
    Cao, Zhenfu
    Dong, Xiaolei
    Li, Xiaoguo
    Sun, Jianfei
    Yang, Guomin
    Deng, Robert
    [J]. IEEE Transactions on Information Forensics and Security, 2024, 19 : 9670 - 9681
  • [5] Privacy-Preserving Data Selection for Horizontal and Vertical Federated Learning
    Zhang, Lan
    Li, Anran
    Peng, Hongyi
    Han, Feng
    Huang, Fan
    Li, Xiang-Yang
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2024, 35 (11) : 2054 - 2068
  • [6] Privacy-Preserving Personalized Federated Learning
    Hu, Rui
    Guo, Yuanxiong
    Li, Hongning
    Pei, Qingqi
    Gong, Yanmin
    [J]. ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,
  • [7] Frameworks for Privacy-Preserving Federated Learning
    Phong, Le Trieu
    Phuong, Tran Thi
    Wang, Lihua
    Ozawa, Seiichi
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2024, E107D (01) : 2 - 12
  • [8] Adaptive privacy-preserving federated learning
    Liu, Xiaoyuan
    Li, Hongwei
    Xu, Guowen
    Lu, Rongxing
    He, Miao
    [J]. PEER-TO-PEER NETWORKING AND APPLICATIONS, 2020, 13 (06) : 2356 - 2366
  • [9] Privacy-Preserving and Reliable Federated Learning
    Lu, Yi
    Zhang, Lei
    Wang, Lulu
    Gao, Yuanyuan
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2021, PT III, 2022, 13157 : 346 - 361
  • [10] Federated learning for privacy-preserving AI
    Cheng, Yong
    Liu, Yang
    Chen, Tianjian
    Yang, Qiang
    [J]. COMMUNICATIONS OF THE ACM, 2020, 63 (12) : 33 - 36