FEDBERT: When Federated Learning Meets Pre-training

被引:47
|
作者
Tian, Yuanyishu [1 ]
Wan, Yao [1 ]
Lyu, Lingjuan [2 ]
Yao, Dezhong [1 ]
Jin, Hai [1 ]
Sun, Lichao [3 ]
机构
[1] Huazhong Univ Sci & Technol, Serv Comp Technol & Syst Lab, Natl Engn Res Ctr Big Data Technol & Syst, Sch Comp Sci & Technol,Cluster & Grid Comp Lab, 1037 Luoyu Rd, Wuhan 430074, Peoples R China
[2] Sony AI, Minato Ku, 1-7-1 Konan, Tokyo, Japan
[3] Lehigh Univ, 113 Res Dr, Bethlehem, PA 18015 USA
基金
中国国家自然科学基金;
关键词
Federated learning; pre-training; BERT; NLP;
D O I
10.1145/3510033
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The fast growth of pre-trained models (PTMs) has brought natural language processing to a new era, which has become a dominant technique for various natural language processing (NLP) applications. Every user can download the weights of PTMs, then fine-tune the weights for a task on the local side. However, the pre-training of a model relies heavily on accessing a large-scale of training data and requires a vast amount of computing resources. These strict requirements make it impossible for any single client to pre-train such a model. To grant clients with limited computing capability to participate in pre-training a large model, we propose a new learning approach, FEDBERT, that takes advantage of the federated learning and split learning approaches, resorting to pre-training BERT in a federated way. FEDBERT can prevent sharing the raw data information and obtain excellent performance. Extensive experiments on seven GLUE tasks demonstrate that FEDBERT can maintain its effectiveness without communicating to the sensitive local data of clients.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] CyclicFL: Efficient Federated Learning with Cyclic Model Pre-Training
    Zhang, Pengyu
    Zhou, Yingbo
    Hu, Ming
    Wei, Xian
    Chen, Mingsong
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2025,
  • [2] FEDBFPT: An Efficient Federated Learning Framework for BERT Further Pre-training
    Wang, Xin'ao
    Li, Huan
    Chen, Ke
    Shou, Lidan
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 4344 - 4352
  • [3] Pre-Training Model and Client Selection Optimization for Improving Federated Learning Efficiency
    Ge, Bingchen
    Zhou, Ying
    Xie, Liping
    Kou, Lirong
    2024 9TH INTERNATIONAL CONFERENCE ON ELECTRONIC TECHNOLOGY AND INFORMATION SCIENCE, ICETIS 2024, 2024, : 650 - 660
  • [4] Lottery Hypothesis based Unsupervised Pre-training for Model Compression in Federated Learning
    Itahara, Sohei
    Nishio, Takayuki
    Morikura, Masahiro
    Yamamoto, Koji
    2020 IEEE 92ND VEHICULAR TECHNOLOGY CONFERENCE (VTC2020-FALL), 2020,
  • [5] Non-Contrastive Learning Meets Language-Image Pre-Training
    Zhou, Jinghao
    Dong, Li
    Gan, Zhe
    Wang, Lijuan
    Wei, Furu
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11028 - 11038
  • [6] SplitFed: When Federated Learning Meets Split Learning
    Thapa, Chandra
    Arachchige, Pathum Chamikara Mahawaga
    Camtepe, Seyit
    Sun, Lichao
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 8485 - 8493
  • [7] PEPT: Expert Finding Meets Personalized Pre-Training
    Peng, Qiyao
    Xu, Hongyan
    Wang, Yinghui
    Liu, Hongtao
    Huo, Cuiying
    Wang, Wenjun
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2024, 43 (01)
  • [8] When Blockchain Meets Asynchronous Federated Learning
    Jing, Rui
    Chen, Wei
    Wu, Xiaoxin
    Wang, Zehua
    Tian, Zijian
    Zhang, Fan
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT IX, ICIC 2024, 2024, 14870 : 199 - 207
  • [9] When Decentralized Optimization Meets Federated Learning
    Gao, Hongchang
    Thai, My T.
    Wu, Jie
    IEEE NETWORK, 2023, 37 (05): : 233 - 239
  • [10] WHEN FEDERATED LEARNING MEETS KNOWLEDGE DISTILLATION
    Pang, Xiaoyi
    Hu, Jiahui
    Sun, Peng
    Ren, Ju
    Wang, Zhibo
    IEEE WIRELESS COMMUNICATIONS, 2024, 31 (05) : 208 - 214