FEDBERT: When Federated Learning Meets Pre-training

被引:47
|
作者
Tian, Yuanyishu [1 ]
Wan, Yao [1 ]
Lyu, Lingjuan [2 ]
Yao, Dezhong [1 ]
Jin, Hai [1 ]
Sun, Lichao [3 ]
机构
[1] Huazhong Univ Sci & Technol, Serv Comp Technol & Syst Lab, Natl Engn Res Ctr Big Data Technol & Syst, Sch Comp Sci & Technol,Cluster & Grid Comp Lab, 1037 Luoyu Rd, Wuhan 430074, Peoples R China
[2] Sony AI, Minato Ku, 1-7-1 Konan, Tokyo, Japan
[3] Lehigh Univ, 113 Res Dr, Bethlehem, PA 18015 USA
基金
中国国家自然科学基金;
关键词
Federated learning; pre-training; BERT; NLP;
D O I
10.1145/3510033
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The fast growth of pre-trained models (PTMs) has brought natural language processing to a new era, which has become a dominant technique for various natural language processing (NLP) applications. Every user can download the weights of PTMs, then fine-tune the weights for a task on the local side. However, the pre-training of a model relies heavily on accessing a large-scale of training data and requires a vast amount of computing resources. These strict requirements make it impossible for any single client to pre-train such a model. To grant clients with limited computing capability to participate in pre-training a large model, we propose a new learning approach, FEDBERT, that takes advantage of the federated learning and split learning approaches, resorting to pre-training BERT in a federated way. FEDBERT can prevent sharing the raw data information and obtain excellent performance. Extensive experiments on seven GLUE tasks demonstrate that FEDBERT can maintain its effectiveness without communicating to the sensitive local data of clients.
引用
收藏
页数:26
相关论文
共 50 条
  • [11] Robot Learning with Sensorimotor Pre-training
    Radosavovic, Ilija
    Shi, Baifeng
    Fu, Letian
    Goldberg, Ken
    Darrell, Trevor
    Malik, Jitendra
    CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [12] FedELR: When federated learning meets learning with noisy labels
    Pu, Ruizhi
    Yu, Lixing
    Xu, Gezheng
    Zhou, Fan
    Ling, Charles X.
    Wang, Boyu
    NEURAL NETWORKS, 2025, 187
  • [13] Pre-training meets iteration: Learning for robust 3D point cloud denoising
    Quan, Siwen
    Zhao, Hebin
    Zeng, Zhao
    Nie, Ziming
    Yang, Jiaqi
    PATTERN RECOGNITION LETTERS, 2025, 190 : 105 - 110
  • [14] When Federated Learning Meets Blockchain: A New Distributed Learning Paradigm
    Ma, Chuan
    Li, Jun
    Shi, Long
    Ding, Ming
    Wang, Taotao
    Han, Zhu
    Poor, H. Vincent
    IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2022, 17 (03) : 26 - 33
  • [15] Learning to Sample Replacements for ELECTRA Pre-Training
    Hao, Yaru
    Dong, Li
    Bao, Hangbo
    Xu, Ke
    Wei, Furu
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 4495 - 4506
  • [16] Meta-Learning to Improve Pre-Training
    Raghu, Aniruddh
    Lorraine, Jonathan
    Kornblith, Simon
    McDermott, Matthew
    Duvenaud, David
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [17] Robust Pre-Training by Adversarial Contrastive Learning
    Jiang, Ziyu
    Chen, Tianlong
    Chen, Ting
    Wang, Zhangyang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [18] Multilingual Pre-training with Universal Dependency Learning
    Sun, Kailai
    Li, Zuchao
    Zhao, Hai
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [19] Learning Chemical Rules of Retrosynthesis with Pre-training
    Jiang, Yinjie
    Wei, Ying
    Wu, Fei
    Huang, Zhengxing
    Kuang, Kun
    Wang, Zhihua
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 4, 2023, : 5113 - 5121
  • [20] Efficient Conditional Pre-training for Transfer Learning
    Chakraborty, Shuvam
    Uzkent, Burak
    Ayush, Kumar
    Tanmay, Kumar
    Sheehan, Evan
    Ermon, Stefano
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4240 - 4249