VFFG: Verifiable Privacy-Enhanced Federated Fine-Tuning for GPT Service

被引:0
|
作者
Bian, Mingyun [1 ,2 ]
Ren, Yanli [1 ]
He, Guanghui [1 ]
Feng, Guorui [1 ]
Zhang, Xinpeng [1 ]
机构
[1] Shanghai Univ, Sch Commun & Informat Engn, Shanghai 200444, Peoples R China
[2] Qufu Normal Univ, Schoolof Cyber Sci & Engn, Qufu 273165, Peoples R China
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
Data models; Computational modeling; Training; Data privacy; Servers; Polynomials; Homomorphic encryption; Federated learning; Degradation; Transformers; fine-tuning; generative pre-trained transformer; homomorphic encryption; verifiability;
D O I
10.1109/TETCI.2024.3502411
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays, the generative pre-trained transformer (GPT) models with intrinsic traits have been widely employed in tackling a variety of natural language process tasks. Federated learning facilitates collaborative learning across isolated data silos, entailing risks to sensitive data and proprietary models. Prior works on secure GPT-2 services focused on protect confidential data at the cost of utility degradation, leaving fine-tuned models and feedback results vulnerable to malicious server. To accomplish a higher level of security preservation while maintaining model utility, we design the first verifiable privacy-enhanced federated GPT-2 fine-tuning system (VFFG) with dropout-resilience. VFFG leverages homomorphic encryption and pseudorandom techniques to ensure the privacy of local sensitive data and fine-tuned model parameters while also guaranteeing the reliability of feedback results to resist the tampering attacks. Security analysis theoretically proves that VFFG obtains a higher privacy level compared to previous works and a constant complexity of verification. Extensive evaluations on three types of large language models and four public datasets indicate that VFFG quantitatively outperforms the related work under multiple evaluation criteria.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Scaling Federated Learning for Fine-Tuning of Large Language Models
    Hilmkil, Agrin
    Callh, Sebastian
    Barbieri, Matteo
    Sutfeld, Leon Rene
    Zec, Edvin Listo
    Mogren, Olof
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2021), 2021, 12801 : 15 - 23
  • [42] RR-LADP: A Privacy-Enhanced Federated Learning Scheme for Internet of Everything
    Li, Zerui
    Tian, Yuchen
    Liao, Qing
    Zhang, Weizhe
    Liu, Yang
    Du, Xiaojiang
    Guizani, Mohsen
    IEEE CONSUMER ELECTRONICS MAGAZINE, 2021, 10 (05) : 93 - 101
  • [43] PCFed: Privacy-Enhanced and Communication-Efficient Federated Learning for Industrial IoTs
    Han, Qing
    Yang, Shusen
    Ren, Xuebin
    Zhao, Peng
    Zhao, Cong
    Wang, Yimeng
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (09) : 6181 - 6191
  • [44] SPEFL: Efficient Security and Privacy-Enhanced Federated Learning Against Poisoning Attacks
    Shen, Liyan
    Ke, Zhenhan
    Shi, Jinqiao
    Zhang, Xi
    Sun, Yanwei
    Zhao, Jiapeng
    Wang, Xuebin
    Zhao, Xiaojie
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (08): : 13437 - 13451
  • [45] Fault diagnosis of intelligent distribution system based on privacy-enhanced federated learning
    陈益芳
    SUN Zhiqing
    XUAN Yi
    LOU Yinan
    WANG Qifeng
    GUO Fanghong
    High Technology Letters, 2024, 30 (04) : 424 - 432
  • [46] Fault diagnosis of intelligent distribution system based on privacy-enhanced federated learning
    Chen, Yifang
    Sun, Zhiqing
    Xuan, Yi
    Lou, Yinan
    Wang, Qifeng
    Guo, Fanghong
    High Technology Letters, 2024, 30 (04) : 424 - 432
  • [47] Short Answer Questions Generation by Fine-Tuning BERT and GPT-2
    Tsai, Danny C. L.
    Chang, Willy J. W.
    Yang, Stephen J. H.
    29TH INTERNATIONAL CONFERENCE ON COMPUTERS IN EDUCATION (ICCE 2021), VOL II, 2021, : 508 - 514
  • [48] Privacy-Enhanced Deniable Authentication E-Mail Service
    Ki, JuHee
    Hwang, Jung Yeong
    Nyang, DaeHun
    Lee, Dong Hoon
    Lim, Jongin
    DIGITAL ENTERPRISE AND INFORMATION SYSTEMS, 2011, 194 : 16 - +
  • [49] FedSelect: Personalized Federated Learning with Customized Selection of Parameters for Fine-Tuning
    Tamirisa, Rishub
    Xie, Chulin
    Bao, Wenxuan
    Zhou, Andy
    Arel, Ron
    Shamsian, Aviv
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 23985 - 23994
  • [50] Federated Fine-Tuning of LLMs on the Very Edge: The Good, the Bad, the Ugly
    Woisetschlaeger, Herbert
    Erben, Alexander
    Wang, Shiqiang
    Mayer, Ruben
    Jacobsen, Hans-Arno
    PROCEEDINGS OF THE 8TH WORKSHOP ON DATA MANAGEMENT FOR END-TO-END MACHINE LEARNING, DEEM 2024, 2024,