Risk-Aware Reinforcement Learning-Based Federated Learning for IoV Systems

被引:0
|
作者
Lu, Xiaozhen [1 ,2 ]
Liu, Zhibo [1 ]
Chen, Yuhan [1 ]
Xiao, Liang [3 ]
Wang, Wei [4 ]
Wu, Qihui [4 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 210016, Peoples R China
[2] Minist Ind & Informat Technol, Key Lab Safety Crit Software Dev & Verificat, Nanjing 210016, Peoples R China
[3] Xiamen Univ, Dept Informat & Commun Engn, Xiamen 361005, Peoples R China
[4] Nanjing Univ Aeronaut & Astronaut, Coll Elect & Informat Engn, Nanjing 210016, Peoples R China
基金
中国国家自然科学基金;
关键词
Training; Accuracy; Computational modeling; Fuzzy logic; Training data; Task analysis; Quality of service; Federated learning; Internet of Vehicles; reinforcement learning; selfish node;
D O I
10.1109/TMC.2024.3447034
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Federated learning (FL) that improves data privacy reduces the computational overhead for Internet of Vehicles (IoV) systems but has difficulty in defending against selfish attacks due to the restricted quality of service requirements and the high mobility of vehicles. In this paper, we design a risk-aware hierarchical reinforcement learning-based FL framework for IoV to resist selfish attacks. By designing a two-level hierarchical policy selection module that consists of two deep neural networks, this framework divides the training policy into two sub-policies, i.e., the selection of FL participants and the corresponding local training data size, which are chosen based on the previous training performance and vehicle participation performance. This framework designs a risk-aware safety guide to avoid dangerous states such as local task failure resulting from risky training policies. Specifically, the guide uses a warning signal to evaluate the short-term risk of each state-action pair, applies an R-network to estimate the long-term risks for modifying the chosen training policy, and designs a punishment function for the modified training policy to revise the immediate reward to further enhance the safe exploration. We analyze the convergence performance and computational complexity of our scheme. Experimental results on MNIST, CIFAR-10, and Stanford Cars datasets verify the effectiveness of our scheme, including the global model accuracy, training latency, detection success rate, and convergence speed compared with the benchmarks FedAvg, MFL, DQNPS, and SHRL.
引用
收藏
页码:14672 / 14688
页数:17
相关论文
共 50 条
  • [31] DeepScalper: A Risk-Aware Reinforcement Learning Framework to Capture Fleeting Intraday Trading Opportunities
    Sun, Shuo
    Xue, Wanqi
    Wang, Rundong
    He, Xu
    Zhu, Junlei
    Li, Jian
    An, Bo
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 1858 - 1867
  • [32] Dependency-aware online task offloading based on deep reinforcement learning for IoV
    Liu, Chunhong
    Wang, Huaichen
    Zhao, Mengdi
    Liu, Jialei
    Zhao, Xiaoyan
    Yuan, Peiyan
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2024, 13 (01):
  • [33] A Data Sharing Scheme Based on Federated Learning in IoV
    Hu, Xiaoya
    Li, Ruiqin
    Wang, Licheng
    Ning, Yuqiao
    Ota, Kaoru
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (09) : 11644 - 11656
  • [34] Testing the Plasticity of Reinforcement Learning-based Systems
    Biagiola, Matteo
    Tonella, Paolo
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2022, 31 (04)
  • [35] Federated Deep Reinforcement Learning-Based Multi-UAV Navigation for Heterogeneous NOMA Systems
    Rezwan, Sifat
    Chun, Chanjun
    Choi, Wooyeol
    IEEE SENSORS JOURNAL, 2023, 23 (23) : 29722 - 29732
  • [36] A Learning-Based Incentive Mechanism for Federated Learning
    Zhan, Yufeng
    Li, Peng
    Qu, Zhihao
    Zeng, Deze
    Guo, Song
    IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (07): : 6360 - 6368
  • [37] FRL-FI: Transient Fault Analysis for Federated Reinforcement Learning-Based Navigation Systems
    Wan, Zishen
    Anwar, Aqeel
    Mahmoud, Abdulrahman
    Jia, Tianyu
    Iisiao, Yu-Shun
    Reddi, Vijay Janapa
    Raychowdhury, Arijit
    PROCEEDINGS OF THE 2022 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2022), 2022, : 430 - 435
  • [38] Deep Learning Quadcopter Control via Risk-Aware Active Learning
    Andersson, Olov
    Wzorek, Mariusz
    Doherty, Patrick
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3812 - 3818
  • [39] A Federated Learning and Deep Reinforcement Learning-Based Method with Two Types of Agents for Computation Offload
    Liu, Song
    Yang, Shiyuan
    Zhang, Hanze
    Wu, Weiguo
    SENSORS, 2023, 23 (04)
  • [40] Deep Reinforcement Learning-Based Resource Allocation for UAV-Enabled Federated Edge Learning
    Liu T.
    Zhang T.K.
    Loo J.
    Wang Y.P.
    Journal of Communications and Information Networks, 2023, 8 (01)