Risk-Aware Reinforcement Learning-Based Federated Learning for IoV Systems

被引:0
|
作者
Lu, Xiaozhen [1 ,2 ]
Liu, Zhibo [1 ]
Chen, Yuhan [1 ]
Xiao, Liang [3 ]
Wang, Wei [4 ]
Wu, Qihui [4 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 210016, Peoples R China
[2] Minist Ind & Informat Technol, Key Lab Safety Crit Software Dev & Verificat, Nanjing 210016, Peoples R China
[3] Xiamen Univ, Dept Informat & Commun Engn, Xiamen 361005, Peoples R China
[4] Nanjing Univ Aeronaut & Astronaut, Coll Elect & Informat Engn, Nanjing 210016, Peoples R China
基金
中国国家自然科学基金;
关键词
Training; Accuracy; Computational modeling; Fuzzy logic; Training data; Task analysis; Quality of service; Federated learning; Internet of Vehicles; reinforcement learning; selfish node;
D O I
10.1109/TMC.2024.3447034
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Federated learning (FL) that improves data privacy reduces the computational overhead for Internet of Vehicles (IoV) systems but has difficulty in defending against selfish attacks due to the restricted quality of service requirements and the high mobility of vehicles. In this paper, we design a risk-aware hierarchical reinforcement learning-based FL framework for IoV to resist selfish attacks. By designing a two-level hierarchical policy selection module that consists of two deep neural networks, this framework divides the training policy into two sub-policies, i.e., the selection of FL participants and the corresponding local training data size, which are chosen based on the previous training performance and vehicle participation performance. This framework designs a risk-aware safety guide to avoid dangerous states such as local task failure resulting from risky training policies. Specifically, the guide uses a warning signal to evaluate the short-term risk of each state-action pair, applies an R-network to estimate the long-term risks for modifying the chosen training policy, and designs a punishment function for the modified training policy to revise the immediate reward to further enhance the safe exploration. We analyze the convergence performance and computational complexity of our scheme. Experimental results on MNIST, CIFAR-10, and Stanford Cars datasets verify the effectiveness of our scheme, including the global model accuracy, training latency, detection success rate, and convergence speed compared with the benchmarks FedAvg, MFL, DQNPS, and SHRL.
引用
收藏
页码:14672 / 14688
页数:17
相关论文
共 50 条
  • [41] Decentralized Cluster Head Selection in IoV using Federated Deep Reinforcement Learning
    Scott, Chandler
    Khan, Mohammad S.
    Paranjothi, Anirudh
    Li, Joshua Qiang
    2024 IEEE 21ST CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC, 2024,
  • [42] Monte Carlo tree search algorithms for risk-aware and multi-objective reinforcement learning
    Hayes, Conor F.
    Reymond, Mathieu
    Roijers, Diederik M.
    Howley, Enda
    Mannion, Patrick
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2023, 37 (02)
  • [43] ARSL-V: A risk-aware relay selection scheme using reinforcement learning in VANETs
    Liu, Xuejiao
    Wang, Chuanhua
    Huang, Lingfeng
    Xia, Yingjie
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2024, 17 (03) : 1750 - 1767
  • [44] End-to-End Risk-aware Reinforcement Learning to Detect Asymptomatic Cases in Healthcare Facilities
    Thong, Yongjian
    Huang, Weiyu
    Adhikari, Bijaya
    2024 IEEE 12TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS, ICHI 2024, 2024, : 83 - 92
  • [45] Monte Carlo tree search algorithms for risk-aware and multi-objective reinforcement learning
    Conor F. Hayes
    Mathieu Reymond
    Diederik M. Roijers
    Enda Howley
    Patrick Mannion
    Autonomous Agents and Multi-Agent Systems, 2023, 37
  • [46] Machine Learning-Based Risk-Aware Congestion Control Scheme for Minimization of Information Loss in Dense VANET Environment
    Dhakad, Bhupendra
    Shrivastava, Laxmi
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2024, 33 (09)
  • [47] Learning Risk-Aware Costmaps for Traversability in Challenging Environments
    Fan, David D.
    Agha-mohammadi, Ali-akbar
    Theodorou, Evangelos A.
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (01) : 279 - 286
  • [48] Risk-Aware High-level Decisions for Automated Driving at Occluded Intersections with Reinforcement Learning
    Kamran, Danial
    Lopez, Carlos Fernandez
    Lauer, Martin
    Stiller, Christoph
    2020 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2020, : 1205 - 1212
  • [49] Federated Deep Reinforcement Learning-Based Spectrum Access Algorithm With Warranty Contract in Intelligent Transportation Systems
    Zhu, Rongbo
    Li, Mengyao
    Liu, Hao
    Liu, Lu
    Ma, Maode
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (01) : 1178 - 1190
  • [50] IoV-BCFL: An intrusion detection method for IoV based on blockchain and federated learning
    Xie, Nannan
    Zhang, Chuanxue
    Yuan, Qizhao
    Kong, Jing
    Di, Xiaoqiang
    AD HOC NETWORKS, 2024, 163