Risk-Aware Reinforcement Learning-Based Federated Learning for IoV Systems

被引：0

作者：

Lu, Xiaozhen ^{[1
,2
]}

Liu, Zhibo ^{[1
]}

Chen, Yuhan ^{[1
]}

Xiao, Liang ^{[3
]}

Wang, Wei ^{[4
]}

Wu, Qihui ^{[4
]}

机构：

[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 210016, Peoples R China

[2] Minist Ind & Informat Technol, Key Lab Safety Crit Software Dev & Verificat, Nanjing 210016, Peoples R China

[3] Xiamen Univ, Dept Informat & Commun Engn, Xiamen 361005, Peoples R China

[4] Nanjing Univ Aeronaut & Astronaut, Coll Elect & Informat Engn, Nanjing 210016, Peoples R China

来源：

IEEE TRANSACTIONS ON MOBILE COMPUTING | 2024年 / 23卷 / 12期

基金：

中国国家自然科学基金;

关键词：

Training; Accuracy; Computational modeling; Fuzzy logic; Training data; Task analysis; Quality of service; Federated learning; Internet of Vehicles; reinforcement learning; selfish node;

D O I：

10.1109/TMC.2024.3447034

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Federated learning (FL) that improves data privacy reduces the computational overhead for Internet of Vehicles (IoV) systems but has difficulty in defending against selfish attacks due to the restricted quality of service requirements and the high mobility of vehicles. In this paper, we design a risk-aware hierarchical reinforcement learning-based FL framework for IoV to resist selfish attacks. By designing a two-level hierarchical policy selection module that consists of two deep neural networks, this framework divides the training policy into two sub-policies, i.e., the selection of FL participants and the corresponding local training data size, which are chosen based on the previous training performance and vehicle participation performance. This framework designs a risk-aware safety guide to avoid dangerous states such as local task failure resulting from risky training policies. Specifically, the guide uses a warning signal to evaluate the short-term risk of each state-action pair, applies an R-network to estimate the long-term risks for modifying the chosen training policy, and designs a punishment function for the modified training policy to revise the immediate reward to further enhance the safe exploration. We analyze the convergence performance and computational complexity of our scheme. Experimental results on MNIST, CIFAR-10, and Stanford Cars datasets verify the effectiveness of our scheme, including the global model accuracy, training latency, detection success rate, and convergence speed compared with the benchmarks FedAvg, MFL, DQNPS, and SHRL.

引用

页码：14672 / 14688

页数：17

共 50 条

[41] Decentralized Cluster Head Selection in IoV using Federated Deep Reinforcement Learning
Scott, Chandler
Khan, Mohammad S.
Paranjothi, Anirudh
Li, Joshua Qiang
2024 IEEE 21ST CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC, 2024,
[42] Monte Carlo tree search algorithms for risk-aware and multi-objective reinforcement learning
Hayes, Conor F.
Reymond, Mathieu
Roijers, Diederik M.
Howley, Enda
Mannion, Patrick
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2023, 37 (02)
[43] ARSL-V: A risk-aware relay selection scheme using reinforcement learning in VANETs
Liu, Xuejiao
Wang, Chuanhua
Huang, Lingfeng
Xia, Yingjie
PEER-TO-PEER NETWORKING AND APPLICATIONS, 2024, 17 (03) : 1750 - 1767
[44] End-to-End Risk-aware Reinforcement Learning to Detect Asymptomatic Cases in Healthcare Facilities
Thong, Yongjian
Huang, Weiyu
Adhikari, Bijaya
2024 IEEE 12TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS, ICHI 2024, 2024, : 83 - 92
[45] Monte Carlo tree search algorithms for risk-aware and multi-objective reinforcement learning
Conor F. Hayes
Mathieu Reymond
Diederik M. Roijers
Enda Howley
Patrick Mannion
Autonomous Agents and Multi-Agent Systems, 2023, 37
[46] Machine Learning-Based Risk-Aware Congestion Control Scheme for Minimization of Information Loss in Dense VANET Environment
Dhakad, Bhupendra
Shrivastava, Laxmi
JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2024, 33 (09)
[47] Learning Risk-Aware Costmaps for Traversability in Challenging Environments
Fan, David D.
Agha-mohammadi, Ali-akbar
Theodorou, Evangelos A.
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (01) : 279 - 286
[48] Risk-Aware High-level Decisions for Automated Driving at Occluded Intersections with Reinforcement Learning
Kamran, Danial
Lopez, Carlos Fernandez
Lauer, Martin
Stiller, Christoph
2020 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2020, : 1205 - 1212
[49] Federated Deep Reinforcement Learning-Based Spectrum Access Algorithm With Warranty Contract in Intelligent Transportation Systems
Zhu, Rongbo
Li, Mengyao
Liu, Hao
Liu, Lu
Ma, Maode
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (01) : 1178 - 1190
[50] IoV-BCFL: An intrusion detection method for IoV based on blockchain and federated learning
Xie, Nannan
Zhang, Chuanxue
Yuan, Qizhao
Kong, Jing
Di, Xiaoqiang
AD HOC NETWORKS, 2024, 163

← 1 2 3 4 5 →