Risk-Aware Reinforcement Learning-Based Federated Learning for IoV Systems

被引：0

作者：

Lu, Xiaozhen ^{[1
,2
]}

Liu, Zhibo ^{[1
]}

Chen, Yuhan ^{[1
]}

Xiao, Liang ^{[3
]}

Wang, Wei ^{[4
]}

Wu, Qihui ^{[4
]}

机构：

[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 210016, Peoples R China

[2] Minist Ind & Informat Technol, Key Lab Safety Crit Software Dev & Verificat, Nanjing 210016, Peoples R China

[3] Xiamen Univ, Dept Informat & Commun Engn, Xiamen 361005, Peoples R China

[4] Nanjing Univ Aeronaut & Astronaut, Coll Elect & Informat Engn, Nanjing 210016, Peoples R China

来源：

IEEE TRANSACTIONS ON MOBILE COMPUTING | 2024年 / 23卷 / 12期

基金：

中国国家自然科学基金;

关键词：

Training; Accuracy; Computational modeling; Fuzzy logic; Training data; Task analysis; Quality of service; Federated learning; Internet of Vehicles; reinforcement learning; selfish node;

D O I：

10.1109/TMC.2024.3447034

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Federated learning (FL) that improves data privacy reduces the computational overhead for Internet of Vehicles (IoV) systems but has difficulty in defending against selfish attacks due to the restricted quality of service requirements and the high mobility of vehicles. In this paper, we design a risk-aware hierarchical reinforcement learning-based FL framework for IoV to resist selfish attacks. By designing a two-level hierarchical policy selection module that consists of two deep neural networks, this framework divides the training policy into two sub-policies, i.e., the selection of FL participants and the corresponding local training data size, which are chosen based on the previous training performance and vehicle participation performance. This framework designs a risk-aware safety guide to avoid dangerous states such as local task failure resulting from risky training policies. Specifically, the guide uses a warning signal to evaluate the short-term risk of each state-action pair, applies an R-network to estimate the long-term risks for modifying the chosen training policy, and designs a punishment function for the modified training policy to revise the immediate reward to further enhance the safe exploration. We analyze the convergence performance and computational complexity of our scheme. Experimental results on MNIST, CIFAR-10, and Stanford Cars datasets verify the effectiveness of our scheme, including the global model accuracy, training latency, detection success rate, and convergence speed compared with the benchmarks FedAvg, MFL, DQNPS, and SHRL.

引用

页码：14672 / 14688

页数：17

共 50 条

[31] DeepScalper: A Risk-Aware Reinforcement Learning Framework to Capture Fleeting Intraday Trading Opportunities
Sun, Shuo
Xue, Wanqi
Wang, Rundong
He, Xu
Zhu, Junlei
Li, Jian
An, Bo
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 1858 - 1867
[32] Dependency-aware online task offloading based on deep reinforcement learning for IoV
Liu, Chunhong
Wang, Huaichen
Zhao, Mengdi
Liu, Jialei
Zhao, Xiaoyan
Yuan, Peiyan
JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2024, 13 (01):
[33] A Data Sharing Scheme Based on Federated Learning in IoV
Hu, Xiaoya
Li, Ruiqin
Wang, Licheng
Ning, Yuqiao
Ota, Kaoru
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (09) : 11644 - 11656
[34] Testing the Plasticity of Reinforcement Learning-based Systems
Biagiola, Matteo
Tonella, Paolo
ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2022, 31 (04)
[35] Federated Deep Reinforcement Learning-Based Multi-UAV Navigation for Heterogeneous NOMA Systems
Rezwan, Sifat
Chun, Chanjun
Choi, Wooyeol
IEEE SENSORS JOURNAL, 2023, 23 (23) : 29722 - 29732
[36] A Learning-Based Incentive Mechanism for Federated Learning
Zhan, Yufeng
Li, Peng
Qu, Zhihao
Zeng, Deze
Guo, Song
IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (07): : 6360 - 6368
[37] FRL-FI: Transient Fault Analysis for Federated Reinforcement Learning-Based Navigation Systems
Wan, Zishen
Anwar, Aqeel
Mahmoud, Abdulrahman
Jia, Tianyu
Iisiao, Yu-Shun
Reddi, Vijay Janapa
Raychowdhury, Arijit
PROCEEDINGS OF THE 2022 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2022), 2022, : 430 - 435
[38] Deep Learning Quadcopter Control via Risk-Aware Active Learning
Andersson, Olov
Wzorek, Mariusz
Doherty, Patrick
THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3812 - 3818
[39] A Federated Learning and Deep Reinforcement Learning-Based Method with Two Types of Agents for Computation Offload
Liu, Song
Yang, Shiyuan
Zhang, Hanze
Wu, Weiguo
SENSORS, 2023, 23 (04)
[40] Deep Reinforcement Learning-Based Resource Allocation for UAV-Enabled Federated Edge Learning
Liu T.
Zhang T.K.
Loo J.
Wang Y.P.
Journal of Communications and Information Networks, 2023, 8 (01)

← 1 2 3 4 5 →