Exploring the design of reward functions in deep reinforcement learning-based vehicle velocity control algorithms

被引:1
|
作者
He, Yixu [1 ]
Liu, Yang [2 ]
Yang, Lan [1 ,4 ]
Qu, Xiaobo [3 ]
机构
[1] Changan Univ, Sch Informat Engn, Xian, Peoples R China
[2] Chalmers Univ Technol, Dept Architecture & Civil Engn, Gothenburg, Sweden
[3] Tsinghua Univ, Sch Vehicle & Mobil, Beijing, Peoples R China
[4] Changan Univ, Sch Informat Engn, Xian 710064, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep reinforcement learning; reward function; vehicle velocity control; AUTONOMOUS VEHICLES; EFFICIENT; MODEL; GO;
D O I
10.1080/19427867.2024.2305018
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
The application of deep reinforcement learning (DRL) techniques in intelligent transportation systems garners significant attention. In this field, reward function design is a crucial factor for DRL performance. Current research predominantly relies on a trial-and-error approach for designing reward functions, lacking mathematical support and necessitating extensive empirical experimentation. Our research uses vehicle velocity control as a case study, build training and test sets, and develop a DRL framework for speed control. This framework examines both single-objective and multi-objective optimization in reward function designs. In single-objective optimization, we introduce "expected optimal velocity" as an optimization objective and analyze how different reward functions affect performance, providing a mathematical perspective on optimizing reward functions. In multi-objective optimization, we propose a reward function design paradigm and validate its effectiveness. Our findings offer a versatile framework and theoretical guidance for developing and optimizing reward functions in DRL, particularly for intelligent transportation systems.
引用
收藏
页码:1338 / 1352
页数:15
相关论文
共 50 条
  • [1] Deep Adaptive Control: Deep Reinforcement Learning-Based Adaptive Vehicle Trajectory Control Algorithms for Different Risk Levels
    He, Yixu
    Liu, Yang
    Yang, Lan
    Qu, Xiaobo
    [J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 1654 - 1666
  • [2] Reward Mechanism Design for Deep Reinforcement Learning-Based Microgrid Energy Management
    Hu, Mingjie
    Han, Baohui
    Lv, Shilin
    Bao, Zhejing
    Lu, Lingxia
    Yu, Miao
    [J]. 2023 6TH INTERNATIONAL CONFERENCE ON RENEWABLE ENERGY AND POWER ENGINEERING, REPE 2023, 2023, : 201 - 205
  • [3] Deep reinforcement learning-based rehabilitation robot trajectory planning with optimized reward functions
    Wang, Xusheng
    Xie, Jiexin
    Guo, Shijie
    Li, Yue
    Sun, Pengfei
    Gan, Zhongxue
    [J]. ADVANCES IN MECHANICAL ENGINEERING, 2021, 13 (12)
  • [4] Deep reinforcement learning with reward design for quantum control
    Yu, Haixu
    Zhao, Xudong
    [J]. IEEE Transactions on Artificial Intelligence, 2024, 5 (03): : 1087 - 1101
  • [5] Exploring the Effectiveness of Different State Spaces and Reward Functions in Reinforcement Learning-based Control of a DC/DC Buck Converter
    Ranjbaran, Parisa
    Shahnooshi, Shima
    Ebrahimi, Javad
    [J]. 2023 25TH EUROPEAN CONFERENCE ON POWER ELECTRONICS AND APPLICATIONS, EPE'23 ECCE EUROPE, 2023,
  • [6] Deep Reinforcement Learning-based Image Captioning with Embedding Reward
    Ren, Zhou
    Wang, Xiaoyu
    Zhang, Ning
    Lv, Xutao
    Li, Li-Jia
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1151 - 1159
  • [7] Algorithms for dynamic control of a deep-sea mining vehicle based on deep reinforcement learning
    Chen, Qihang
    Yang, Jianmin
    Zhao, Wenhua
    Tao, Longbin
    Mao, Jinghang
    Li, Zhiyuan
    [J]. OCEAN ENGINEERING, 2024, 298
  • [8] An Assessment of Multistage Reward Function Design for Deep Reinforcement Learning-Based Microgrid Energy Management
    Goh, Hui Hwang
    Huang, Yifeng
    Lim, Chee Shen
    Zhang, Dongdong
    Liu, Hui
    Dai, Wei
    Kurniawan, Tonni Agustiono
    Rahman, Saifur
    [J]. IEEE TRANSACTIONS ON SMART GRID, 2022, 13 (06) : 4300 - 4311
  • [9] A deep reinforcement learning-based approach for autonomous lane-changing velocity control in mixed flow of vehicle group level
    Wang, Zhe
    Huang, Helai
    Tang, Jinjun
    Hu, Lipeng
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [10] Robust Adaptive Scaffolding with Inverse Reinforcement Learning-Based Reward Design
    Fahid, Fahmid Morshed
    Rowe, Jonathan P.
    Spain, Randall D.
    Goldberg, Benjamin S.
    Pokorny, Robert
    Lester, James
    [J]. ARTIFICIAL INTELLIGENCE IN EDUCATION: POSTERS AND LATE BREAKING RESULTS, WORKSHOPS AND TUTORIALS, INDUSTRY AND INNOVATION TRACKS, PRACTITIONERS AND DOCTORAL CONSORTIUM, PT II, 2022, 13356 : 204 - 207