Exploring the design of reward functions in deep reinforcement learning-based vehicle velocity control algorithms

被引:1
|
作者
He, Yixu [1 ]
Liu, Yang [2 ]
Yang, Lan [1 ,4 ]
Qu, Xiaobo [3 ]
机构
[1] Changan Univ, Sch Informat Engn, Xian, Peoples R China
[2] Chalmers Univ Technol, Dept Architecture & Civil Engn, Gothenburg, Sweden
[3] Tsinghua Univ, Sch Vehicle & Mobil, Beijing, Peoples R China
[4] Changan Univ, Sch Informat Engn, Xian 710064, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep reinforcement learning; reward function; vehicle velocity control; AUTONOMOUS VEHICLES; EFFICIENT; MODEL; GO;
D O I
10.1080/19427867.2024.2305018
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
The application of deep reinforcement learning (DRL) techniques in intelligent transportation systems garners significant attention. In this field, reward function design is a crucial factor for DRL performance. Current research predominantly relies on a trial-and-error approach for designing reward functions, lacking mathematical support and necessitating extensive empirical experimentation. Our research uses vehicle velocity control as a case study, build training and test sets, and develop a DRL framework for speed control. This framework examines both single-objective and multi-objective optimization in reward function designs. In single-objective optimization, we introduce "expected optimal velocity" as an optimization objective and analyze how different reward functions affect performance, providing a mathematical perspective on optimizing reward functions. In multi-objective optimization, we propose a reward function design paradigm and validate its effectiveness. Our findings offer a versatile framework and theoretical guidance for developing and optimizing reward functions in DRL, particularly for intelligent transportation systems.
引用
下载
收藏
页码:1338 / 1352
页数:15
相关论文
共 50 条
  • [31] Learning-Based Control Design for Deep Brain Stimulation
    Jovanov, Ilija
    Naumann, Michael
    Kumaravelu, Karthik
    Lesi, Vuk
    Zutshi, Aditya
    Grill, Warren M.
    Pajic, Miroslav
    2018 9TH ACM/IEEE INTERNATIONAL CONFERENCE ON CYBER-PHYSICAL SYSTEMS (ICCPS 2018), 2018, : 349 - 350
  • [32] Deep reinforcement learning for traffic signal control with consistent state and reward design approach
    Bouktif, Salah
    Cheniki, Abderraouf
    Ouni, Ali
    El-Sayed, Hesham
    KNOWLEDGE-BASED SYSTEMS, 2023, 267
  • [33] Deep Reinforcement Learning-Based Accurate Control of Planetary Soft Landing
    Xu, Xibao
    Chen, Yushen
    Bai, Chengchao
    SENSORS, 2021, 21 (23)
  • [34] Deep Reinforcement Learning-based Edge Caching for Industrial Control Applications
    Zhang, Lei
    Xu, Hao
    Wang Guilin
    Yan, Wang
    Wang, Xiaojun
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 5024 - 5029
  • [35] Deep reinforcement learning-based drift parking control of automated vehicles
    LENG Bo
    YU YiZe
    LIU Ming
    CAO Lei
    YANG Xing
    XIONG Lu
    Science China(Technological Sciences), 2023, 66 (04) : 1152 - 1165
  • [36] Deep reinforcement learning-based drift parking control of automated vehicles
    Bo Leng
    YiZe Yu
    Ming Liu
    Lei Cao
    Xing Yang
    Lu Xiong
    Science China Technological Sciences, 2023, 66 : 1152 - 1165
  • [37] Deep Reinforcement Learning-Based Control Framework for Radio Access Networks
    Ahmed, Azza H.
    Elmokashfi, Ahmed
    PROCEEDINGS OF THE 2022 THE 28TH ANNUAL INTERNATIONAL CONFERENCE ON MOBILE COMPUTING AND NETWORKING, ACM MOBICOM 2022, 2022, : 897 - 899
  • [38] A deep reinforcement learning-based active suspension control algorithm considering deterministic experience tracing for autonomous vehicle
    Wang, Cheng
    Cui, Xiaoxian
    Zhao, Shijie
    Zhou, Xinran
    Song, Yaqi
    Wang, Yang
    Guo, Konghui
    APPLIED SOFT COMPUTING, 2024, 153
  • [39] Deep reinforcement learning-based digital twin for droplet microfluidics control
    Gyimah, Nafisat
    Scheler, Ott
    Rang, Toomas
    Pardy, Tamas
    PHYSICS OF FLUIDS, 2023, 35 (08)
  • [40] Deep reinforcement learning-based secondary control for microgrids in islanded mode
    Barbalho, P. I. N.
    Lacerda, V. A.
    Fernandes, R. A. S.
    Coury, D., V
    ELECTRIC POWER SYSTEMS RESEARCH, 2022, 212