Design and application of deep reinforcement learning algorithms based on unbiased exploration strategies for value functions

Cited by: 0
Authors
Lv, Pingli [1]
Affiliations
[1] School of Information Engineering, Xuzhou College of Industrial Technology, Xuzhou 221140, Jiangsu Province, China
Source
Measurement: Sensors | 2024 / Vol. 34
DOI
10.1016/j.measen.2024.101241
Related papers
50 in total
  • [1] Reward poisoning attacks in deep reinforcement learning based on exploration strategies
    Cai, Kanting
    Zhu, Xiangbin
    Hu, Zhaolong
    NEUROCOMPUTING, 2023, 553
  • [2] Exploring the design of reward functions in deep reinforcement learning-based vehicle velocity control algorithms
    He, Yixu
    Liu, Yang
    Yang, Lan
    Qu, Xiaobo
    TRANSPORTATION LETTERS-THE INTERNATIONAL JOURNAL OF TRANSPORTATION RESEARCH, 2024, 16 (10): 1338-1352
  • [3] The Deep Quality-Value Family of Deep Reinforcement Learning Algorithms
    Sabatelli, Matthia
    Louppe, Gilles
    Geurts, Pierre
    Wiering, Marco A.
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020
  • [4] Deep Reinforcement Learning with Feedback-based Exploration
    Scholten, Jan
    Wout, Daan
    Celemin, Carlos
    Kober, Jens
    2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019: 803-808
  • [5] #Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
    Tang, Haoran
    Houthooft, Rein
    Foote, Davis
    Stooke, Adam
    Chen, Xi
    Duan, Yan
    Schulman, John
    De Turck, Filip
    Abbeel, Pieter
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [6] Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition
    Chandak, Yash
    Thakoor, Shantanu
    Guo, Zhaohan Daniel
    Tang, Yunhao
    Munos, Remi
    Dabney, Will
    Borsa, Diana
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [7] Difference Based Metrics for Deep Reinforcement Learning Algorithms
    de Oliveira, Bernardo A. G.
    Martins, Carlos A. P. da S.
    Magalhaes, Flavia
    Goes, Luis Fabricio W.
    IEEE ACCESS, 2019, 7: 159141-159149
  • [8] GEP-PG: Decoupling Exploration and Exploitation in Deep Reinforcement Learning Algorithms
    Colas, Cedric
    Sigaud, Olivier
    Oudeyer, Pierre-Yves
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [9] Policy Augmentation: An Exploration Strategy for Faster Convergence of Deep Reinforcement Learning Algorithms
    Mahyari, Arash
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021: 3505-3509
  • [10] Adaptive Exploration Strategies for Reinforcement Learning
    Hwang, Kao-Shing
    Li, Chih-Wen
    Jiang, Wei-Cheng
    2017 INTERNATIONAL CONFERENCE ON SYSTEM SCIENCE AND ENGINEERING (ICSSE), 2017: 16-19