Entropy-Guided Distributional Reinforcement Learning with Controlling Uncertainty in Robotic Tasks

被引:0
|
作者
Cho, Hyunjin [1 ]
Kim, Hyunseok [1 ]
机构
[1] Dong A Univ, Dept Comp Engn, Busan 49315, South Korea
来源
APPLIED SCIENCES-BASEL | 2025年 / 15卷 / 05期
关键词
distributional reinforcement learning; offline reinforcement learning; robot learning;
D O I
10.3390/app15052773
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This study proposes a novel approach to enhance the stability and performance of reinforcement learning (RL) in long-horizon tasks. Overestimation bias in value function estimation and high uncertainty within environments make it difficult to determine the optimal action. To address this, we improve the truncated quantile critics algorithm by managing uncertainty in robotic applications. Our dynamic method adjusts the discount factor based on policy entropy, allowing for fine-tuning that reflects the agent's learning status. This enables the existing algorithm to learn stably even in scenarios with limited training data, ensuring more robust adaptation. By leveraging policy entropy loss, this approach effectively boosts confidence in predicting future rewards. Our experiments demonstrated an 11% increase in average evaluation return compared to traditional fixed-discount-factor approaches in the DeepMind Control Suite and Gymnasium robotics environments. This approach significantly enhances sample efficiency and adaptability in complex long-horizon tasks, highlighting the effectiveness of entropy-guided RL in navigating challenging and uncertain environments.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] A Quantized Kernel Least Mean Square Scheme with Entropy-Guided Learning for Intelligent Data Analysis
    Xiong Luo
    Jing Deng
    Ji Liu
    Weiping Wang
    Xiaojuan Ban
    Jenq-Haur Wang
    中国通信, 2017, 14 (07) : 127 - 136
  • [22] rl_reach: Reproducible reinforcement learning experiments for robotic tasks
    Aumjaud, Pierre
    McAuliffe, David
    Rodriguez Lera, Francisco J.
    Cardiff, Philip
    SOFTWARE IMPACTS, 2021, 8
  • [23] Quantifying the effect of feedback frequency in interactive reinforcement learning for robotic tasks
    Daniel Harnack
    Julie Pivin-Bachler
    Nicolás Navarro-Guerrero
    Neural Computing and Applications, 2023, 35 : 16931 - 16943
  • [24] Ultrasound-Guided Robotic Navigation with Deep Reinforcement Learning
    Hase, Hannes
    Azampour, Mohammad Farid
    Tirindelli, Maria
    Paschali, Magdalini
    Simson, Walter
    Fatemizadeh, Emad
    Navab, Nassir
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5534 - 5541
  • [25] A review on reinforcement learning for contact-rich robotic manipulation tasks
    Elguea-Aguinaco, Inigo
    Serrano-Munoz, Antonio
    Chrysostomou, Dimitrios
    Inziarte-Hidalgo, Ibai
    Bogh, Simon
    Arana-Arexolaleiba, Nestor
    ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2023, 81
  • [26] Quantifying the effect of feedback frequency in interactive reinforcement learning for robotic tasks
    Harnack, Daniel
    Pivin-Bachler, Julie
    Navarro-Guerrero, Nicolas
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (23): : 16931 - 16943
  • [27] Exploring spiking neural networks for deep reinforcement learning in robotic tasks
    Zanatta, Luca
    Barchi, Francesco
    Manoni, Simone
    Tolu, Silvia
    Bartolini, Andrea
    Acquaviva, Andrea
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [28] Adaptive Discount Factor for Deep Reinforcement Learning in Continuing Tasks with Uncertainty
    Kim, MyeongSeop
    Kim, Jung-Su
    Choi, Myoung-Su
    Park, Jae-Han
    SENSORS, 2022, 22 (19)
  • [29] Robust Reinforcement Learning in Continuous Control Tasks with Uncertainty Set Regularization
    Zhang, Yuan
    Wang, Jianhong
    Boedecker, Joschka
    CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [30] Controlling Soft Robotic Arms Using Hybrid Modelling and Reinforcement Learning
    Lou, Gaoming
    Wang, Chuang
    Xu, Zefeng
    Liang, Jiaqiao
    Zhou, Yitong
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (08): : 7070 - 7077