Relevance Vector Sampling for Reinforcement Learning in Continuous Action Space

被引:0
|
作者
Lee, Minwoo [1 ]
Anderson, Charles W. [1 ]
机构
[1] Colorado State Univ, Dept Comp Sci, Ft Collins, CO 80523 USA
关键词
BIOMECHANICS;
D O I
10.1109/ICMLA.2016.150
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To be applicable to real world problems, much reinforcement learning (RL) research has focused on continuous state spaces with function approximations. Some problems also require continuous actions, but searching for good actions in a continuous action space is problematic. This paper suggests a novel relevance vector sampling approach to action search in an RL framework with relevance vector machines (RVM-RL). We hypothesize that each relevance vector (RV) is placed on the modes of the value approximation surface as the learning converges. From the hypothesis, we select actions in RVs to maximize the estimated state-action values. We report the efficiency of the proposed approach by controlling a simulated octopus arm with RV-sampled actions.
引用
收藏
页码:774 / 779
页数:6
相关论文
共 50 条
  • [1] Switching reinforcement learning for continuous action space
    Nagayoshi, Masato
    Murao, Hajime
    Tamaki, Hisashi
    [J]. ELECTRONICS AND COMMUNICATIONS IN JAPAN, 2012, 95 (03) : 37 - 44
  • [2] A reinforcement learning with switching controllers for a continuous action space
    Nagayoshi, Masato
    Murao, Hajime
    Tamaki, Hisashi
    [J]. ARTIFICIAL LIFE AND ROBOTICS, 2010, 15 (01) : 97 - 100
  • [3] Reinforcement learning algorithm with CTRNN in continuous action space
    Arie, Hiroaki
    Namikawa, Jun
    Ogata, Tetsuya
    Tani, Jun
    Sugano, Shigeki
    [J]. NEURAL INFORMATION PROCESSING, PT 1, PROCEEDINGS, 2006, 4232 : 387 - 396
  • [4] Soft Action Particle Deep Reinforcement Learning for a Continuous Action Space
    Kang, Minjae
    Lee, Kyungjae
    Oh, Songhwai
    [J]. 2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 5028 - 5033
  • [5] Reinforcement Learning For Sepsis Treatment: A Continuous Action Space Solution
    Huang, Yong
    Cao, Rui
    Rahmani, Amir
    [J]. MACHINE LEARNING FOR HEALTHCARE CONFERENCE, VOL 182, 2022, 182 : 631 - 647
  • [6] Algorithmic trading using continuous action space deep reinforcement learning
    Majidi, Naseh
    Shamsi, Mahdi
    Marvasti, Farokh
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 235
  • [7] Deep reinforcement learning in continuous action space for autonomous robotic surgery
    Shahkoo, Amin Abbasi
    Abin, Ahmad Ali
    [J]. INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2023, 18 (03) : 423 - 431
  • [8] Deep reinforcement learning in continuous action space for autonomous robotic surgery
    Amin Abbasi Shahkoo
    Ahmad Ali Abin
    [J]. International Journal of Computer Assisted Radiology and Surgery, 2023, 18 : 423 - 431
  • [9] Human-in-the-Loop Reinforcement Learning in Continuous-Action Space
    Luo, Biao
    Wu, Zhengke
    Zhou, Fei
    Wang, Bing-Chuan
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 10
  • [10] Reinforcement learning with continuous vector output
    Li, Q
    Zhu, HH
    Lin, LM
    Yan, GZ
    [J]. SMC 2000 CONFERENCE PROCEEDINGS: 2000 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOL 1-5, 2000, : 188 - 193