Competitive reinforcement learning in continuous control tasks

Citations: 0
Authors
Abramson, M [1]
Pachowicz, P [1]
Wechsler, H [1]
Affiliation
[1] George Mason Univ, Fairfax, VA 22030 USA
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
This paper describes a novel hybrid reinforcement learning algorithm, Sarsa Learning Vector Quantization (SLVQ), that leaves the reinforcement part intact but employs a more effective representation of the policy function using a piecewise constant function based upon "policy prototypes." The prototypes correspond to the pattern classes induced by the Voronoi tessellation generated by self-organizing methods like Learning Vector Quantization (LVQ). The determination of the optimal policy function can now be viewed as a pattern recognition problem, in the sense that the assignment of an action to a point in the phase space is similar to the assignment of a pattern class to a point in the phase space. The distributed LVQ representation of the policy function automatically generates a piecewise constant tessellation of the state space and yields a major simplification of the learning task relative to standard reinforcement learning algorithms, for which a discontinuous table look-up function has to be learned. The feasibility and comparative advantages of the new algorithm are shown on the cart centering and mountain car problems, two control problems of increasing difficulty.
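The idea of a piecewise constant policy defined by prototypes can be illustrated with a small sketch. This is not the authors' SLVQ implementation: the abstract does not give the update rule, so the code below assumes the textbook LVQ1 rule, and the names `nearest`, `policy`, `lvq1_update`, `target_action`, and `alpha`, as well as the example prototypes, are hypothetical. Each prototype pairs a point in state space with an action, and the policy returns the action of the nearest prototype, which makes it constant on each Voronoi cell.

```python
import math

def nearest(prototypes, state):
    """Return the index of the prototype closest to `state` (Euclidean distance)."""
    return min(range(len(prototypes)),
               key=lambda i: math.dist(prototypes[i][0], state))

def policy(prototypes, state):
    """Piecewise constant policy: the action attached to the nearest prototype."""
    return prototypes[nearest(prototypes, state)][1]

def lvq1_update(prototypes, state, target_action, alpha=0.1):
    """Textbook LVQ1 step (an assumption, not the paper's rule).

    The winning prototype moves toward `state` if its action equals
    `target_action`, and away from `state` otherwise.
    """
    i = nearest(prototypes, state)
    vec, action = prototypes[i]
    sign = 1.0 if action == target_action else -1.0
    new_vec = [w + sign * alpha * (x - w) for w, x in zip(vec, state)]
    prototypes[i] = (new_vec, action)
    return i

# Hypothetical 2-D state (position, velocity) with two prototypes:
# push right (+1) on the left half, push left (-1) on the right half.
example_prototypes = [([-1.0, 0.0], +1), ([1.0, 0.0], -1)]
```

In a reinforcement learning setting, `target_action` would have to come from the learning signal (for example, the action currently preferred by a Sarsa value estimate); how SLVQ couples the two is not specified in the abstract.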
Pages: 1909-1914
Page count: 6