Swarm Reinforcement Learning Methods for Problems with Continuous State-Action Space

被引:0
|
作者
Iima, Hitoshi [1 ]
Kuroe, Yasuaki [1 ]
Emoto, Kazuo [1 ]
机构
[1] Kyoto Inst Technol, Dept Informat Sci, Kyoto 606, Japan
关键词
reinforcement learning; swarm intelligence; particle swarm optimization;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We recently proposed swarm reinforcement learning methods in which multiple sets of an agent and an environment are prepared and the agents learn not only by individually performing a usual reinforcement learning method but also by exchanging information among them. Q-learning method has been used as the individual learning in the methods, and they have been applied to a problem with discrete state-action space. In the real world, however, there are many problems which are formulated as ones with continuous state-action space. This paper proposes swarm reinforcement learning methods based on an actor-critic method in order to acquire optimal policies rapidly for problems with continuous state-action space. The proposed methods are applied to a biped robot control problem, and their performance is examined through numerical experiments.
引用
收藏
页码:2173 / 2180
页数:8
相关论文
共 50 条
  • [1] Near-continuous time Reinforcement Learning for continuous state-action spaces
    Croissant, Lorenzo
    Abeille, Marc
    Bouchard, Bruno
    [J]. INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 237, 2024, 237
  • [2] A Plume-Tracing Strategy via Continuous State-action Reinforcement Learning
    Niu, Lvyin
    Song, Shiji
    You, Keyou
    [J]. 2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 759 - 764
  • [3] Improving state-action space exploration in reinforcement learning using geometric properties
    Matei, Ion
    Minhas, Raj
    de Kleer, Johan
    Ganguli, Anurag
    [J]. 2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,
  • [4] For SALE: State-Action Representation Learning for Deep Reinforcement Learning
    Fujimoto, Scott
    Chang, Wei-Di
    Smith, Edward J.
    Gu, Shixiang Shane
    Precup, Doina
    Meger, David
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [5] Enhancing visual reinforcement learning with State-Action Representation
    Yan, Mengbei
    Lyu, Jiafei
    Li, Xiu
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 304
  • [6] STATE-ACTION VALUE FUNCTION MODELED BY ELM IN REINFORCEMENT LEARNING FOR HOSE CONTROL PROBLEMS
    Manuel Lopez-Guede, Jose
    Fernandez-Gauna, Borja
    Grana, Manuel
    [J]. INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2013, 21 : 99 - 116
  • [7] Data-Efficient Reinforcement Learning in Continuous State-Action Gaussian-POMDPs
    McAllister, Rowan Thomas
    Rasmussen, Carl Edward
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [8] Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space
    Barakat, Anas
    Fatkhullin, Ilyas
    He, Niao
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [9] PROJECTED STATE-ACTION BALANCING WEIGHTS FOR OFFLINE REINFORCEMENT LEARNING
    Wang, Jiayi
    Qi, Zhengling
    Wong, Raymond K. W.
    [J]. ANNALS OF STATISTICS, 2023, 51 (04): : 1639 - 1665
  • [10] Pursuit-evasion with Decentralized Robotic Swarm in Continuous State Space and Action Space via Deep Reinforcement Learning
    Singh, Gurpreet
    Lofaro, Daniel M.
    Sofge, Donald
    [J]. ICAART: PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1, 2020, : 226 - 233