Swarm Reinforcement Learning Methods for Problems with Continuous State-Action Space

被引：0

作者：

Iima, Hitoshi ^{[1
]}

Kuroe, Yasuaki ^{[1
]}

Emoto, Kazuo ^{[1
]}

机构：

[1] Kyoto Inst Technol, Dept Informat Sci, Kyoto 606, Japan

来源：

2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC) | 2011年

关键词：

reinforcement learning; swarm intelligence; particle swarm optimization;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We recently proposed swarm reinforcement learning methods in which multiple sets of an agent and an environment are prepared and the agents learn not only by individually performing a usual reinforcement learning method but also by exchanging information among them. Q-learning method has been used as the individual learning in the methods, and they have been applied to a problem with discrete state-action space. In the real world, however, there are many problems which are formulated as ones with continuous state-action space. This paper proposes swarm reinforcement learning methods based on an actor-critic method in order to acquire optimal policies rapidly for problems with continuous state-action space. The proposed methods are applied to a biped robot control problem, and their performance is examined through numerical experiments.

引用

页码：2173 / 2180

页数：8

共 50 条

[1] Near-continuous time Reinforcement Learning for continuous state-action spaces
Croissant, Lorenzo
Abeille, Marc
Bouchard, Bruno
[J]. INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 237, 2024, 237
[2] A Plume-Tracing Strategy via Continuous State-action Reinforcement Learning
Niu, Lvyin
Song, Shiji
You, Keyou
[J]. 2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 759 - 764
[3] Improving state-action space exploration in reinforcement learning using geometric properties
Matei, Ion
Minhas, Raj
de Kleer, Johan
Ganguli, Anurag
[J]. 2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,
[4] For SALE: State-Action Representation Learning for Deep Reinforcement Learning
Fujimoto, Scott
Chang, Wei-Di
Smith, Edward J.
Gu, Shixiang Shane
Precup, Doina
Meger, David
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[5] Enhancing visual reinforcement learning with State-Action Representation
Yan, Mengbei
Lyu, Jiafei
Li, Xiu
[J]. KNOWLEDGE-BASED SYSTEMS, 2024, 304
[6] STATE-ACTION VALUE FUNCTION MODELED BY ELM IN REINFORCEMENT LEARNING FOR HOSE CONTROL PROBLEMS
Manuel Lopez-Guede, Jose
Fernandez-Gauna, Borja
Grana, Manuel
[J]. INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2013, 21 : 99 - 116
[7] Data-Efficient Reinforcement Learning in Continuous State-Action Gaussian-POMDPs
McAllister, Rowan Thomas
Rasmussen, Carl Edward
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
[8] Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space
Barakat, Anas
Fatkhullin, Ilyas
He, Niao
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
[9] PROJECTED STATE-ACTION BALANCING WEIGHTS FOR OFFLINE REINFORCEMENT LEARNING
Wang, Jiayi
Qi, Zhengling
Wong, Raymond K. W.
[J]. ANNALS OF STATISTICS, 2023, 51 (04): : 1639 - 1665
[10] Pursuit-evasion with Decentralized Robotic Swarm in Continuous State Space and Action Space via Deep Reinforcement Learning
Singh, Gurpreet
Lofaro, Daniel M.
Sofge, Donald
[J]. ICAART: PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1, 2020, : 226 - 233

← 1 2 3 4 5 →