A proposition of adaptive state space partition in reinforcement learning with Voronoi Tessellation

Citations: 0

Authors:

Aung, Kathy Thi [1]

Fuchida, Takayasu [1]

Affiliations:

[1] Kagoshima Univ, Grad Sch Sci & Engn, Dept Syst Informat Sci, Kohrimoto 1-21-40, Kagoshima 8900065, Japan

Source:

PROCEEDINGS OF THE SEVENTEENTH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL LIFE AND ROBOTICS (AROB 17TH '12) | 2012

Keywords:

Q-learning; LBG; new Vector quantization method; State space partitioning;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

This paper presents a new adaptive segmentation method for high-dimensional continuous state spaces, based on a vector quantization algorithm such as LBG (Linde-Buzo-Gray). The objective of adaptive state space partitioning is to improve the efficiency of learning reward values by accumulating state transition vectors (STVs) in a single-agent environment. We constructed our single-agent model with a continuous state space and a discrete action space using Q-learning. An examination of the resulting state space partition shows that it forms a Voronoi tessellation. In addition, the experimental results show that the proposed method can partition the continuous state space appropriately into Voronoi regions, depending on more than just the number of actions, and that it achieves good performance on reward-based learning tasks compared with other approaches such as a square lattice partition.
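To make the ideas named in the abstract concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes a Voronoi partition defined by a list of generator points, looks up the region of a continuous state by nearest generator, applies a standard tabular Q-learning update over region indices, and runs one centroid-update iteration of the kind used in LBG. The function names (`nearest_region`, `q_update`, `lbg_step`) and the parameters `alpha` and `gamma` are illustrative assumptions; the paper's state transition vector (STV) accumulation and the LBG codebook-splitting step are not modeled here.

```python
import math


def nearest_region(state, generators):
    """Return the index of the generator closest to `state`.

    With Euclidean distance, this index identifies the Voronoi region
    that contains the continuous state.
    """
    return min(range(len(generators)),
               key=lambda i: math.dist(state, generators[i]))


def q_update(q, region, action, reward, next_region, alpha=0.1, gamma=0.9):
    """Apply one standard Q-learning update to the table `q`.

    `q[region][action]` holds the value estimate; alpha is the learning
    rate and gamma the discount factor (both hypothetical defaults).
    """
    target = reward + gamma * max(q[next_region])
    q[region][action] += alpha * (target - q[region][action])


def lbg_step(samples, generators):
    """One centroid-update iteration (assumed simplification of LBG).

    Each generator moves to the mean of the samples that fall in its
    Voronoi region; a generator with no samples stays where it is.
    """
    dim = len(generators[0])
    sums = [[0.0] * dim for _ in generators]
    counts = [0] * len(generators)
    for s in samples:
        i = nearest_region(s, generators)
        counts[i] += 1
        for d in range(dim):
            sums[i][d] += s[d]
    return [
        tuple(sums[i][d] / counts[i] for d in range(dim)) if counts[i]
        else tuple(generators[i])
        for i in range(len(generators))
    ]
```

A learning loop under these assumptions would call `nearest_region` on each observed state, choose an action from the corresponding row of `q`, call `q_update` after the transition, and periodically call `lbg_step` on the visited states to adapt the partition.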


Pages: 638-641

Page count: 4

50 items in total

[1] A proposition of adaptive state space partition in reinforcement learning with Voronoi tessellation
Fuchida, Takayasu
Aung, Kathy Thi
[J]. ARTIFICIAL LIFE AND ROBOTICS, 2013, 18 (3-4) : 172 - 177
[2] Adaptive state space formation in reinforcement learning
Samejima, K
Omori, T
[J]. ICONIP'98: THE FIFTH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING JOINTLY WITH JNNS'98: THE 1998 ANNUAL CONFERENCE OF THE JAPANESE NEURAL NETWORK SOCIETY - PROCEEDINGS, VOLS 1-3, 1998, : 251 - 255
[3] Adaptive state space partitioning for reinforcement learning
Lee, ISK
Lau, HYK
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2004, 17 (06) : 577 - 588
[4] Reinforcement learning using Voronoi space division
Aung, Kathy
Fuchida, Takayasu
[J]. ARTIFICIAL LIFE AND ROBOTICS, 2010, 15 (03) : 330 - 334
[5] Adaptive Coda-Wave Imaging With Voronoi Tessellation
Mao, Shujuan
Ellsworth, William L.
Beroza, Gregory C.
[J]. JOURNAL OF GEOPHYSICAL RESEARCH-SOLID EARTH, 2023, 128 (08)
[6] A reinforcement learning with adaptive state space construction for mobile robot navigation
Li, Guizhi
Pang, Jie
[J]. PROCEEDINGS OF THE 2006 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL, 2006, : 84 - 88
[7] Centroidal Voronoi tessellation in universal covering space of manifold surfaces
Rong, Guodong
Jin, Miao
Shuai, Liang
Guo, Xiaohu
[J]. COMPUTER AIDED GEOMETRIC DESIGN, 2011, 28 (08) : 475 - 496
[8] State space partition for reinforcement learning based on fuzzy min-max neural network
Duan, Yong
Cui, Baoxia
Xu, Xinhe
[J]. ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 2, PROCEEDINGS, 2007, 4492 : 160 - +
[9] Fuzzy CMAC with Automatic State Partition for Reinforcement Learning
Min, Huaqing
Zeng, Jiaan
Luo, Ronghua
[J]. WORLD SUMMIT ON GENETIC AND EVOLUTIONARY COMPUTATION (GEC 09), 2009, : 421 - 428
[10] Adaptive State Aggregation for Reinforcement Learning
Hwang, Kao-Shing
Chen, Yu-Jen
Jiang, Wei-Cheng
[J]. PROCEEDINGS 2012 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2012, : 2452 - 2456
