A proposition of adaptive state space partition in reinforcement learning with Voronoi Tessellation

被引:0
|
作者
Aung, Kathy Thi [1 ]
Fuchida, Takayasu [1 ]
机构
[1] Kagoshima Univ, Grad Sch Sci & Engn, Dept Syst Informat Sci, Kohrimoto 1-21-40, Kagoshima 8900065, Japan
关键词
Q-learning; LBG; new Vector quantization method; State space partitioning;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a new adaptive segmentation of continuous state space based on vector quantization algorithm such as LBG (Linde-Buzo-Gray) for high-dimensional continuous state spaces. The objective of adaptive state space partitioning is to develop the efficiency of learning reward values with an accumulation of state transition vector (STV) in a single-agent environment. We constructed our single-agent model in continuous state and discrete actions spaces using Q-learning function. Moreover, the study of the resulting state space partition reveals in a Voronoi tessellation. In addition, the experimental results show that this proposed method can partition the continuous state space appropriately into Voronoi regions according to not only the number of actions, and achieve a good performance of reward based learning tasks compared with other approaches such as square partition lattice.
引用
收藏
页码:638 / 641
页数:4
相关论文
共 50 条
  • [1] A proposition of adaptive state space partition in reinforcement learning with Voronoi tessellation
    Fuchida, Takayasu
    Aung, Kathy Thi
    [J]. ARTIFICIAL LIFE AND ROBOTICS, 2013, 18 (3-4) : 172 - 177
  • [2] Adaptive state space formation in reinforcement learning
    Samejima, K
    Omori, T
    [J]. ICONIP'98: THE FIFTH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING JOINTLY WITH JNNS'98: THE 1998 ANNUAL CONFERENCE OF THE JAPANESE NEURAL NETWORK SOCIETY - PROCEEDINGS, VOLS 1-3, 1998, : 251 - 255
  • [3] Adaptive state space partitioning for reinforcement learning
    Lee, ISK
    Lau, HYK
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2004, 17 (06) : 577 - 588
  • [4] Reinforcement learning using Voronoi space division
    Aung, Kathy
    Fuchida, Takayasu
    [J]. ARTIFICIAL LIFE AND ROBOTICS, 2010, 15 (03) : 330 - 334
  • [5] Adaptive Coda-Wave Imaging With Voronoi Tessellation
    Mao, Shujuan
    Ellsworth, William L.
    Beroza, Gregory C.
    [J]. JOURNAL OF GEOPHYSICAL RESEARCH-SOLID EARTH, 2023, 128 (08)
  • [6] A reinforcement learning with adaptive state space construction for mobile robot navigation
    Li, Guizhi
    Pang, Jie
    [J]. PROCEEDINGS OF THE 2006 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL, 2006, : 84 - 88
  • [7] Centroidal Voronoi tessellation in universal covering space of manifold surfaces
    Rong, Guodong
    Jin, Miao
    Shuai, Liang
    Guo, Xiaohu
    [J]. COMPUTER AIDED GEOMETRIC DESIGN, 2011, 28 (08) : 475 - 496
  • [8] State space partition for reinforcement learning based on fuzzy min-max neural network
    Duan, Yong
    Cui, Baoxia
    Xu, Xinhe
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 2, PROCEEDINGS, 2007, 4492 : 160 - +
  • [9] Fuzzy CMAC with Automatic State Partition for Reinforcement Learning
    Min, Huaqing
    Zeng, Jiaan
    Luo, Ronghua
    [J]. WORLD SUMMIT ON GENETIC AND EVOLUTIONARY COMPUTATION (GEC 09), 2009, : 421 - 428
  • [10] Adaptive State Aggregation for Reinforcement Learning
    Hwang, Kao-Shing
    Chen, Yu-Jen
    Jiang, Wei-Cheng
    [J]. PROCEEDINGS 2012 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2012, : 2452 - 2456