Applications of the self-organising map to reinforcement learning

Cited by: 60
Author
Smith, AJ [1]
Affiliation
[1] Univ Edinburgh, Inst Adapt & Neural Computat, Div Informat, Edinburgh EH1 2QL, Midlothian, Scotland
Keywords
reinforcement learning; self-organising map; continuous action spaces; generalisation; real-valued actions; unsupervised learning; Q-learning;
DOI
10.1016/S0893-6080(02)00083-7
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This article is concerned with the representation and generalisation of continuous action spaces in reinforcement learning (RL) problems. A model is proposed based on the self-organising map (SOM) of Kohonen [Self Organisation and Associative Memory, 1987] which allows either the one-to-one, many-to-one or one-to-many structure of the desired state-action mapping to be captured. Although presented here for tasks involving immediate reward, the approach is easily extended to delayed reward. We conclude that the SOM is a useful tool for providing real-time, on-line generalisation in RL problems in which the latent dimensionalities of the state and action spaces are small. Scalability issues are also discussed. (C) 2002 Elsevier Science Ltd. All rights reserved.
Pages: 1107-1124
Number of pages: 18