Applications of the self-organising map to reinforcement learning

被引:60
|
作者
Smith, AJ [1 ]
机构
[1] Univ Edinburgh, Inst Adapt & Neural Computat, Div Informat, Edinburgh EH1 2QL, Midlothian, Scotland
关键词
reinforcement learning; self-organising map; continuous action spaces; generalisation; real-valued actions; unsupervised learning; Q-learning;
D O I
10.1016/S0893-6080(02)00083-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article is concerned with the representation and generalisation of continuous action spaces in reinforcement learning (RL) problems. A model is proposed based on the self-organising map (SOM) of Kohonen [Self Organisation and Associative Memory, 19871 which allows either the one-to-one, many-to-one or one-to-many structure of the desired state-action mapping to be captured. Although presented here for tasks involving immediate reward, the approach is easily extended to delayed reward. We conclude that the SOM is a useful tool for providing real-time, on-line generalisation in RL problems in which the latent dimensionalities of the state and action spaces are small. Scalability issues are also discussed. (C) 2002 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:1107 / 1124
页数:18
相关论文
共 50 条
  • [41] Self-Organising Services
    Serugendo, Giovanna Di Marzo
    Fernandez-Marquez, Jose Luis
    [J]. 2013 IEEE 7TH INTERNATIONAL CONFERENCE ON SELF-ADAPTIVE AND SELF-ORGANIZING SYSTEMS (SASO), 2013, : 257 - 258
  • [42] Self-Organising Autocatalysis
    Virgo, Nathaniel
    McGregor, Simon
    Ikegami, Takashi
    [J]. ALIFE 2014: THE FOURTEENTH INTERNATIONAL CONFERENCE ON THE SYNTHESIS AND SIMULATION OF LIVING SYSTEMS, 2014, : 498 - 505
  • [43] Self-organising systems
    Bonham, Jim
    [J]. AP Australian Printer Magazine, 2002, (SEP.):
  • [44] Applications of Self-Organising Map (SOM) for prioritisation of endemic zones of filariasis in Andhra Pradesh, India
    Murty, Upadhayula Suryanaryana
    Rao, Mutheneni Srinivasa
    Sriram, K.
    Rao, K. Madhusudhan
    [J]. INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2011, 5 (04) : 417 - 427
  • [45] Building blocks for aggregate programming of self-organising applications
    Beal, Jacob
    Viroli, Mirko
    [J]. 2014 IEEE EIGHTH INTERNATIONAL CONFERENCE ON SELF-ADAPTIVE AND SELF-ORGANIZING SYSTEMS WORKSHOPS (SASOW), 2014, : 8 - 13
  • [46] Impulse fault diagnosis in power transformers using self-organising map and learning vector quantisation
    De, A
    Chatterjee, N
    [J]. IEE PROCEEDINGS-GENERATION TRANSMISSION AND DISTRIBUTION, 2001, 148 (05) : 397 - 405
  • [47] A Self-Organising Multi-Manifold Learning Algorithm
    Yin, Hujun
    Zaki, Shireen Mohd
    [J]. BIOINSPIRED COMPUTATION IN ARTIFICIAL SYSTEMS, PT II, 2015, 9108 : 389 - 398
  • [48] Adaptive Learning in Motion Analysis with Self-Organising Maps
    Angelopoulou, Anastassia
    Garcia-Rodriguez, Jose
    Psarrou, Alexandra
    Gupta, Gaurav
    Mentzelopoulos, Markos
    [J]. 2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [49] Active audition using the parameter-less self-organising map
    Berglund, Erik
    Sitte, Joaquin
    Wyeth, Gordon
    [J]. AUTONOMOUS ROBOTS, 2008, 24 (04) : 401 - 417
  • [50] A Drowsiness Detection Decision Support System using Self-Organising Map
    Emmanuel, Fabunmi Temitayo
    Ojo, Adedayo Olukayode
    Gbadamosi, Saheed Lekan
    [J]. 2022 IEEE NIGERIA 4TH INTERNATIONAL CONFERENCE ON DISRUPTIVE TECHNOLOGIES FOR SUSTAINABLE DEVELOPMENT (IEEE NIGERCON), 2022, : 307 - 310