Unsupervised Modeling of Partially Observable Environments

被引:0
|
作者
Graziano, Vincent [1 ]
Koutnik, Jan [1 ]
Schmidhuber, Juergen [1 ]
机构
[1] Univ Lugano, SUPSI, IDSIA, CH-6928 Manno, Switzerland
关键词
Self-Organizing Maps; POMDPs; Reinforcement Learning;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present an architecture based on self-organizing maps for learning a sensory layer in a learning system. The architecture, temporal network for transitions (TNT), enjoys the freedoms of unsupervised learning, works on-line, in non-episodic environments, is computationally light, and scales well. TNT generates a predictive model of its internal representation of the world, making planning methods available for both the exploitation and exploration of the environment. Experiments demonstrate that TNT learns nice representations of classical reinforcement learning mazes of varying size (up to 20 x 20) under conditions of high-noise and stochastic actions.
引用
下载
收藏
页码:503 / 515
页数:13
相关论文
共 50 条
  • [31] Emergence of Sensory Representations Using Prediction in Partially Observable Environments
    Kulak, Thibaut
    Ortiz, Michael Garcia
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT II, 2018, 11140 : 489 - 498
  • [32] Secure Control in Partially Observable Environments to Satisfy LTL Specifications
    Ramasubramanian, Bhaskar
    Niu, Luyao
    Clark, Andrew
    Bushnell, Linda
    Poovendran, Radha
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2021, 66 (12) : 5665 - 5679
  • [33] Switching Q-learning in partially observable Markovian environments
    Kamaya, H
    Lee, H
    Abe, K
    2000 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2000), VOLS 1-3, PROCEEDINGS, 2000, : 1062 - 1067
  • [34] Spatial Consciousness Model of Intrinsic Reward in Partially Observable Environments
    Ni, Zhenghongyuan
    Jin, Ye
    Liu, Peng
    Zhao, Wei
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2022, 106 (04)
  • [35] Human-Assisted Rule Satisfaction in Partially Observable Environments
    Degeler, Viktoriya
    Curry, Edward
    2014 IEEE 11TH INTL CONF ON UBIQUITOUS INTELLIGENCE AND COMPUTING AND 2014 IEEE 11TH INTL CONF ON AUTONOMIC AND TRUSTED COMPUTING AND 2014 IEEE 14TH INTL CONF ON SCALABLE COMPUTING AND COMMUNICATIONS AND ITS ASSOCIATED WORKSHOPS, 2014, : 171 - 178
  • [36] Sequential Action and Beliefs Under Partially Observable DSGE Environments
    Seong-Hoon Kim
    Computational Economics, 2012, 40 : 219 - 244
  • [37] Sequential Action and Beliefs Under Partially Observable DSGE Environments
    Kim, Seong-Hoon
    COMPUTATIONAL ECONOMICS, 2012, 40 (03) : 219 - 244
  • [38] Mixed Observable RRT: Multi-Agent Mission-Planning in Partially Observable Environments
    Johansson, Kasper
    Rosolia, Ugo
    Ubellacker, Wyatt
    Singletary, Andrew
    Ames, Aaron D.
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 1386 - 1392
  • [39] Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
    Srinivasan, Sriram
    Lanctot, Marc
    Zambaldi, Vinicius
    Perolat, Julien
    Tuyls, Karl
    Munos, Remi
    Bowling, Michael
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [40] A neurally plausible model learns successor representations in partially observable environments
    Vertes, Eszter
    Sahani, Maneesh
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32