Scaled free-energy based reinforcement learning for robust and efficient learning in high-dimensional state spaces

被引:8
|
作者
Elfwing, Stefan [1 ]
Uchibe, Eiji [1 ]
Doya, Kenji [1 ]
机构
[1] Grad Univ, Okinawa Inst Sci & Technol, Neural Computat Unit, Onna Son, Okinawa 9040412, Japan
来源
关键词
reinforcement learning; free-energy; restricted Boltzmann machine; robot navigation; function approximation; SPATIAL COGNITION; NAVIGATION; MODEL;
D O I
10.3389/fnbot.2013.00003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Free energy based reinforcement learning (FERL) was proposed for learning in high-dimensional state- and action spaces, which cannot be handled by standard function approximation methods. In this study, we propose a scaled version of free-energy based reinforcement learning to achieve more robust and more efficient learning performance. The action value function is approximated by the negative free energy of a restricted Boltzmann machine, divided by a constant scaling factor that is related to the size of the Boltzmann machine (the square root of the number of state nodes in this study). Our first task is a digit floor gridworld task, where the states are represented by images of handwritten digits from the MNIST data set. The purpose of the task is to investigate the proposed method's ability, through the extraction of task relevant features in the hidden layer, to cluster images of the same digit and to cluster images of different digits that corresponds to states with the same optimal action. We also test the method's robustness with respect to different exploration schedules, i.e., different settings of the initial temperature and the temperature discount rate in softmax action selection. Our second task is a robot visual navigation task, where the robot can learn its position by the different colors of the lower part of four landmarks and it can infer the correct corner goal area by the color of the upper part of the landmarks. The state space consists of binarized camera images with, at most, nine different colors, which is equal to 6642 binary states. For both tasks, the learning performance is compared with standard FERL and with function approximation where the action-value function is approximated by a two-layered feedforward neural network.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Learning in high-dimensional multimedia data: the state of the art
    Lianli Gao
    Jingkuan Song
    Xingyi Liu
    Junming Shao
    Jiajun Liu
    Jie Shao
    Multimedia Systems, 2017, 23 : 303 - 313
  • [42] Robust transfer learning of high-dimensional generalized linear model
    Sun, Fei
    Zhang, Qi
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2023, 618
  • [43] Exploring the robust extrapolation of high-dimensional machine learning potentials
    Zeni, Claudio
    Anelli, Andrea
    Glielmo, Aldo
    Rossi, Kevin
    PHYSICAL REVIEW B, 2022, 105 (16)
  • [44] Robust transfer learning for high-dimensional regression with linear constraints
    Chen, Xuan
    Song, Yunquan
    Wang, Yuanfeng
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2024, 94 (11) : 2462 - 2482
  • [45] Free-energy calculations along a high-dimensional fragmented path with constrained dynamics
    Chen, Changjun
    Huang, Yanzhao
    Xiao, Yi
    PHYSICAL REVIEW E, 2012, 86 (03):
  • [46] High-dimensional reinforcement learning for optimization and control of ultracold quantum gases
    Milson, N.
    Tashchilina, A.
    Ooi, T.
    Czarnecka, A.
    Ahmad, Z. F.
    Leblanc, L. J.
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2023, 4 (04):
  • [47] High-Dimensional Ensemble Learning Classification: An Ensemble Learning Classification Algorithm Based on High-Dimensional Feature Space Reconstruction
    Zhao, Miao
    Ye, Ning
    APPLIED SCIENCES-BASEL, 2024, 14 (05):
  • [48] Efficient Private Empirical Risk Minimization for High-dimensional Learning
    Kasiviswanathan, Shiva Prasad
    Jin, Hongxia
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [49] Memory-Efficient Learning for High-Dimensional MRI Reconstruction
    Wang, Ke
    Kellman, Michael
    Sandino, Christopher M.
    Zhang, Kevin
    Vasanawala, Shreyas S.
    Tamir, Jonathan, I
    Yu, Stella X.
    Lustig, Michael
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT VI, 2021, 12906 : 461 - 470
  • [50] Reinforcement Learning Based Energy Efficient Underwater Localization
    You, Xudong
    Lv, Zefang
    Ding, Yuzhen
    Su, Wei
    Xiao, Liang
    2020 12TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2020, : 927 - 932