Reinforcement learning with orthonormal basis adaptation based on activity-oriented index allocation

被引:0
|
作者
Satoh, Hideki [1 ]
机构
[1] Future Univ Hakodate, Hakodate, Hokkaido 0418655, Japan
关键词
orthonormal basis; function approximation; non-linear; reinforcentent learning; activity;
D O I
10.1093/ietfec/e91-a.4.1169
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
An orthonormal basis adaptation method for function approximation was developed and applied to reinforcement learning with multi-dimensional continuous state space. First, a basis used for linear function approximation of a control function is set to an orthonormal basis. Next, basis elements with small activities are replaced with other candidate elements as learning progresses. As this replacement is repeated, the number of basis elements with large activities increases. Example chaos control problems for multiple logistic maps were solved, demonstrating that the method for adapting an orthonormal basis can modify a basis while holding the orthonormality in accordance with changes in the environment to improve the performance of reinforcement learning and to eliminate the adverse effects of redundant noisy states.
引用
收藏
页码:1169 / 1176
页数:8
相关论文
共 50 条
  • [1] An activity-oriented design framework for mobile learning experience
    Liu, Huanglingzi
    Huang, Ronghuai
    Salomaa, Jyri
    Ma, Ding
    [J]. FIFTH IEEE INTERNATIONAL CONFERENCE ON WIRELESS, MOBILE AND UBIQUITOUS TECHNOLOGIES IN EDUCATION, PROCEEDINGS, 2008, : 185 - +
  • [2] System Identification Based on Generalized Orthonormal Basis Function for Unmanned Helicopters: A Reinforcement Learning Approach
    Liu, Zun
    Li, Jianqiang
    Wang, Cheng
    Yu, Richard
    Chen, Jie
    He, Ying
    Sun, Changyin
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (02) : 1135 - 1145
  • [3] Unsupervised Basis Function Adaptation for Reinforcement Learning
    Barker, Edward
    Ras, Charl
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2019, 20
  • [4] Unsupervised basis function adaptation for reinforcement learning
    Barker, Edward
    Ras, Charl
    [J]. Journal of Machine Learning Research, 2019, 20
  • [5] An activity-oriented approach to visually structured knowledge representation for problem-based learning in virtual learning environments
    Miao, Y
    Holst, S
    Holmer, T
    Fleschutz, J
    Zentel, P
    [J]. DESIGNING COOPERATIVE SYSTEMS - THE USE OF THEORIES AND MODELS, 2000, 58 : 303 - 318
  • [6] A dynamic allocation method of basis functions in reinforcement learning
    Iida, S
    Kuwayama, K
    Kanoh, M
    Kato, S
    Itoh, H
    [J]. AI 2004: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 3339 : 272 - 283
  • [7] Basis function adaptation in temporal difference reinforcement learning
    Menache, I
    Mannor, S
    Shimkin, N
    [J]. ANNALS OF OPERATIONS RESEARCH, 2005, 134 (01) : 215 - 238
  • [8] Basis Function Adaptation in Temporal Difference Reinforcement Learning
    Ishai Menache
    Shie Mannor
    Nahum Shimkin
    [J]. Annals of Operations Research, 2005, 134 : 215 - 238
  • [9] SmartCommit: A Graph-Based Interactive Assistant for Activity-Oriented Commits
    Shen, Bo
    Zhang, Wei
    Kastner, Christian
    Zhao, Haiyan
    Wei, Zhao
    Liang, Guangtai
    Jin, Zhi
    [J]. PROCEEDINGS OF THE 29TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (ESEC/FSE '21), 2021, : 379 - 390
  • [10] Asset Allocation Based On Reinforcement Learning
    Li, Yaoming
    Wu, Junfeng
    Chen, Yun
    [J]. 2020 IEEE 18TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), VOL 1, 2020, : 397 - 402