A Modified Average Reward Reinforcement Learning Based on Fuzzy Reward Function

被引:0
|
作者
Zhai, Zhenkun [1 ]
Chen, Wei [1 ]
Li, Xiong [1 ]
Guo, Jing [1 ]
机构
[1] Guangdong Univ Technol, Fac Automat, Guangzhou 510006, Guangdong, Peoples R China
关键词
reinforcement learning; fuzzy inference system; reward function; R-learning;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The purpose of this paper is to propose a fuzzy reward function for improving the learning efficiency of reinforcement learning. Reinforcement learning is a sort of on-line learning approach. During the learning process, learning system (often called an agent) learns how to operate in the environment, basing upon the effect of action-reward signal. Actually, the reward signal is always represented with a reward function, of which the role is to evaluate whether the agent acts well or not. According to the distribution of rewards in the space of states, reward function can be classified as two basic forms, dense function and sparse function. Parse function has the advantage of the easy implementation, but its learning efficiency is relatively low. As for dense function, it is very difficult to design when the size of state space is very large. In response to these defects, we present a methodology for designing a fuzzy reward functions by the use of fuzzy inference system. As a result of this solution, not only generalization ability of reward function can be improved, but also we are apt to embed expert experience in reinforcement learning system. Through applying the fuzzy reward function to R-learning and making simulation experiment, we verify that the learning efficiency for R-learning is effectively improved.
引用
收藏
页码:113 / 117
页数:5
相关论文
共 50 条
  • [1] Application of Fuzzy Inference System to Average Reward Reinforcement Learning
    Chen, Wei
    Zhai, Zhenkun
    Li, Xiong
    Guo, Jing
    [J]. 2009 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND COMPUTER SCIENCE, VOL 1, PROCEEDINGS, 2009, : 374 - 377
  • [2] Hierarchical average reward reinforcement learning
    Ghavamzadeh, Mohammad
    Mahadevan, Sridhar
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2007, 8 : 2629 - 2669
  • [3] Hierarchical average reward reinforcement learning
    Department of Computing Science, University of Alberta, Edmonton, Alta. T6G 2E8, Canada
    不详
    [J]. Journal of Machine Learning Research, 2007, 8 : 2629 - 2669
  • [4] Model-based average reward reinforcement learning
    Tadepalli, P
    Ok, D
    [J]. ARTIFICIAL INTELLIGENCE, 1998, 100 (1-2) : 177 - 224
  • [5] Reward Function Using Inverse Reinforcement Learning and Fuzzy Reasoning
    Kato, Yuta
    Kanoh, Masayoshi
    Nakamura, Tsuyoshi
    [J]. 2020 JOINT 11TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS AND 21ST INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (SCIS-ISIS), 2020, : 222 - 227
  • [6] Robust Average-Reward Reinforcement Learning
    Wang, Yue
    Velasquez, Alvaro
    Atia, George
    Prater-Bennette, Ashley
    Zou, Shaofeng
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2024, 80 : 719 - 803
  • [7] Robust Average-Reward Reinforcement Learning
    Wang, Yue
    Velasquez, Alvaro
    Atia, George
    Prater-Bennette, Ashley
    Zou, Shaofeng
    [J]. Journal of Artificial Intelligence Research, 2024, 80 : 719 - 803
  • [8] Inverse Reinforcement Learning with the Average Reward Criterion
    Wu, Feiyang
    Ke, Jingyang
    Wu, Anqi
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [9] Learning classifier system with average reward reinforcement learning
    Zang, Zhaoxiang
    Li, Dehua
    Wang, Junying
    Xia, Dan
    [J]. KNOWLEDGE-BASED SYSTEMS, 2013, 40 : 58 - 71
  • [10] Modified reward function on abstract features in inverse reinforcement learning
    Shen-yi Chen
    Hui Qian
    Jia Fan
    Zhuo-jun Jin
    Miao-liang Zhu
    [J]. Journal of Zhejiang University SCIENCE C, 2010, 11 : 718 - 723