Reward Function Using Inverse Reinforcement Learning and Fuzzy Reasoning

被引:0
|
作者
Kato, Yuta [1 ]
Kanoh, Masayoshi [2 ]
Nakamura, Tsuyoshi [3 ]
机构
[1] Chukyo Univ, Grad Sch Engn, Nagoya, Aichi, Japan
[2] Chukyo Univ, Sch Engn, Nagoya, Aichi, Japan
[3] Nagoya Inst Technol, Grad Sch Engn, Nagoya, Aichi, Japan
关键词
D O I
10.1109/SCISISIS50064.2020.9322710
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A reward function estimated with inverse reinforcement learning has been used to determine a method for controlling a robot. Inverse reinforcement learning requires observed sequences of actions to estimate a reward function. Few models of the sequences give the optimal motion of the robot; therefore, a suboptimal one may be given. However, the suboptimal sequences include some errors and ambiguities. In this paper, we propose a method for quantifying the ambiguity of the reward function, which is designed with inverse reinforcement learning using fuzzy reasoning.
引用
收藏
页码:222 / 227
页数:6
相关论文
共 50 条
  • [1] A Modified Average Reward Reinforcement Learning Based on Fuzzy Reward Function
    Zhai, Zhenkun
    Chen, Wei
    Li, Xiong
    Guo, Jing
    IMECS 2009: INTERNATIONAL MULTI-CONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2009, : 113 - 117
  • [3] Modified reward function on abstract features in inverse reinforcement learning
    Chen, Shen-yi
    Qian, Hui
    Fan, Jia
    Jin, Zhuo-jun
    Zhu, Miao-liang
    JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE C-COMPUTERS & ELECTRONICS, 2010, 11 (09): : 718 - 723
  • [4] Modified reward function on abstract features in inverse reinforcement learning
    Shen-yi Chen
    Hui Qian
    Jia Fan
    Zhuo-jun Jin
    Miao-liang Zhu
    Journal of Zhejiang University SCIENCE C, 2010, 11 : 718 - 723
  • [5] Compatible Reward Inverse Reinforcement Learning
    Metelli, Alberto Maria
    Pirotta, Matteo
    Restelli, Marcello
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [6] Reward Identification in Inverse Reinforcement Learning
    Kim, Kuno
    Garg, Shivam
    Shiragur, Kirankumar
    Ermon, Stefano
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [7] An Inverse Reinforcement Learning Method to Infer Reward Function of Intelligent Jammer
    Fan, Youlin
    Jiu, Bo
    Pu, Wenqiang
    Li, Kang
    Zhang, Yu
    Liu, Hongwei
    Proceedings of the IEEE Radar Conference, 2023,
  • [8] Active Learning for Reward Estimation in Inverse Reinforcement Learning
    Lopes, Manuel
    Melo, Francisco
    Montesano, Luis
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT II, 2009, 5782 : 31 - +
  • [9] Option compatible reward inverse reinforcement learning
    Hwang, Rakhoon
    Lee, Hanjin
    Hwang, Hyung Ju
    PATTERN RECOGNITION LETTERS, 2022, 154 : 83 - 89
  • [10] Inverse Reinforcement Learning with the Average Reward Criterion
    Wu, Feiyang
    Ke, Jingyang
    Wu, Anqi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,