Modified reward function on abstract features in inverse reinforcement learning

被引:2
|
作者
Shenyi CHENHui QIANJia FANZhuojun JINMiaoliang ZHUSchool of Computer Science and TechnologyZhejiang UniversityHangzhou China [310027 ]
机构
关键词
D O I
暂无
中图分类号
TP181 [自动推理、机器学习];
学科分类号
摘要
We improve inverse reinforcement learning(IRL) by applying dimension reduction methods to automatically extract Abstract features from human-demonstrated policies,to deal with the cases where features are either unknown or numerous.The importance rating of each abstract feature is incorporated into the reward function.Simulation is performed on a task of driving in a five-lane highway,where the controlled car has the largest fixed speed among all the cars.Performance is almost 10.6% better on average with than without importance ratings.
引用
收藏
页码:718 / 723
页数:6
相关论文
共 50 条
  • [41] Information Directed Reward Learning for Reinforcement Learning
    Lindner, David
    Turchetta, Matteo
    Tschiatschek, Sebastian
    Ciosek, Kamil
    Krause, Andreas
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [42] Reinforcement learning reward functions for unsupervised learning
    Fyfe, Colin
    Lai, Pei Ling
    ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 1, PROCEEDINGS, 2007, 4491 : 397 - +
  • [43] Estimating consistent reward of expert in multiple dynamics via linear programming inverse reinforcement learning
    Nakata Y.
    Arai S.
    Transactions of the Japanese Society for Artificial Intelligence, 2019, 34 (06)
  • [44] Bavesian inverse reinforcement learning for demonstrations of an expert in multiple dynamics: Toward estimation of transferable reward
    Yusukc N.
    Sachiyo A.
    Transactions of the Japanese Society for Artificial Intelligence, 2020, 35 (01)
  • [45] Adversarial Batch Inverse Reinforcement Learning: Learn to Reward from Imperfect Demonstration for Interactive Recommendation
    Liu, Jialin
    Su, Xinyan
    He, Zeyu
    Li, Jun
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 1262 - 1267
  • [46] Trajectory Planning of Rehabilitation Exercises using Integrated Reward Function in Reinforcement Learning
    Shi Y.
    Peng Q.
    Zhang J.
    Computer-Aided Design and Applications, 2022, 19 (05): : 1042 - 1054
  • [47] A Dynamic Adjusting Reward Function Method for Deep Reinforcement Learning with Adjustable Parameters
    Hu, Zijian
    Wan, Kaifang
    Gao, Xiaoguang
    Zhai, Yiwei
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2019, 2019
  • [48] Simulation study on reward function of reinforcement learning in gantry work cell scheduling
    Qu, Xinyan
    Chang, Qing
    Chakraborty, Nilanjan
    JOURNAL OF MANUFACTURING SYSTEMS, 2019, 50 : 1 - 8
  • [49] Embedded draw-down constraint reward function for deep reinforcement learning
    Wu, Jimmy Ming -Tai
    Lin, Sheng-Hao
    Syu, Jia-Hao
    Wu, Mu-En
    APPLIED SOFT COMPUTING, 2022, 125
  • [50] An Analysis of Feature Selection and Reward Function for Model-Based Reinforcement Learning
    Shen, Shitian
    Lin, Chen
    Mostafavi, Behrooz
    Barnes, Tiffany
    Chi, Min
    INTELLIGENT TUTORING SYSTEMS, ITS 2016, 2016, 9684 : 504 - 505