Modified reward function on abstract features in inverse reinforcement learning

Cited by: 2

Authors:

Shenyi CHEN, Hui QIAN, Jia FAN, Zhuojun JIN, Miaoliang ZHU

Affiliation:

School of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China

Source:

Journal of Zhejiang University-Science C (Computer & Electronics) | 2010, Vol. 11, No. 9

DOI:

Not available

Chinese Library Classification (CLC) number:

TP181 [Automated reasoning, machine learning]

Abstract:

We improve inverse reinforcement learning (IRL) by applying dimension reduction methods to automatically extract abstract features from human-demonstrated policies, to deal with cases where features are either unknown or numerous. The importance rating of each abstract feature is incorporated into the reward function. Simulation is performed on a task of driving on a five-lane highway, where the controlled car has the largest fixed speed among all the cars. On average, performance is almost 10.6% better with importance ratings than without them.
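
As a rough illustration of the idea in the abstract, the Python sketch below assumes PCA as the dimension reduction method and uses each component's explained-variance ratio as its importance rating. Both choices, and the names `extract_abstract_features`, `reward`, and `weights`, are hypothetical; the abstract does not say which method or rating the authors use.

```python
import numpy as np


def extract_abstract_features(demo_features, n_components):
    """Assumed approach: PCA (via SVD) on raw features observed in demonstrations.

    Returns the feature mean, the top principal directions, and an
    importance rating per direction (its explained-variance ratio).
    """
    X = np.asarray(demo_features, dtype=float)
    mean = X.mean(axis=0)
    centered = X - mean
    # Rows of vt are principal directions, ordered by decreasing singular value.
    _, singular_values, vt = np.linalg.svd(centered, full_matrices=False)
    variances = singular_values ** 2
    components = vt[:n_components]
    importance = variances[:n_components] / variances.sum()
    return mean, components, importance


def reward(raw_features, mean, components, importance, weights):
    """Linear reward over abstract features, each scaled by its importance."""
    abstract = components @ (np.asarray(raw_features, dtype=float) - mean)
    return float(np.dot(importance * weights, abstract))
```

In an IRL loop, `weights` would be the quantity fitted to the demonstrations; here it is simply passed in.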

Pages: 718-723

Page count: 6

50 entries in total

[41] Information Directed Reward Learning for Reinforcement Learning
Lindner, David
Turchetta, Matteo
Tschiatschek, Sebastian
Ciosek, Kamil
Krause, Andreas
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[42] Reinforcement learning reward functions for unsupervised learning
Fyfe, Colin
Lai, Pei Ling
ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 1, PROCEEDINGS, 2007, 4491 : 397 - +
[43] Estimating consistent reward of expert in multiple dynamics via linear programming inverse reinforcement learning
Nakata Y.
Arai S.
Transactions of the Japanese Society for Artificial Intelligence, 2019, 34 (06)
[44] Bayesian inverse reinforcement learning for demonstrations of an expert in multiple dynamics: Toward estimation of transferable reward
Yusuke N.
Sachiyo A.
Transactions of the Japanese Society for Artificial Intelligence, 2020, 35 (01)
[45] Adversarial Batch Inverse Reinforcement Learning: Learn to Reward from Imperfect Demonstration for Interactive Recommendation
Liu, Jialin
Su, Xinyan
He, Zeyu
Li, Jun
PROCEEDINGS OF THE 2024 27TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024 : 1262 - 1267
[46] Trajectory Planning of Rehabilitation Exercises using Integrated Reward Function in Reinforcement Learning
Shi Y.
Peng Q.
Zhang J.
Computer-Aided Design and Applications, 2022, 19 (05) : 1042 - 1054
[47] A Dynamic Adjusting Reward Function Method for Deep Reinforcement Learning with Adjustable Parameters
Hu, Zijian
Wan, Kaifang
Gao, Xiaoguang
Zhai, Yiwei
MATHEMATICAL PROBLEMS IN ENGINEERING, 2019, 2019
[48] Simulation study on reward function of reinforcement learning in gantry work cell scheduling
Qu, Xinyan
Chang, Qing
Chakraborty, Nilanjan
JOURNAL OF MANUFACTURING SYSTEMS, 2019, 50 : 1 - 8
[49] Embedded draw-down constraint reward function for deep reinforcement learning
Wu, Jimmy Ming-Tai
Lin, Sheng-Hao
Syu, Jia-Hao
Wu, Mu-En
APPLIED SOFT COMPUTING, 2022, 125
[50] An Analysis of Feature Selection and Reward Function for Model-Based Reinforcement Learning
Shen, Shitian
Lin, Chen
Mostafavi, Behrooz
Barnes, Tiffany
Chi, Min
INTELLIGENT TUTORING SYSTEMS, ITS 2016, 2016, 9684 : 504 - 505