A survey of inverse reinforcement learning

被引:0
|
作者
Stephen Adams
Tyler Cody
Peter A. Beling
机构
[1] Virginia Tech,Hume Center for National Security and Technology
来源
关键词
Reinforcement learning; Inverse reinforcement learning; Inverse optimal control; Apprenticeship learning; Learning from demonstration;
D O I
暂无
中图分类号
学科分类号
摘要
Learning from demonstration, or imitation learning, is the process of learning to act in an environment from examples provided by a teacher. Inverse reinforcement learning (IRL) is a specific form of learning from demonstration that attempts to estimate the reward function of a Markov decision process from examples provided by the teacher. The reward function is often considered the most succinct description of a task. In simple applications, the reward function may be known or easily derived from properties of the system and hard coded into the learning process. However, in complex applications, this may not be possible, and it may be easier to learn the reward function by observing the actions of the teacher. This paper provides a comprehensive survey of the literature on IRL. This survey outlines the differences between IRL and two similar methods - apprenticeship learning and inverse optimal control. Further, this survey organizes the IRL literature based on the principal method, describes applications of IRL algorithms, and provides areas of future research.
引用
收藏
页码:4307 / 4346
页数:39
相关论文
共 50 条
  • [1] Survey on Inverse Reinforcement Learning
    Zhang L.-H.
    Liu Q.
    Huang Z.-G.
    Zhu F.
    [J]. Ruan Jian Xue Bao/Journal of Software, 2023, 34 (10): : 4772 - 4803
  • [2] A survey of inverse reinforcement learning
    Adams, Stephen
    Cody, Tyler
    Beling, Peter A.
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (06) : 4307 - 4346
  • [3] A survey of inverse reinforcement learning techniques
    Shao Zhifei
    Joo, Er Meng
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT COMPUTING AND CYBERNETICS, 2012, 5 (03) : 293 - 311
  • [4] A survey of inverse reinforcement learning: Challenges, methods and progress
    Arora, Saurabh
    Doshi, Prashant
    [J]. ARTIFICIAL INTELLIGENCE, 2021, 297 (297)
  • [5] A Survey of Inverse Reinforcement Learning Algorithms, Theory and Applications
    Song, Li
    Li, Da-Zi
    Xu, Xin
    [J]. Zidonghua Xuebao/Acta Automatica Sinica, 2024, 50 (09): : 1704 - 1723
  • [6] Repeated Inverse Reinforcement Learning
    Amin, Kareem
    Jiang, Nan
    Singh, Satinder
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [7] Cooperative Inverse Reinforcement Learning
    Hadfield-Menell, Dylan
    Dragan, Anca
    Abbeel, Pieter
    Russell, Stuart
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [8] Misspecification in Inverse Reinforcement Learning
    Skalse, Joar
    Abate, Alessandro
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 12, 2023, : 15136 - 15143
  • [9] Bayesian Inverse Reinforcement Learning
    Ramachandran, Deepak
    Amir, Eyal
    [J]. 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 2586 - 2591
  • [10] Inverse Constrained Reinforcement Learning
    Malik, Shehryar
    Anwar, Usman
    Aghasi, Alireza
    Ahmed, Ali
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139