Risk-sensitive Inverse Reinforcement Learning via Coherent Risk Models

Cited by: 0
Authors
Majumdar, Anirudha [1 ]
Singh, Sumeet [1 ]
Mandlekar, Ajay [2 ]
Pavone, Marco [1 ]
Affiliations
[1] Stanford Univ, Dept Aeronaut & Astronaut, Stanford, CA 94305 USA
[2] Stanford Univ, Elect Engn, Stanford, CA 94305 USA
Source
ROBOTICS: SCIENCE AND SYSTEMS XIII, 2017
Keywords
MARKOV DECISION-PROCESSES; EXPECTED-UTILITY
DOI
Not available
CLC Classification
TP24 [Robotics]
Subject Classification
080202; 1405
Abstract
The literature on Inverse Reinforcement Learning (IRL) typically assumes that humans take actions in order to minimize the expected value of a cost function, i.e., that humans are risk neutral. Yet, in practice, humans are often far from being risk neutral. To fill this gap, the objective of this paper is to devise a framework for risk-sensitive IRL in order to explicitly account for an expert's risk sensitivity. To this end, we propose a flexible class of models based on coherent risk metrics, which allow us to capture an entire spectrum of risk preferences from risk-neutral to worst-case. We propose efficient algorithms based on Linear Programming for inferring an expert's underlying risk metric and cost function for a rich class of static and dynamic decision-making settings. The resulting approach is demonstrated on a simulated driving game with ten human participants. Our method is able to infer and mimic a wide range of qualitatively different driving styles from highly risk-averse to risk-neutral in a data-efficient manner. Moreover, comparisons of the Risk-Sensitive (RS) IRL approach with a risk-neutral model show that the RS-IRL framework more accurately captures observed participant behavior both qualitatively and quantitatively.
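For intuition only (this is a standard textbook construction, not the paper's algorithm): a coherent risk metric can be written as a worst-case expectation over a polytopic "risk envelope" of distributions, so evaluating it on a discrete cost distribution reduces to a linear program. The Python sketch below uses the envelope of Conditional Value-at-Risk (CVaR), a common coherent metric; the losses, probabilities, and the helper name cvar_via_lp are illustrative, and NumPy/SciPy are assumed.

```python
import numpy as np
from scipy.optimize import linprog

def cvar_via_lp(losses, probs, alpha):
    """Evaluate CVaR_alpha(Z) = max_{q in B_alpha} q @ losses, where
    B_alpha = {q : 0 <= q_i <= probs_i / alpha, sum_i q_i = 1} is the
    polytopic risk envelope of CVaR. linprog minimizes, so the
    objective is negated."""
    n = len(losses)
    res = linprog(
        c=-np.asarray(losses, dtype=float),        # maximize q @ losses
        A_eq=np.ones((1, n)), b_eq=[1.0],          # q must be a distribution
        bounds=[(0.0, p / alpha) for p in probs],  # envelope box constraints
        method="highs",
    )
    return -res.fun

# Hypothetical discrete cost distribution (illustrative numbers only).
losses = np.array([1.0, 2.0, 10.0])
probs = np.array([0.5, 0.4, 0.1])

print(cvar_via_lp(losses, probs, alpha=1.0))  # 2.3  -> expectation (risk-neutral)
print(cvar_via_lp(losses, probs, alpha=0.5))  # 3.6  -> intermediate risk aversion
print(cvar_via_lp(losses, probs, alpha=0.1))  # 10.0 -> worst-case outcome
```

At alpha = 1 the envelope collapses to the nominal distribution and the metric is the plain expectation; as alpha shrinks, it approaches the worst-case cost, spanning the risk-preference spectrum the abstract describes. In the paper's inverse setting the direction is reversed: per the abstract, observed expert choices are used to infer the underlying risk metric and cost function via Linear Programming.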
Pages: 10
Related Papers
50 records in total
  • [21] Risk-sensitive reinforcement learning algorithms with generalized average criterion
    Yin Chang-ming
    Wang Han-xing
    Zhao Fei
    Applied Mathematics and Mechanics (English Edition), 2007, 28 (03) : 405 - 416
  • [22] State-Augmentation Transformations for Risk-Sensitive Reinforcement Learning
    Ma, Shuai
    Yu, Jia Yuan
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 4512 - 4519
  • [25] Risk-Sensitive Reinforcement Learning with Function Approximation: A Debiasing Approach
    Fei, Yingjie
    Yang, Zhuoran
    Wang, Zhaoran
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021
  • [26] Risk-sensitive reinforcement learning applied to control under constraints
    Geibel, P
    Wysotzki, F
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2005, 24 : 81 - 108
  • [27] Risk-Sensitive Reinforcement Learning for URLLC Traffic in Wireless Networks
    Ben Khalifa, Nesrine
    Assaad, Mohamad
    Debbah, Merouane
    2019 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2019
  • [28] Risk-Sensitive Portfolio Management by using Distributional Reinforcement Learning
    Harnpadungkij, Thammasorn
    Chaisangmongkon, Warasinee
    Phunchongharn, Phond
    2019 IEEE 10TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY (ICAST 2019), 2019, : 110 - 115
  • [30] Robust Ranking Models via Risk-Sensitive Optimization
    Wang, Lidan
    Bennett, Paul N.
    Collins-Thompson, Kevyn
    SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 761 - 770