Markov Decision Processes with Average-Value-at-Risk criteria

被引：79

作者：

Baeuerle, Nicole ^{[1
]}

Ott, Jonathan ^{[1
]}

机构：

[1] Karlsruhe Inst Technol, Inst Stochast, D-76128 Karlsruhe, Germany

来源：

MATHEMATICAL METHODS OF OPERATIONS RESEARCH | 2011年 / 74卷 / 03期

关键词：

Markov Decision Problem; Average-Value-at-Risk; Time-consistency; Risk aversion; TIME; OPTIMIZATION; VARIANCE;

D O I：

10.1007/s00186-011-0367-0

中图分类号：

C93 [管理学]; O22 [运筹学];

学科分类号：

070105 ; 12 ; 1201 ; 1202 ; 120202 ;

摘要：

We investigate the problem of minimizing the Average-Value-at-Risk (AVaR(tau)) of the discounted cost over a finite and an infinite horizon which is generated by a Markov Decision Process (MDP). We show that this problem can be reduced to an ordinary MDP with extended state space and give conditions under which an optimal policy exists. We also give a time-consistent interpretation of the AVaR(tau). At the end we consider a numerical example which is a simple repeated casino game. It is used to discuss the influence of the risk aversion parameter tau of the AVaR(tau)-criterion.

引用

页码：361 / 379

页数：19

共 50 条

[41] RISK-SENSITIVE AVERAGE OPTIMALITY FOR DISCRETE-TIME MARKOV DECISION PROCESSES
Chen, Xian
Wei, Qingda
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2023, 61 (01) : 72 - 104
[42] SEPARABLE VALUE-FUNCTIONS FOR INFINITE HORIZON AVERAGE REWARD MARKOV DECISION-PROCESSES
WHITE, DJ
JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 1989, 144 (02) : 450 - 465
[43] Average criteria in denumerable semi-Markov decision chains under risk-aversion
Rolando Cavazos-Cadena
Hugo Cruz-Suárez
Raúl Montes-De-Oca
Discrete Event Dynamic Systems, 2023, 33 : 221 - 256
[44] Average criteria in denumerable semi-Markov decision chains under risk-aversion
Cavazos-Cadena, Rolando
Cruz-Suarez, Hugo
Montes-De-Oca, Raul
DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2023, 33 (03): : 221 - 256
[45] Constrained semi-Markov decision processes with ratio and time expected average criteria in Polish spaces
Wei, Qingda
Guo, Xianping
OPTIMIZATION, 2015, 64 (07) : 1593 - 1623
[46] Risk sensitive Markov decision processes
Marcus, SI
FernandezGaucherand, E
HernandezHernandez, D
Coraluppi, S
Fard, P
SYSTEMS AND CONTROL IN THE TWENTY-FIRST CENTURY, 1997, 22 : 263 - 279
[47] Bayesian Risk Markov Decision Processes
Lin, Yifan
Ren, Yuxuan
Zhou, Enlu
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
[48] Conditional Value-at-Risk for Reachability and Mean Payoff in Markov Decision Processes
Kretinsky, Jan
Meggendorfer, Tobias
LICS'18: PROCEEDINGS OF THE 33RD ANNUAL ACM/IEEE SYMPOSIUM ON LOGIC IN COMPUTER SCIENCE, 2018, : 609 - 618
[49] Value set iteration for Markov decision processes
Chang, Hyeong Soo
AUTOMATICA, 2014, 50 (07) : 1940 - 1943
[50] Continuity of the value of competitive Markov decision processes
Solan, E
JOURNAL OF THEORETICAL PROBABILITY, 2003, 16 (04) : 831 - 845

← 1 2 3 4 5 →