Minimizing Expected Loss for Risk-Avoiding Reinforcement Learning

Cited by: 0

Authors:

Yeh, Jung-Jung [1]

Kuo, Tsung-Ting [1]

Chen, William [2]

Lin, Shou-De [1]

Affiliations:

[1] National Taiwan University, Taipei, Taiwan

[2] Institute for Information Industry, Taipei, Taiwan

Source:

2014 International Conference on Data Science and Advanced Analytics (DSAA) | 2014

Keywords:

reinforcement learning; risk avoiding; risk model; profit model

DOI:

Not available

Chinese Library Classification (CLC):

TM [Electrical engineering]; TN [Electronic technology, communication technology]

Subject classification codes:

0808; 0809

Abstract:

This paper considers the design of a reinforcement learning (RL) agent that can strike a balance between return and risk. First, we discuss several favorable properties of an RL risk model, and then propose a definition of risk based on expected negative rewards. We also design a Q-decomposition-based framework that allows a reinforcement learning agent to control the balance between risk and profit. The results of experiments on both artificial and real-world stock datasets demonstrate that the proposed risk model satisfies the beneficial properties of an RL-based risk learning model, and also significantly outperforms other approaches in terms of avoiding risks.
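The abstract does not give the update rules, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes that risk is tracked as the expected discounted sum of negative-reward magnitudes, that profit is tracked as the expected discounted sum of positive rewards, and that actions are ranked by profit minus a weighted risk. The names `RiskAwareQAgent`, `split_reward`, and `risk_weight` are hypothetical and introduced here for illustration.

```python
import random
from collections import defaultdict


def split_reward(r):
    """Split a reward into (profit, loss), both non-negative (an assumed decomposition)."""
    return max(r, 0.0), max(-r, 0.0)


class RiskAwareQAgent:
    """Tabular agent with separate profit and risk Q-tables (illustrative sketch only)."""

    def __init__(self, actions, risk_weight=1.0, alpha=0.1, gamma=0.9,
                 epsilon=0.1, seed=0):
        self.actions = list(actions)
        self.risk_weight = risk_weight  # hypothetical knob: larger means more risk-averse
        self.alpha = alpha              # learning rate
        self.gamma = gamma              # discount factor
        self.epsilon = epsilon          # exploration rate
        self.rng = random.Random(seed)
        self.q_profit = defaultdict(float)  # expected discounted positive reward
        self.q_risk = defaultdict(float)    # expected discounted negative-reward magnitude

    def score(self, state, action):
        """Combine the two components into one value used to rank actions."""
        key = (state, action)
        return self.q_profit[key] - self.risk_weight * self.q_risk[key]

    def greedy(self, state):
        """Action with the best combined score in this state."""
        return max(self.actions, key=lambda a: self.score(state, a))

    def act(self, state):
        """Epsilon-greedy action selection over the combined score."""
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.actions)
        return self.greedy(state)

    def update(self, state, action, reward, next_state, done):
        """Update both tables, bootstrapping from the action the combined policy would pick."""
        profit, loss = split_reward(reward)
        next_action = None if done else self.greedy(next_state)
        for table, part in ((self.q_profit, profit), (self.q_risk, loss)):
            target = part
            if not done:
                target += self.gamma * table[(next_state, next_action)]
            key = (state, action)
            table[key] += self.alpha * (target - table[key])
```

Under these assumptions, setting `risk_weight` to 0 recovers an agent that ignores losses when ranking actions, while larger values make it prefer actions with smaller expected loss even at the cost of some profit. Bootstrapping both tables from the same jointly chosen next action follows the spirit of Q-decomposition, in which each component is evaluated under the policy the combined agent actually follows.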


Pages: 11-17

Page count: 7
