Minimizing Expected Loss for Risk-Avoiding Reinforcement Learning

被引:0
|
作者
Yeh, Jung-Jung [1 ]
Kuo, Tsung-Ting [1 ]
Chen, William [2 ]
Lin, Shou-De [1 ]
机构
[1] Natl Taiwan Univ, Taipei, Taiwan
[2] Inst Informat Ind, Taipei, Taiwan
关键词
reinforcement learning; risk avoiding; risk model; profit model;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper considers the design of a reinforcement learning (RL) agent that can strike a balance between return and risk. First, we discuss several favorable properties of an RL risk model, and then propose a definition of risk based on expected negative rewards. We also design a Q-decomposition-based framework that allows a reinforcement learning agent to control the balance between risk and profit. The results of experiments on both artificial and real-world stock datasets demonstrate that the proposed risk model satisfies the beneficial properties of an RL-based risk learning model, and also significantly outperforms other approaches in terms of avoiding risks.
引用
收藏
页码:11 / 17
页数:7
相关论文
共 50 条
  • [1] Research on fuzzy risk-avoiding logic
    Liu, HB
    [J]. DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES A-MATHEMATICAL ANALYSIS, 2006, 13 : 1280 - 1285
  • [2] Risk-avoiding cultures toward achievement of knowledge sharing
    Lai, Ming-Fong
    Lee, Gwo-Guang
    [J]. BUSINESS PROCESS MANAGEMENT JOURNAL, 2007, 13 (04) : 522 - 537
  • [3] Risk-seeking versus risk-avoiding investments in noisy periodic environments
    Navarro-Barrientos, J. Emeterio
    Walter, Frank E.
    Schweitzer, Frank
    [J]. INTERNATIONAL JOURNAL OF MODERN PHYSICS C, 2008, 19 (06): : 971 - 994
  • [4] Risk-facing or risk-avoiding? Group loyalty encourages subordinates to tell the truth
    Cheng, Jen-Wei
    Hung, Cheng-Ze
    Yen, Hung-Chieh
    Seih, Yi-Tai
    Chien, Kang-Min
    [J]. JOURNAL OF SOCIAL PSYCHOLOGY, 2022, 162 (04): : 407 - 422
  • [5] Active Learning of Equivalence Relations by Minimizing the Expected Loss Using Constraint Inference
    Rendle, Steffen
    Schmidt-Thieme, Lars
    [J]. ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 1001 - 1006
  • [6] LDI: Low Yields, Customization, and Managing Downside Risk-Avoiding a "Mexican Standoff"
    Kurian, Sean
    [J]. JOURNAL OF INVESTING, 2016, 25 (04): : 121 - 133
  • [7] RISK-TAKING AND RISK-AVOIDING BEHAVIOR - THE IMPACT OF SOME DISPOSITIONAL AND SITUATIONAL VARIABLES
    WYATT, G
    [J]. JOURNAL OF PSYCHOLOGY, 1990, 124 (04): : 437 - 447
  • [8] Solving Multiple Inference by Minimizing Expected Loss
    Chen, Cong
    Yang, Jiaqi
    Chen, Chao
    Yuan, Changhe
    [J]. INTERNATIONAL CONFERENCE ON PROBABILISTIC GRAPHICAL MODELS, VOL 138, 2020, 138 : 65 - 76
  • [9] Risk-taking and risk-avoiding behaviors by hermit crabs across multiple environmental contexts
    Gorman, Daniel
    Ragagnin, Marilia N.
    McCarthy, Ian D.
    Turra, Alexander
    [J]. JOURNAL OF EXPERIMENTAL MARINE BIOLOGY AND ECOLOGY, 2018, 506 : 25 - 29
  • [10] Dynamic Scheduling of Cybersecurity Analysts for Minimizing Risk Using Reinforcement Learning
    Ganesan, Rajesh
    Jajodia, Sushil
    Shah, Ankit
    Cam, Hasan
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2016, 8 (01) : 1 - 21