Stochastic optimization of multireservoir systems via reinforcement learning

被引:76
|
作者
Lee, Jin-Hee [1 ]
Labadie, John W.
机构
[1] Colorado State Univ, Dept Civil Engn, Ft Collins, CO 80523 USA
[2] Inha Univ, Dept Civil Engn, Div Civil Environm Geoinformat Engn, Inchon 402751, South Korea
关键词
D O I
10.1029/2006WR005627
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Although several variants of stochastic dynamic programming have been applied to optimal operation of multireservoir systems, they have been plagued by a high-dimensional state space and the inability to accurately incorporate the stochastic environment as characterized by temporally and spatially correlated hydrologic inflows. Reinforcement learning has emerged as an effective approach to solving sequential decision problems by combining concepts from artificial intelligence, cognitive science, and operations research. A reinforcement learning system has a mathematical foundation similar to dynamic programming and Markov decision processes, with the goal of maximizing the long-term reward or returns as conditioned on the state of the system environment and the immediate reward obtained from operational decisions. Reinforcement learning can include Monte Carlo simulation where transition probabilities and rewards are not explicitly known a priori. The Q-Learning method in reinforcement learning is demonstrated on the two-reservoir Geum River system, South Korea, and is shown to outperform implicit stochastic dynamic programming and sampling stochastic dynamic programming methods.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Comparative analysis of evolving artificial neural network and reinforcement learning in stochastic optimization of multireservoir systems
    Dariane, Alireza B.
    Moradi, Amir Mohammad
    [J]. HYDROLOGICAL SCIENCES JOURNAL-JOURNAL DES SCIENCES HYDROLOGIQUES, 2016, 61 (06): : 1141 - 1156
  • [2] STOCHASTIC OPTIMIZATION OF INTERCONNECTED MULTIRESERVOIR POWER-SYSTEMS
    LI, CA
    YAN, R
    ZHOU, JY
    [J]. IEEE TRANSACTIONS ON POWER SYSTEMS, 1990, 5 (04) : 1487 - 1496
  • [3] DEVELOPMENT OF A STOCHASTIC OPTIMIZATION FOR MULTIRESERVOIR SCHEDULING
    HALLIBURTON, TS
    SIRISENA, HR
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1984, 29 (01) : 82 - 84
  • [4] Variational Optimization Based Reinforcement Learning for Infinite Dimensional Stochastic Systems
    Evans, Ethan N.
    Periera, Marcus A.
    Boutselis, George I.
    Theodorou, Evangelos A.
    [J]. CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
  • [5] Assessing marginal water values in multipurpose multireservoir systems via stochastic programming
    Tilmant, A.
    Pinte, D.
    Goor, Q.
    [J]. WATER RESOURCES RESEARCH, 2008, 44 (12)
  • [6] Optimization of Multireservoir Systems by Genetic Algorithm
    Onur Hınçal
    A. Burcu Altan-Sakarya
    A. Metin Ger
    [J]. Water Resources Management, 2011, 25 : 1465 - 1487
  • [7] STOCHASTIC OPTIMIZATION OF A MULTIRESERVOIR HYDROELECTRIC SYSTEM - A DECOMPOSITION APPROACH
    PEREIRA, MVF
    PINTO, LMVG
    [J]. WATER RESOURCES RESEARCH, 1985, 21 (06) : 779 - 792
  • [8] Stochastic optimization of multireservoir systems using a heuristic algorithm: Case study from India
    Ponnambalam, K
    Adams, BJ
    [J]. WATER RESOURCES RESEARCH, 1996, 32 (03) : 733 - 741
  • [9] Optimization of Multireservoir Systems by Genetic Algorithm
    Hincal, Onur
    Altan-Sakarya, A. Burcu
    Ger, A. Metin
    [J]. WATER RESOURCES MANAGEMENT, 2011, 25 (05) : 1465 - 1487
  • [10] Accelerating Optimization and Reinforcement Learning with Quasi Stochastic Approximation
    Chen, Shuhang
    Devraj, Adithya
    Bernstein, Andrey
    Meyn, Sean
    [J]. 2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 1965 - 1972