A reinforcement learning approach for waterflooding optimization in petroleum reservoirs

被引:37
|
作者
Hourfar, Farzad [1 ]
Bidgoly, Hamed Jalaly [1 ]
Moshiri, Behzad [1 ]
Salahshoor, Karim [2 ]
Elkamel, Ali [3 ,4 ]
机构
[1] Univ Tehran, CIPCE, Sch Elect & Comp Engn, Tehran, Iran
[2] Petr Univ Technol, Dept Automat & Instrumentat Engn, Ahvaz, Iran
[3] Univ Waterloo, Dept Chem Engn, Waterloo, ON, Canada
[4] Khalifa Univ, Petr Inst, Dept Chem Engn, Abu Dhabi, U Arab Emirates
关键词
Waterflooding process; Reinforcement learning; Production optimization; Closed-loop reservoir management; Derivative-free optimization; MULTIPHASE FLOW; SUBSURFACE FLOW; TERM PRODUCTION; POROUS-MEDIA; OIL-FIELD; LONG-TERM; MANAGEMENT; MODEL; TIME; PERFORMANCE;
D O I
10.1016/j.engappai.2018.09.019
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Waterflooding optimization in closed-loop management of the oil reservoirs is always considered as a challenging issue due to the complicated and unpredicted dynamics of the process. The main goal in waterflooding is to adjust the manipulated variables such that the total oil production or a defined objective function, which has a strong correlation with the gained financial profit, is maximized. Fortunately, due to the recent progresses in the computational tools and also expansion of the calculating facilities, utilization of non-conventional optimization methods is feasible to achieve the desired goals. In this paper, waterflooding optimization problem has been defined and formulated in the framework of Reinforcement Learning (RL) methodology, which is known as a derivative-free and also model-free optimization approach. This technique prevents from the challenges corresponding with the complex gradient calculations for handling the objective functions. So, availability of explicit dynamic models of the reservoir for gradient computations is not mandatory to apply the proposed method. The developed algorithm provides the facility to achieve the desired operational targets, by appropriately defining the learning problem and the necessary variables. The fundamental learning elements such as actions, states, and rewards have been delineated both in discrete and continuous domain. The proposed methodology has been implemented and assessed on the Egg-model which is a popular and well-known reservoir case study. Different configurations for active injection and production wells have been taken into account to simulate Single-Input-Multi-Output (SIMO) as well as Multi-Input-Multi-Output (MIMO) optimization scenarios. The results demonstrate that the "agent" is able to gradually, but successfully learn the most appropriate sequence of actions tailored for each practical scenario. Consequently, the manipulated variables (actions) are set optimally to satisfy the defined production objectives which are generally dictated by the management level or even contractual obligations. Moreover, it has been shown that by properly adjustment of the rewarding policies in the learning process, diverse forms of multi-objective optimization problems can be formulated, analyzed and solved.
引用
收藏
页码:98 / 116
页数:19
相关论文
共 50 条
  • [31] A deep reinforcement learning approach to mountain railway alignment optimization
    Gao, Tianci
    Li, Zihan
    Gao, Yan
    Schonfeld, Paul
    Feng, Xiaoyun
    Wang, Qingyuan
    He, Qing
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2022, 37 (01) : 73 - 92
  • [32] Network routing optimization approach based on deep reinforcement learning
    Meng L.
    Guo B.
    Yang W.
    Zhang X.
    Zhao Z.
    Huang S.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2022, 44 (07): : 2311 - 2318
  • [33] A Deep Reinforcement Learning Approach for Federated Learning Optimization with UAV Trajectory Planning
    Zhang, Chunyu
    Liu, Yiming
    Zhang, Zhi
    2023 IEEE 34TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS, PIMRC, 2023,
  • [34] Learning without Gradients: Multi-Agent Reinforcement Learning approach to optimization
    Morcos, Amir
    Man, Hong
    West, Aaron
    Maguire, Brian
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING IN DEFENSE APPLICATIONS IV, 2022, 12276
  • [35] Alcohol-Assisted Waterflooding in Carbonate Reservoirs
    Al Maskari, Nasser S.
    Saeedi, Ali
    Xie, Quan
    ENERGY & FUELS, 2019, 33 (11) : 10651 - 10658
  • [36] WATERFLOODING WILL BENEFIT SOME GAS-RESERVOIRS
    RIVASGOMEZ, S
    WORLD OIL, 1983, 196 (05) : 71 - &
  • [37] Waterflooding Performance in Inclined Communicating Stratified Reservoirs
    El-Khatib, Noaman A. F.
    SPE JOURNAL, 2012, 17 (01): : 31 - 42
  • [38] OSCAR: a Contention Window Optimization approach using Deep Reinforcement Learning
    Grasso, Christian
    Raftopoulos, Raoul
    Schembra, Giovanni
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 459 - 465
  • [39] A REINFORCEMENT LEARNING APPROACH FOR OPTIMIZATION OF CHEMOTHERAPY AND ITS APPLICATION IN OPTIMAL CONTROL
    Pakdel, A. Fani
    Rezazadeh, M.
    Sustany, M. Naghiby
    ANNALS OF ONCOLOGY, 2012, 23 : 459 - 459
  • [40] Risk-averse Distributional Reinforcement Learning: A CVaR Optimization Approach
    Stanko, Silvestr
    Macek, Karel
    IJCCI: PROCEEDINGS OF THE 11TH INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL INTELLIGENCE, 2019, : 412 - 423