A reinforcement learning approach for waterflooding optimization in petroleum reservoirs

被引：37

作者：

Hourfar, Farzad ^{[1
]}

Bidgoly, Hamed Jalaly ^{[1
]}

Moshiri, Behzad ^{[1
]}

Salahshoor, Karim ^{[2
]}

Elkamel, Ali ^{[3
,4
]}

机构：

[1] Univ Tehran, CIPCE, Sch Elect & Comp Engn, Tehran, Iran

[2] Petr Univ Technol, Dept Automat & Instrumentat Engn, Ahvaz, Iran

[3] Univ Waterloo, Dept Chem Engn, Waterloo, ON, Canada

[4] Khalifa Univ, Petr Inst, Dept Chem Engn, Abu Dhabi, U Arab Emirates

来源：

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2019年 / 77卷

关键词：

Waterflooding process; Reinforcement learning; Production optimization; Closed-loop reservoir management; Derivative-free optimization; MULTIPHASE FLOW; SUBSURFACE FLOW; TERM PRODUCTION; POROUS-MEDIA; OIL-FIELD; LONG-TERM; MANAGEMENT; MODEL; TIME; PERFORMANCE;

D O I：

10.1016/j.engappai.2018.09.019

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Waterflooding optimization in closed-loop management of the oil reservoirs is always considered as a challenging issue due to the complicated and unpredicted dynamics of the process. The main goal in waterflooding is to adjust the manipulated variables such that the total oil production or a defined objective function, which has a strong correlation with the gained financial profit, is maximized. Fortunately, due to the recent progresses in the computational tools and also expansion of the calculating facilities, utilization of non-conventional optimization methods is feasible to achieve the desired goals. In this paper, waterflooding optimization problem has been defined and formulated in the framework of Reinforcement Learning (RL) methodology, which is known as a derivative-free and also model-free optimization approach. This technique prevents from the challenges corresponding with the complex gradient calculations for handling the objective functions. So, availability of explicit dynamic models of the reservoir for gradient computations is not mandatory to apply the proposed method. The developed algorithm provides the facility to achieve the desired operational targets, by appropriately defining the learning problem and the necessary variables. The fundamental learning elements such as actions, states, and rewards have been delineated both in discrete and continuous domain. The proposed methodology has been implemented and assessed on the Egg-model which is a popular and well-known reservoir case study. Different configurations for active injection and production wells have been taken into account to simulate Single-Input-Multi-Output (SIMO) as well as Multi-Input-Multi-Output (MIMO) optimization scenarios. The results demonstrate that the "agent" is able to gradually, but successfully learn the most appropriate sequence of actions tailored for each practical scenario. Consequently, the manipulated variables (actions) are set optimally to satisfy the defined production objectives which are generally dictated by the management level or even contractual obligations. Moreover, it has been shown that by properly adjustment of the rewarding policies in the learning process, diverse forms of multi-objective optimization problems can be formulated, analyzed and solved.

引用

页码：98 / 116

页数：19

共 50 条

[41] A Deep Reinforcement Learning Approach to the Optimization of Data Center Task Scheduling
Che, Haiying
Bai, Zixing
Zuo, Rong
Li, Honglei
COMPLEXITY, 2020, 2020
[42] A Reinforcement Learning Approach to Dynamic Optimization of Load Allocation in AGC System
Wang, Y. M.
Liu, Q. J.
Yu, T.
2009 IEEE POWER & ENERGY SOCIETY GENERAL MEETING, VOLS 1-8, 2009, : 3704 - 3709
[43] MetaTrader: An Reinforcement Learning Approach Integrating Diverse Policies for Portfolio Optimization
Niu, Hui
Li, Siyuan
Li, Jian
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 1573 - 1583
[44] Test Suite Prioritization Based on Optimization Approach Using Reinforcement Learning
Waqar, Muhammad
Imran
Zaman, Muhammad Atif
Muzammal, Muhammad
Kim, Jungsuk
APPLIED SCIENCES-BASEL, 2022, 12 (13):
[45] AN EFFICIENT REINFORCEMENT LEARNING BASED APPROACH FOR SDN CONTROLLER PLACEMENT OPTIMIZATION
Aboelela, Omnia A.
Sadek, Rowayda A.
2024 41ST NATIONAL RADIO SCIENCE CONFERENCE, NRSC 2024, 2024, : 126 - 135
[46] Parameter Optimization of Multiple Resonant Controller: A Deep Reinforcement Learning Approach
Zhang, Xiaojie
Lei, Wanjun
Dai, Yuqi
Tang, Qibo
Yuan, Xiaojie
Xiao, Zhongxiu
Lv, Gaotai
2020 IEEE 9TH INTERNATIONAL POWER ELECTRONICS AND MOTION CONTROL CONFERENCE (IPEMC2020-ECCE ASIA), 2020, : 2578 - 2581
[47] THE EFFECT OF FLOW BARRIERS ON WATERFLOODING OF HETEROGENEOUS RESERVOIRS
WEBER, R
PUSCH, G
MULLER, T
ERDOL & KOHLE ERDGAS PETROCHEMIE, 1987, 40 (11): : 467 - 474
[48] A New Method for Development Evaluation of Waterflooding Reservoirs
Feng, Jinde
Luo, Ruilan
Tang, Wei
Zhang, Hujun
Liu, Ting
Wang, Donghui
PROCEEDINGS OF THE INTERNATIONAL FIELD EXPLORATION AND DEVELOPMENT CONFERENCE 2017, 2019, : 1414 - 1422
[49] WATERFLOODING OF RESERVOIRS WITH OIL-SATURATED THICKNESS
SAZONOV, BF
ZHITOMIRSKII, VM
KOVALEVA, GA
NEFTYANOE KHOZYAISTVO, 1989, (08): : 36 - &
[50] WATERFLOODING OIL-RESERVOIRS WITH BOTTOM WATER
ISLAM, MR
ALI, SMF
JOURNAL OF CANADIAN PETROLEUM TECHNOLOGY, 1989, 28 (03): : 59 - 66

← 1 2 3 4 5 →