REINFORCEMENT LEARNING FROM PIXELS: WATERFLOODING OPTIMIZATION

被引:0
|
作者
Miftakhov, Ruslan [1 ]
Efremov, Igor [1 ]
Al-Qasim, Abdulaziz S. [2 ]
机构
[1] GridPoint Dynam, Moscow, Russia
[2] Saudi Aramco, Dhahran, Eastern Provinc, Saudi Arabia
关键词
Reinforcement Learning; Optimization; Reservoir Simulation; Waterflooding; PERFORMANCE; FIELD; FLOW;
D O I
暂无
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
The application of Artificial Intelligence (AI) methods in the petroleum industry gain traction in recent years. In this paper, Deep Reinforcement Learning (RL) is used to maximize the Net Present Value ( NPV) of waterflooding by changing the water injection rate. This research is the first step towards showing that the use of pixel information for reinforcement learning provides many advantages, such as a fundamental understanding of reservoir physics by controlling changes in pressure and saturation without directly accounting for the reservoir petrophysical properties and wells. The optimization routine based on RL by pixel data is tested on the 2D model, which is a vertical section of the SPE 10 model. It has been shown that RL can optimize waterflooding in a 2D compressible reservoir with the 2-phase flow (oil-water). The proposed optimization method is an iterative process. In the first few thousands of updates, NPV remains in the baseline since it takes more time to converge from raw pixel data than to use classical well production/injection rate information. RL optimization resulted in improving the NPV by 15 percent, where the optimum scenario shows less watercut values and more stable production in contrast to baseline optimization. Additionally, we evaluated the impact of selecting the different action set for optimization and examined two cases where water injection well can change injection pressure with a step of 200 psi and 600 psi. The results show that in the second case, RL optimization is exploiting the limitation of the reservoir simulation engine and tries to imitate a cycled injection regime, which results in a 7% higher NPV than the first case.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Fluid mixing optimization with reinforcement learning
    Mikito Konishi
    Masanobu Inubushi
    Susumu Goto
    Scientific Reports, 12
  • [22] Disassembly line optimization with reinforcement learning
    Kegyes, Tamas
    Sule, Zoltan
    Abonyi, Janos
    CENTRAL EUROPEAN JOURNAL OF OPERATIONS RESEARCH, 2024, 32 (04) : 1115 - 1142
  • [23] Reinforcement learning for deep portfolio optimization
    Yan, Ruyu
    Jin, Jiafei
    Han, Kun
    ELECTRONIC RESEARCH ARCHIVE, 2024, 32 (09): : 5176 - 5200
  • [24] Multidisciplinary Optimization in Decentralized Reinforcement Learning
    Thanh Nguyen
    Mukhopadhyay, Snehasis
    2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 779 - 784
  • [25] Device Placement Optimization with Reinforcement Learning
    Mirhoseini, Azalia
    Pham, Hieu
    Le, Quoc, V
    Steiner, Benoit
    Larsen, Rasmus
    Zhou, Yuefeng
    Kumar, Naveen
    Norouzi, Mohammad
    Bengio, Samy
    Dean, Jeff
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [26] Parametric Circuit Optimization with Reinforcement Learning
    Tang, Changcheng
    Ye, Zuochang
    Wang, Yan
    2018 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI), 2018, : 197 - 202
  • [27] Reinforcement Learning for Radar Waveform Optimization
    Coutino, Mario
    Uysal, Faruk
    2023 IEEE RADAR CONFERENCE, RADARCONF23, 2023,
  • [28] Reinforcement learning for opportunistic maintenance optimization
    Kuhnle, Andreas
    Jakubik, Johannes
    Lanza, Gisela
    PRODUCTION ENGINEERING-RESEARCH AND DEVELOPMENT, 2019, 13 (01): : 33 - 41
  • [29] Accelerating Quadratic Optimization with Reinforcement Learning
    Ichnowski, Jeffrey
    Jain, Paras
    Stellato, Bartolomeo
    Banjac, Goran
    Luo, Michael
    Borrelli, Francesco
    Gonzalez, Joseph E.
    Stoica, Ion
    Goldberg, Ken
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [30] Optimization of colour generation from dielectric nanostructures using reinforcement learning
    Sajedian, Iman
    Badloe, Trevon
    Rho, Junsuk
    OPTICS EXPRESS, 2019, 27 (04): : 5874 - 5883