Forming a new small sample deep learning model to predict total organic carbon content by combining unsupervised learning with semisupervised learning

被引:62
|
作者
Zhu, Linqi [1 ,2 ]
Zhang, Chong [1 ,2 ]
Zhang, Chaomo [1 ,2 ]
Zhang, Zhansong [1 ,2 ]
Nie, Xin [1 ,2 ]
Zhou, Xueqing [1 ,2 ]
Liu, Weinan [1 ,2 ]
Wang, Xiu [1 ,2 ]
机构
[1] Yangtze Univ, Minist Educ, Key Lab Explorat Technol Oil & Gas Resources, Wuhan 430100, Hubei, Peoples R China
[2] Yangtze Univ, Hubei Cooperat Innovat Ctr Unconvent Oil & Gas, Wuhan 430100, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Small sample; Deep learning; Integrated deep learning model; Coarse-detailed feature extraction; Total organic carbon content; NEURAL-NETWORKS; GAS-FIELD; MACHINE; SHALE; REGRESSION; LOGS; INTELLIGENT; FRAMEWORK; RESERVOIR;
D O I
10.1016/j.asoc.2019.105596
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The total organic carbon (TOC) content is a parameter that is directly used to evaluate the hydrocarbon generation capacity of a reservoir. For a reservoir, accurately calculating TOC using well logging curves is a problem that needs to be solved. Machine learning models usually yield the most accurate results. Problems of existing machine learning models that are applied to well logging interpretations include poor feature extraction methods and limited ability to learn complex functions. However, logging interpretation is a small sample problem, and traditional deep learning with strong feature extraction ability cannot be directly used; thus, a deep learning model suitable for logging small sample features, namely, a combination of unsupervised learning and semisupervised learning in an integrated DLM (IDLM), is proposed in this paper and is applied to the TOC prediction problem. This study is also the first systematic application of a deep learning model in a well logging interpretation. First, the model uses a stacked extreme learning machine sparse autoencoder (SELM-SAE) unsupervised learning method to perform coarse feature extraction for a large number of unlabeled samples, and a feature extraction layer consisting of multiple hidden layers is established. Then, the model uses the deep Boltzmann machine (DBM) semisupervised learning method to learn a large number of unlabeled samples and a small number of labeled samples (the input is extracted from logging curve values into SELM-SAE extracted features), and the SELM-SAE and DBM are integrated to form a deep learning model (DLM). Finally, multiple DLMs are combined to form an IDLM algorithm through an improved weighted bagging algorithm. A total of 2381 samples with an unlabeled logging response from 4 wells in 2 shale gas areas and 326 samples with determined TOC values are used to train the model. The model is compared with 11 other machine learning models, and the IDLM achieves the highest precision. Moreover, the simulation shows that for the TOC prediction problem, when the number of labeled samples included in the training is greater than 20, even if this number of samples is used to train 10 hidden layer IDLMs, the trained model has a very low overfitting probability and exhibits the potential to exceed the accuracies of other models. Relative to the existing mainstream shallow model, the IDLM based on a DLM provides the most advanced performance and is more effective. This method implements a small sample deep learning algorithm for TOC prediction and can feasibly use deep learning to solve logging interpretation problems and other small sample set problems for the first time. The IDLM achieves high precision and provides novel insights that can aid in oil and gas exploration and development. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Combining supervised and unsupervised machine learning algorithms to predict the learners' learning styles
    El Aissaoui, Ouafae
    El Alami El Madani, Yasser
    Oughdir, Lahcen
    El Allioui, Youssouf
    SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS2018), 2019, 148 : 87 - 96
  • [2] Supervised, semisupervised, and unsupervised learning of the Domany-Kinzel model
    Tuo, Kui
    Li, Wei
    Deng, Shengfeng
    Zhu, Yueying
    PHYSICAL REVIEW E, 2024, 110 (02)
  • [3] Learning from Small Sample Sets by Combining Unsupervised Meta-Training with CNNs
    Wang, Yu-Xiong
    Hebert, Martial
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [4] Deep learning of total electron content
    Omid Memarian Sorkhabi
    SN Applied Sciences, 2021, 3
  • [5] Deep learning of total electron content
    Sorkhabi, Omid Memarian
    SN APPLIED SCIENCES, 2021, 3 (07):
  • [6] Prediction of Total Organic Carbon Content in Deep Marine Shale Reservoirs Based on a Super Hybrid Machine Learning Model
    Liu, Yi
    Li, Na
    Li, Chengyong
    Jiang, Jiayu
    Wu, Xiuhui
    Liang, Haipeng
    Zhang, Dongxu
    Hu, Xiuquan
    ENERGY & FUELS, 2024, 38 (18) : 17483 - 17498
  • [7] Research on the Deep Learning of the Small Sample Data based on Transfer Learning
    Zhao, Wei
    GREEN ENERGY AND SUSTAINABLE DEVELOPMENT I, 2017, 1864
  • [8] A Novel Deep Density Model for Unsupervised Learning
    Yang, Xi
    Huang, Kaizhu
    Zhang, Rui
    Goulermas, John Y.
    COGNITIVE COMPUTATION, 2019, 11 (06) : 778 - 788
  • [9] A Novel Deep Density Model for Unsupervised Learning
    Xi Yang
    Kaizhu Huang
    Rui Zhang
    John Y. Goulermas
    Cognitive Computation, 2019, 11 : 778 - 788
  • [10] Combining signal decomposition and deep learning model to predict noisy runoff coefficient
    Rahi, Arash
    Rahmati, Mehdi
    Dari, Jacopo
    Bogena, Heye
    Vereecken, Harry
    Morbidelli, Renato
    JOURNAL OF HYDROLOGY, 2024, 641