Forming a new small sample deep learning model to predict total organic carbon content by combining unsupervised learning with semisupervised learning

Cited by: 62

Authors:

Zhu, Linqi [1,2]

Zhang, Chong [1,2]

Zhang, Chaomo [1,2]

Zhang, Zhansong [1,2]

Nie, Xin [1,2]

Zhou, Xueqing [1,2]

Liu, Weinan [1,2]

Wang, Xiu [1,2]

Affiliations:

[1] Yangtze Univ, Minist Educ, Key Lab Explorat Technol Oil & Gas Resources, Wuhan 430100, Hubei, Peoples R China

[2] Yangtze Univ, Hubei Cooperat Innovat Ctr Unconvent Oil & Gas, Wuhan 430100, Hubei, Peoples R China

Source:

APPLIED SOFT COMPUTING | 2019 / Volume 83

Funding:

National Natural Science Foundation of China;

Keywords:

Small sample; Deep learning; Integrated deep learning model; Coarse-detailed feature extraction; Total organic carbon content; NEURAL-NETWORKS; GAS-FIELD; MACHINE; SHALE; REGRESSION; LOGS; INTELLIGENT; FRAMEWORK; RESERVOIR;

DOI:

10.1016/j.asoc.2019.105596

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The total organic carbon (TOC) content is a parameter that is directly used to evaluate the hydrocarbon generation capacity of a reservoir. For a reservoir, accurately calculating TOC using well logging curves is a problem that needs to be solved. Machine learning models usually yield the most accurate results. Problems of existing machine learning models that are applied to well logging interpretations include poor feature extraction methods and limited ability to learn complex functions. However, logging interpretation is a small sample problem, and traditional deep learning with strong feature extraction ability cannot be directly used; thus, a deep learning model suitable for logging small sample features, namely, a combination of unsupervised learning and semisupervised learning in an integrated DLM (IDLM), is proposed in this paper and is applied to the TOC prediction problem. This study is also the first systematic application of a deep learning model in a well logging interpretation. First, the model uses a stacked extreme learning machine sparse autoencoder (SELM-SAE) unsupervised learning method to perform coarse feature extraction for a large number of unlabeled samples, and a feature extraction layer consisting of multiple hidden layers is established. Then, the model uses the deep Boltzmann machine (DBM) semisupervised learning method to learn a large number of unlabeled samples and a small number of labeled samples (the input is extracted from logging curve values into SELM-SAE extracted features), and the SELM-SAE and DBM are integrated to form a deep learning model (DLM). Finally, multiple DLMs are combined to form an IDLM algorithm through an improved weighted bagging algorithm. A total of 2381 samples with an unlabeled logging response from 4 wells in 2 shale gas areas and 326 samples with determined TOC values are used to train the model. The model is compared with 11 other machine learning models, and the IDLM achieves the highest precision. 
Moreover, the simulation shows that for the TOC prediction problem, when the number of labeled samples included in the training is greater than 20, even if this number of samples is used to train 10 hidden layer IDLMs, the trained model has a very low overfitting probability and exhibits the potential to exceed the accuracies of other models. Relative to the existing mainstream shallow model, the IDLM based on a DLM provides the most advanced performance and is more effective. This method implements a small sample deep learning algorithm for TOC prediction and can feasibly use deep learning to solve logging interpretation problems and other small sample set problems for the first time. The IDLM achieves high precision and provides novel insights that can aid in oil and gas exploration and development. (C) 2019 Elsevier B.V. All rights reserved.
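
The final step described in the abstract, combining several DLMs into one predictor through weighted bagging, can be illustrated with a minimal sketch. This record does not specify the paper's "improved" weighting scheme, so the sketch assumes a generic variant: each base model is trained on a bootstrap resample and weighted by the inverse of its out-of-bag mean squared error. Simple ridge regressors stand in for the DLMs, the data are synthetic, and all function names are hypothetical.

```python
import numpy as np

def fit_ridge(X, y, lam=1e-2):
    """Closed-form ridge regression with a bias column (stand-in base learner)."""
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])
    A = Xb.T @ Xb + lam * np.eye(Xb.shape[1])
    return np.linalg.solve(A, Xb.T @ y)

def predict_ridge(w, X):
    """Apply a fitted ridge model to new rows."""
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])
    return Xb @ w

def weighted_bagging(X, y, n_models=10, seed=0):
    """Train n_models base learners on bootstrap resamples.

    Assumption: weights are proportional to 1 / out-of-bag MSE,
    which is a generic choice and not the paper's scheme.
    """
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    models, errors = [], []
    for _ in range(n_models):
        idx = rng.integers(0, n, size=n)        # bootstrap sample (with replacement)
        oob = np.setdiff1d(np.arange(n), idx)   # rows not drawn: out-of-bag set
        if oob.size == 0:                       # extremely unlikely fallback
            oob = np.arange(n)
        w = fit_ridge(X[idx], y[idx])
        mse = np.mean((predict_ridge(w, X[oob]) - y[oob]) ** 2)
        models.append(w)
        errors.append(mse)
    inv = 1.0 / (np.asarray(errors) + 1e-12)    # lower error gives higher weight
    return models, inv / inv.sum()              # weights sum to 1

def ensemble_predict(models, weights, X):
    """Weighted average of the base-model predictions."""
    preds = np.stack([predict_ridge(w, X) for w in models])  # (n_models, n_rows)
    return weights @ preds

# Synthetic demonstration data (not from the paper).
rng = np.random.default_rng(42)
X = rng.normal(size=(200, 5))
true_w = np.array([0.5, -1.0, 2.0, 0.0, 0.3])
y = X @ true_w + 0.1 * rng.normal(size=200)

models, weights = weighted_bagging(X, y)
y_hat = ensemble_predict(models, weights, X)
```

In this sketch the out-of-bag rows play the role of a held-out check for each base model, so no separate validation split is needed; a different weighting rule could be swapped in by changing only the line that computes `inv`.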


Pages: 23
