Towards Automated Lithology Classification in NATM Tunnel: A Data-Driven Solution for Multi-dimensional Imbalanced Data

被引:0
|
作者
Li, Yang [1 ,2 ]
Chen, Jiayao [1 ,2 ,4 ]
Fang, Qian [1 ,2 ]
Zhang, Dingli [1 ,2 ]
Huang, Wengui [3 ]
机构
[1] Beijing Jiaotong Univ, Sch Civil Engn, Beijing 100044, Peoples R China
[2] Beijing Jiaotong Univ, Key Lab Urban Underground Engn, Minist Educ, Beijing 100044, Peoples R China
[3] Teesside Univ, Sch Comp Engn & Digital Technol, Middlesbrough TS1 3BA, England
[4] East China Jiaotong Univ, State Key Lab Performance Monitoring & Protecting, Nanchang, Jiangxi, Peoples R China
关键词
New Austrian tunneling method; Measurement-while-drilling; Lithology classification; Machine learning; Multi-dimensional imbalanced data; ROCK STRENGTH PARAMETERS; RANDOM FORESTS; PREDICTION; SYSTEM; RECOGNITION; TECHNOLOGY; TESTS; MODEL; INDEX;
D O I
10.1007/s00603-024-04287-6
中图分类号
P5 [地质学];
学科分类号
0709 ; 081803 ;
摘要
To fully grasp the lithology of unexcavated tunnel geology, a correlation database using measurement-while-drilling (MWD) information from the NATM tunnel excavation process was established, resulting in a multi-dimensional imbalanced dataset consisting of 7216 entries. By integrating borehole imaging and expert interpretation, drilling parameters were aligned with lithology data. A hybrid ensemble model, combining adaptive synthetic sampling (ADASYN), grid search (GS) hyperparameter optimization, and eXtreme gradient boosting (XGBoost), is proposed for intelligent lithology classification. Various machine learning models, incorporating hyperparameter optimization and oversampling algorithms, were employed, cumulatively generating 12 classifiers for Macro F1 performance comparison. Comprehensive analysis showed that the GS-ADASYN-XGBoost algorithm outperformed the other hybrid models in classifying different lithologies. Water pressure was identified as the key feature influencing lithology classification, followed by water flow. Setting the oversampling proportion to 0.2, the ADASYN method effectively optimized the data imbalance ratio, significantly enhancing classifier performance. This improvement was most notable for the least represented lithology category, chlorite, with an increase of 1.27 times compared to no oversampling. The proposed model provides valuable insights for geological interpretation of the tunnel face. A hybrid GS-ADASYN-XGBoost model is proposed for classifying lithologies.A database with 7216 MWD from NATM tunnel excavation is established.Borehole imaging and expert interpretation align drilling parameters with lithology.Multi-dimensional data imbalance is effectively optimized by ADASYN.
引用
收藏
页码:2349 / 2366
页数:18
相关论文
共 50 条
  • [21] Multi-dimensional Data Inspection for Supervised Classification with Eigen Transformation Classification Trees
    De Bruyne, Steven
    Plastria, Frank
    PRICAI 2010: TRENDS IN ARTIFICIAL INTELLIGENCE, 2010, 6230 : 583 - +
  • [22] Towards Data-Driven Hybrid Composition of Data Mining Multi-agent Systems
    Neruda, Roman
    SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING, 2009, 209 : 271 - 281
  • [23] Multi-dimensional failure risk load prediction of electric motor based on physical model and data-driven approach
    Wang, Zhen
    Zhao, Lihui
    Zhang, Dongdong
    Kong, Zhiguo
    Qin, Chunyang
    Yan, Chuliang
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART O-JOURNAL OF RISK AND RELIABILITY, 2025,
  • [24] Data-Driven Nonlinear Near-Optimal Regulation Based on Multi-Dimensional Taylor Network Dynamic Programming
    Sun, Qi-Ming
    Zhang, Chao
    Jiang, Nan-Yun
    Yu, Jing-Jing
    Xu, Lei
    IEEE ACCESS, 2020, 8 : 36476 - 36484
  • [25] Multi-dimensional features based data-driven state of charge estimation method for LiFePO4 batteries
    Liu, Mengmeng
    Xu, Jun
    Jiang, Yihui
    Mei, Xuesong
    ENERGY, 2023, 274
  • [26] Data-driven nonlinear near-optimal regulation based on multi-dimensional taylor network dynamic programming
    Sun, Qi-Ming
    Zhang, Chao
    Jiang, Nan-Yun
    Yu, Jing-Jing
    Xu, Lei
    Sun, Qi-Ming (sqm1122345@126.com), 1600, Institute of Electrical and Electronics Engineers Inc., United States (08): : 36476 - 36484
  • [27] Impacts of Multi-Dimensional Geometrical Uncertainties on Field Characteristics of Traveling-Wave Tube in Data-Driven Perspective
    Liu, Kegang
    Xue, Qianzhong
    Guo, Naining
    Song, Wenke
    Zhao, Ding
    Ding, Haibing
    IEEE TRANSACTIONS ON ELECTRON DEVICES, 2022, 69 (03) : 1435 - 1441
  • [28] Active Pattern Classification for Automatic Visual Exploration of Multi-Dimensional Data
    Li, Jie
    Tan, Huailian
    Huang, Wentao
    APPLIED SCIENCES-BASEL, 2022, 12 (22):
  • [29] MVPA-Light: A Classification and Regression Toolbox for Multi-Dimensional Data
    Treder, Matthias S.
    FRONTIERS IN NEUROSCIENCE, 2020, 14
  • [30] Comparison of neural and statistical algorithms for supervised classification of multi-dimensional data
    Li, TS
    Chen, CY
    Su, CT
    INTERNATIONAL JOURNAL OF INDUSTRIAL ENGINEERING-THEORY APPLICATIONS AND PRACTICE, 2003, 10 (01): : 73 - 81