Ensemble regression based on polynomial regression-based decision tree and its application in the in-situ data of tunnel boring machine

被引:99
|
作者
Shi, Maolin [1 ,2 ]
Hu, Weifei [3 ]
Li, Muxi [4 ]
Zhang, Jian [4 ]
Song, Xueguan [5 ]
Sun, Wei [5 ]
机构
[1] Jiangsu Univ, Sch Agr Engn, Zhenjiang 212013, Peoples R China
[2] Jiangsu Univ, Zhonghui Rubber Technol Co Ltd, Wuxi 214183, Peoples R China
[3] Zhejiang Univ, State Key Lab Fluid Power & Mechatron Syst, Hangzhou 310027, Peoples R China
[4] Jiangsu Univ, Fac Civil Engn & Mech, Inst Struct Hlth Monitoring, Zhenjiang 212013, Peoples R China
[5] Dalian Univ Technol, Sch Mech Engn, Dalian 116024, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Regression; Ensemble learning; Bagging; Polynomial regression; Decision tree; Tunnel boring machine; PREDICTION;
D O I
10.1016/j.ymssp.2022.110022
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
Regression is an important branch of engineering data mining tasks, aiming to establish a regression model to predict the output of interest based on the input variables. To meet the requirements of different missions, the engineering system usually changes its operation status so that the regression relationship between the output and input variables changes. In this paper, two ensemble regression methods are proposed based on polynomial regression and decision tree, in which sample space partition is used to improve the prediction accuracy and ensemble strategy is used to improve the performance robustness of the regression model. The first ensemble regression method (named PRB) is developed under the framework of bagging. The second ensemble regression method (named PRF) is similar to the first one, but feature randomness is introduced. At each node of the polynomial regression-based decision tree, the polynomial regression error is used to select the best splitting feature. The experiments on a series of mathematical functions and engineering datasets indicate that the proposed ensemble regression methods outperform the polynomial regression-based decision tree, the polynomial regression method, and the random forest method in most experiments. The proposed ensemble regression methods are applied to model the dataset of a tunnel boring machine, aiming to predict the earth pressure based on the operation parameters of the cutterhead. The results indicate that the proposed two ensemble regression methods produce more accurate prediction results, and the PRF method performs best in most experiments.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Performance Evaluation of Regression-Based Machine Learning Models for Modeling Reference Evapotranspiration with Temperature Data
    Diamantopoulou, Maria J.
    Papamichail, Dimitris M.
    HYDROLOGY, 2024, 11 (07)
  • [42] Complementing real datasets with simulated data: a regression-based approach
    M. A. Ortiz-Barrios
    J. Lundström
    J. Synnott
    E. Järpe
    A. Sant’Anna
    Multimedia Tools and Applications, 2020, 79 : 34301 - 34324
  • [43] Multinomial logistic regression-based feature selection for hyperspectral data
    Pal, Mahesh
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2012, 14 (01): : 214 - 220
  • [44] Logistic regression-based pattern classifiers for symbolic interval data
    de Souza, Renata M. C. R.
    Queiroz, Diego C. F.
    Cysneiros, Francisco Jose A.
    PATTERN ANALYSIS AND APPLICATIONS, 2011, 14 (03) : 273 - 282
  • [45] Probabilistic interpretation of regression-based downscaled seasonal ensemble predictions with the estimation of uncertainty
    Min, Young-Mi
    Kryjov, Vladimir N.
    Oh, Jai-Ho
    JOURNAL OF GEOPHYSICAL RESEARCH-ATMOSPHERES, 2011, 116
  • [46] A Robust Approach of Regression-Based Statistical Matching for Continuous Data
    Sohn, Sooncheol
    Jhun, Myoungshic
    KOREAN JOURNAL OF APPLIED STATISTICS, 2012, 25 (02) : 331 - 339
  • [47] Logistic regression-based pattern classifiers for symbolic interval data
    Renata M. C. R. de Souza
    Diego C. F. Queiroz
    Francisco José A. Cysneiros
    Pattern Analysis and Applications, 2011, 14 : 273 - 282
  • [48] Linear regression-based feature selection for microarray data classification
    Hasan, Md Abid
    Hasan, Md Kamrul
    Mottalib, M. Abdul
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2015, 11 (02) : 167 - 179
  • [49] A plane regression-based sequence forecast algorithm for stream data
    Zhao, F
    Li, QH
    Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 1559 - 1562
  • [50] A Penalized Regression-Based Biclustering Approach in Gene Expression Data
    Wei, Mengxi
    Zheng, Zhi
    Zhang, Weiping
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2024,