Improvement of random forest by multiple imputation applied to tower crane accident prediction with missing data

被引:14
|
作者
Jiang, Ling [1 ]
Zhao, Tingsheng [1 ]
Feng, Chuxuan [1 ]
Zhang, Wei [1 ]
机构
[1] Huazhong Univ Sci & Technol, Wuhan, Peoples R China
基金
国家重点研发计划;
关键词
Random forest; Tower crane; Missing data; Accident prediction; Multiple imputation; CONSTRUCTION SITES; SAFETY; SELECTION;
D O I
10.1108/ECAM-07-2021-0606
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Purpose This research is aimed at predicting tower crane accident phases with incomplete data. Design/methodology/approach The tower crane accidents are collected for prediction model training. Random forest (RF) is used to conduct prediction. When there are missing values in the new inputs, they should be filled in advance. Nevertheless, it is difficult to collect complete data on construction site. Thus, the authors use multiple imputation (MI) method to improve RF. Finally the prediction model is applied to a case study. Findings The results show that multiple imputation RF (MIRF) can effectively predict tower crane accident when the data are incomplete. This research provides the importance rank of tower crane safety factors. The critical factors should be focused on site, because the missing data affect the prediction results seriously. Also the value of critical factors influences the safety of tower crane. Practical implication This research promotes the application of machine learning methods for accident prediction in actual projects. According to the onsite data, the authors can predict the accident phase of tower crane. The results can be used for tower crane accident prevention. Originality/value Previous studies have seldom predicted tower crane accidents, especially the phase of accident. This research uses tower crane data collected on site to predict the phase of the tower crane accident. The incomplete data collection is considered in this research according to the actual situation.
引用
收藏
页码:1222 / 1242
页数:21
相关论文
共 50 条
  • [1] Multiple imputation of ordinal missing not at random data
    Hammon, Angelina
    ASTA-ADVANCES IN STATISTICAL ANALYSIS, 2023, 107 (04) : 671 - 692
  • [2] Multiple imputation of ordinal missing not at random data
    Angelina Hammon
    AStA Advances in Statistical Analysis, 2023, 107 : 671 - 692
  • [3] Multiple imputation of binary multilevel missing not at random data
    Hammon, Angelina
    Zinn, Sabine
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2020, 69 (03) : 547 - 564
  • [4] Missing Data Imputation Through the Use of the Random Forest Algorithm
    Pantanowitz, Adam
    Marwala, Tshilidzi
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, 2009, 61 : 53 - 62
  • [5] Missing Data and Multiple Imputation in Rheumatoid Arthritis Registries Using Sequential Random Forest Method
    Al-Saber, Ahmed
    Al-Herz, Adeeba
    Pan, Jiazhu
    Saleh, Khulood
    Al-Awadhi, Adel
    Al-Kandari, Waleed
    Hasan, Eman
    Ghanem, Aqeel
    Hussain, Mohammed
    Ali, Yaser
    Nahar, Ebrahim
    Alenizi, Ahmad
    Hayat, Sawsan
    Abutiban, Fatemah
    Aldei, Ali
    Alkadi, Amjad
    Alhajeri, Heba
    Behbehani, Husain
    Alhadhood, Naser
    Mokaddem, Khaled
    Khadrawy, Ahmed
    Fazal, Ammad
    Zaman, Agaz
    Mazloum, Ghada
    Bartella, Youssef
    Hamed, Sally
    Alsouk, Ramia
    ARTHRITIS & RHEUMATOLOGY, 2020, 72
  • [6] MISSING DATA AND MULTIPLE IMPUTATION IN RHEUMATOID ARTHRITIS REGISTRIES USING SEQUENTIAL RANDOM FOREST METHOD
    Alsaber, A.
    Al-Herz, A.
    Pan, J.
    Saleh, K.
    Al-Awadhi, A.
    Al-Kandari, W.
    Hasan, E.
    Ghanem, A.
    Hussain, M.
    Ali, Y.
    Nahar, E.
    Alenizi, A.
    Hayat, S.
    Abutiban, F.
    Aledei, A.
    Al-Qadhi, A.
    Alhajeri, H.
    Behbehani, H.
    Alhadhood, N.
    ANNALS OF THE RHEUMATIC DISEASES, 2020, 79 : 515 - 515
  • [7] Missing Data Imputation Method Combining Random Forest and Generative Adversarial Imputation Network
    Ou, Hongsen
    Yao, Yunan
    He, Yi
    SENSORS, 2024, 24 (04)
  • [8] Auxiliary Variables in Multiple Imputation When Data Are Missing Not at Random
    Mustillo, Sarah
    Kwon, Soyoung
    JOURNAL OF MATHEMATICAL SOCIOLOGY, 2015, 39 (02): : 73 - 91
  • [9] Missing Data and Multiple Imputation
    Cummings, Peter
    JAMA PEDIATRICS, 2013, 167 (07) : 656 - 661
  • [10] Multiple imputation for missing data
    Patrician, PA
    RESEARCH IN NURSING & HEALTH, 2002, 25 (01) : 76 - 84