A Study on Data Pre-Processing and Accident Prediction Modelling for Occupational Accident Analysis in the Construction Industry

被引:15
|
作者
Lee, Jae Yun [1 ]
Yoon, Young Geun [1 ]
Oh, Tae Keun [1 ,2 ]
Park, Seunghee [3 ]
Ryu, Sang Il [4 ]
机构
[1] Incheon Natl Univ, Dept Safety Engn, Incheon 22012, South Korea
[2] Incheon Natl Univ, Res Inst Engn & Technol, Incheon 22012, South Korea
[3] Sungkyunkwan Univ, Sch Civil Architectural Engn & Landscape Architec, Gyeonggi 440746, South Korea
[4] Dong Eui Univ, Dept Fire Adm & Disaster Management, 176 Eomgwang Ro, Busan 47340, South Korea
来源
APPLIED SCIENCES-BASEL | 2020年 / 10卷 / 21期
基金
新加坡国家研究基金会;
关键词
occupational accident; correlation analysis; support vector machine; ensemble; data preprocessing; latent class clustering analysis; alluvial flow diagram; SAFETY MANAGEMENT; DECISION; TREES; RISK;
D O I
10.3390/app10217949
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In the construction industry, it is difficult to predict occupational accidents because various accident characteristics arise simultaneously and organically in different types of work. Furthermore, even when analyzing occupational accident data, it is difficult to deduce meaningful results because the data recorded by the incident investigator are qualitative and include a wide variety of data types and categories. Recently, numerous studies have used machine learning to analyze the correlations in such complex construction accident data; however, heretofore the focus has been on predicting severity with various variables, and several limitations remain when deriving the correlations between features from various variables. Thus, this paper proposes a data processing procedure that can efficiently manipulate accident data using optimal machine learning techniques and derive and systematize meaningful variables to rationally approach such complex problems. In particular, among the various variables, the most influential variables are derived through methods such as clustering, chi-square, Cramer's V, and predictor importance; then, the analysis is simplified by optimally grouping the variables. For accident data with optimal variables and elements, a predictive model is constructed between variables, using a support vector machine and decision-tree-based ensemble; then, the correlation between the dependent and independent variables is analyzed through an alluvial flow diagram for several cases. Therefore, a new processing procedure has been introduced in data preprocessing and accident prediction modelling to overcome difficulties from complex and diverse construction occupational accident data, and effective accident prevention is possible by deriving correlations of construction accidents using this process.
引用
收藏
页码:1 / 23
页数:23
相关论文
共 50 条
  • [1] Accident Compensation of Occupational Fatalities in the Construction Industry
    Liao, Chia-Wen
    Chiang, Tsung-Lung
    [J]. CIVIL, STRUCTURAL AND ENVIRONMENTAL ENGINEERING, PTS 1-4, 2014, 838-841 : 3167 - +
  • [2] A Tree Based Approach for Data Pre-processing and Pattern Matching for Accident Mapping on Road Networks
    Kumar, Arvind
    Johari, Suchi
    Proch, Deepak
    Kumar, Pardeep
    Chauhan, Durg Singh
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES INDIA SECTION A-PHYSICAL SCIENCES, 2019, 89 (03) : 453 - 466
  • [3] A Tree Based Approach for Data Pre-processing and Pattern Matching for Accident Mapping on Road Networks
    Arvind Kumar
    Suchi Johari
    Deepak Proch
    Pardeep Kumar
    Durg Singh Chauhan
    [J]. Proceedings of the National Academy of Sciences, India Section A: Physical Sciences, 2019, 89 : 453 - 466
  • [4] Analysis of the Development of Accident Situations in the Construction Industry
    Hola, Bozena
    Szostak, Mariusz
    [J]. XXIII R-S-P SEMINAR, THEORETICAL FOUNDATION OF CIVIL ENGINEERING (23RSP) (TFOCE 2014), 2014, 91 : 429 - 434
  • [5] DEMOGRAPHIC ANALYSIS OF OCCUPATIONAL ACCIDENT OCCURRENCE IN MANUFACTURING INDUSTRY
    Rohani, Jafri Mohd
    Johari, Mohamed Fitri
    Hamid, Wan Harun Wan
    Atan, Hood
    [J]. JURNAL TEKNOLOGI, 2015, 77 (01):
  • [6] Accident patterns and prevention measures for fatal occupational falls in the construction industry
    Chi, CF
    Chang, TC
    Ting, HI
    [J]. APPLIED ERGONOMICS, 2005, 36 (04) : 391 - 400
  • [7] Data Pre-processing from Production Processes for Analysis in Automotive Industry
    Simoncicova, Veronika
    Hrcka, Lukas
    Tadanai, Ondrej
    Tanuska, Pavol
    Vazan, Pavel
    [J]. CENTRAL EUROPEAN CONFERENCE ON INFORMATION AND INTELLIGENT SYSTEMS (CECIIS 2016), 2016, : 17 - 21
  • [8] Work time loss prediction by exploiting occupational accident data
    Marcoulaki, E. C.
    Papazoglou, I. A.
    Konstandinidou, M.
    [J]. ADVANCES IN SAFETY, RELIABILITY AND RISK MANAGEMENT, 2012, : 124 - 130
  • [9] Analysis of importance of pre-processing in prediction of hypertension
    K. Shobha
    S. Nickolas
    [J]. CSI Transactions on ICT, 2018, 6 (2) : 209 - 214
  • [10] Causes and Effects of Accident at Construction Site: A Study for the Construction Industry in Bangladesh
    Ahmed, Shakil
    [J]. INTERNATIONAL JOURNAL OF SUSTAINABLE CONSTRUCTION ENGINEERING AND TECHNOLOGY, 2019, 10 (02): : 18 - 40