Construction-Accident Narrative Classification Using Shallow and Deep Learning

Cited by: 13
Authors
Qiao, Jianfeng [1, 2]
Wang, Changfeng [3]
Guan, Shuang [3]
Shuran, Lv [1, 2]
Affiliations
[1] Capital Univ Econ & Business, Dept Management & Engn, Beijing 100070, Peoples R China
[2] Dept Beijing Key Lab Megareg Sustainable Dev Mode, Beijing 100070, Peoples R China
[3] Beijing Univ Posts & Telecommun, Dept Econ & Management, Beijing 100876, Peoples R China
Keywords
Data mining; Natural language processing; Machine learning; Text mining; Autocoding; Large administrative databases; Classifying injury narratives; Text classification; Coding causation; Neural networks; Tool
DOI
10.1061/(ASCE)CO.1943-7862.0002354
Chinese Library Classification (CLC) number
TU [Building science]
Discipline classification code
0813
Abstract
Extracting knowledge from past accidents is crucial to preventing future ones, and text mining of accident reports requires classifying their narratives. This autocoding process can be seen as a multiclass classification problem with an imbalanced data set. We evaluated the performance of several state-of-the-art machine learning methods, including 10 shallow learning methods [Rocchio, k-nearest neighbors, linear regression, naive Bayes, decision tree, random forest, gradient boosting, bootstrap aggregating, support vector machine (SVM), and shallow neural network] and five deep learning methods [deep neural network, convolutional neural network (CNN), recurrent neural network with long short-term memory, recurrent neural network with a gated recurrent unit, and recurrent CNN]. The input data set contained 4,770 construction accident reports from the Occupational Safety and Health Administration (OSHA). After the narratives were relabeled based on the Occupational Injury and Illness Classification System (OIICS), the accuracy of all shallow classifiers improved significantly compared with that reported in previous studies. SVM and CNN achieved the highest accuracies, 0.91 and 0.90, among the shallow and deep learning methods, respectively. Misclassifications occur because the training data lack diversity for minority classes, some cases belong to multiple classes, and some divisions share the same key feature words. When a new data set becomes available, the learned patterns can be used to classify its narratives with high accuracy in practice. © 2022 American Society of Civil Engineers.
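As a rough illustration of what narrative autocoding involves, the sketch below implements a Rocchio (nearest-centroid) classifier, one of the shallow methods named in the abstract, using only the Python standard library. It is not the authors' implementation: the TF-IDF weighting, the function names, the six toy narratives, and the three class labels are all assumptions made for this example, and the toy set is deliberately imbalanced to mirror the problem setting.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and keep purely alphabetic whitespace-separated tokens."""
    return [w for w in text.lower().split() if w.isalpha()]


def vectorize(tokens, idf):
    """Unit-length TF-IDF vector (as a dict) over the known vocabulary."""
    tf = Counter(t for t in tokens if t in idf)
    vec = {t: c * idf[t] for t, c in tf.items()}
    norm = math.sqrt(sum(w * w for w in vec.values()))
    return {t: w / norm for t, w in vec.items()} if norm else {}


def fit_rocchio(docs, labels):
    """Learn smoothed IDF weights and one mean vector (centroid) per class."""
    tokenized = [tokenize(d) for d in docs]
    n = len(tokenized)
    df = Counter(t for toks in tokenized for t in set(toks))
    idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}
    sums = defaultdict(Counter)
    counts = Counter(labels)
    for toks, y in zip(tokenized, labels):
        for t, w in vectorize(toks, idf).items():
            sums[y][t] += w
    centroids = {y: {t: w / counts[y] for t, w in vec.items()}
                 for y, vec in sums.items()}
    return idf, centroids


def cosine(unit_vec, centroid):
    """Cosine similarity; the first argument is already unit length."""
    dot = sum(w * centroid.get(t, 0.0) for t, w in unit_vec.items())
    norm = math.sqrt(sum(w * w for w in centroid.values()))
    return dot / norm if norm else 0.0


def predict(text, idf, centroids):
    """Assign the class whose centroid is most similar to the narrative."""
    vec = vectorize(tokenize(text), idf)
    return max(centroids, key=lambda y: cosine(vec, centroids[y]))


# Hypothetical, hand-written narratives (NOT taken from the OSHA data set).
train_docs = [
    "employee fell from roof while installing shingles",
    "worker fell from ladder and fractured leg",
    "employee fell through skylight opening on roof",
    "worker struck by falling steel beam",
    "employee struck by reversing dump truck",
    "electrician contacted energized power line and was electrocuted",
]
train_labels = ["falls", "falls", "falls",
                "struck_by", "struck_by", "electrocution"]

idf, centroids = fit_rocchio(train_docs, train_labels)
print(predict("laborer fell from scaffold", idf, centroids))
print(predict("operator struck by truck", idf, centroids))
```

With only one training narrative, the "electrocution" centroid reflects a single wording of the event. This is the kind of limited minority-class diversity the abstract cites as a source of misclassification.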
Pages: 13