Construction-Accident Narrative Classification Using Shallow and Deep Learning

Cited by: 13

Authors:

Qiao, Jianfeng [1,2]

Wang, Changfeng [3]

Guan, Shuang [3]

Shuran, Lv [1,2]

Affiliations:

[1] Capital Univ Econ & Business, Dept Management & Engn, Beijing 100070, Peoples R China

[2] Dept Beijing Key Lab Megareg Sustainable Dev Mode, Beijing 100070, Peoples R China

[3] Beijing Univ Posts & Telecommun, Dept Econ & Management, Beijing 100876, Peoples R China

Source:

JOURNAL OF CONSTRUCTION ENGINEERING AND MANAGEMENT | 2022 / Vol. 148 / No. 09

Keywords:

Data mining; Natural language processing; Machine learning; Text mining; Autocoding; LARGE ADMINISTRATIVE DATABASES; CLASSIFYING INJURY NARRATIVES; TEXT CLASSIFICATION; CODING CAUSATION; NEURAL-NETWORKS; TOOL;

DOI:

10.1061/(ASCE)CO.1943-7862.0002354

Chinese Library Classification (CLC) number:

TU [Building Science];

Discipline classification code:

0813;

Abstract:

It is crucial to extract knowledge from past accidents to prevent future ones. To this end, narrative classification is required in text mining. This autocoding process can be seen as a multiclass classification problem with an imbalanced data set. We evaluated the performance of several state-of-the-art machine learning methods, including 10 shallow learning methods [Rocchio, k-nearest neighbors, linear regression, naive Bayes, decision tree, random forest, gradient boosting, bootstrap aggregating, support vector machine (SVM), and shallow neural network] and five deep learning methods [deep neural network, convolutional neural network (CNN), recurrent neural networks with long short-term memory and with a gated recurrent unit, and recurrent CNN]. The input data set contained 4,770 construction accident reports from the Occupational Safety and Health Administration (OSHA). After the narratives were relabeled based on the Occupational Injury and Illness Classification System (OIICS), the accuracy of all shallow classifiers was significantly improved compared with that reported in previous studies. SVM and CNN achieved the highest accuracies, 0.91 and 0.90, among the shallow and deep learning methods, respectively. Misclassifications occur because training data sets lack rich diversity for minority classes, some cases belong to multiple classes, and some divisions have the same key feature words. In the future, when a new data set is available, we can use learned patterns to classify its narratives with high accuracy in practice. © 2022 American Society of Civil Engineers.
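
To make the autocoding idea concrete, here is a minimal sketch of one of the shallow methods named in the abstract, a Rocchio (nearest-centroid) classifier over TF-IDF vectors. It is not the authors' implementation. The narratives, the three class labels, and all function names are hypothetical and invented for illustration; they are not OSHA records or OIICS codes. The toy set is deliberately imbalanced, with a single example for one class.

```python
import math
import re
from collections import Counter, defaultdict

def tokenize(text):
    # Lowercase and keep alphabetic runs only.
    return re.findall(r"[a-z]+", text.lower())

def fit_idf(docs):
    # Smoothed inverse document frequency over tokenized training docs.
    n = len(docs)
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))
    return {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}

def tfidf(tokens, idf):
    # L2-normalized TF-IDF vector; terms unseen in training are dropped.
    tf = Counter(t for t in tokens if t in idf)
    vec = {t: c * idf[t] for t, c in tf.items()}
    norm = math.sqrt(sum(v * v for v in vec.values())) or 1.0
    return {t: v / norm for t, v in vec.items()}

def fit_rocchio(texts, labels):
    # One centroid per class: the mean of that class's TF-IDF vectors.
    docs = [tokenize(t) for t in texts]
    idf = fit_idf(docs)
    sums = defaultdict(lambda: defaultdict(float))
    counts = Counter(labels)
    for tokens, label in zip(docs, labels):
        for term, value in tfidf(tokens, idf).items():
            sums[label][term] += value
    centroids = {
        label: {t: v / counts[label] for t, v in vec.items()}
        for label, vec in sums.items()
    }
    return idf, centroids

def cosine(a, b):
    dot = sum(v * b.get(t, 0.0) for t, v in a.items())
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def predict(text, idf, centroids):
    # Assign the class whose centroid is most similar to the narrative.
    vec = tfidf(tokenize(text), idf)
    return max(centroids, key=lambda label: cosine(vec, centroids[label]))

# Hypothetical, hand-written narratives (imbalanced: 3 / 2 / 1 per class).
texts = [
    "worker fell from scaffold and fractured his leg",
    "employee fell off a ladder while painting",
    "roofer fell through a skylight opening",
    "employee was struck by a falling steel beam",
    "worker struck by a reversing dump truck",
    "electrician was electrocuted by a live wire",
]
labels = ["falls", "falls", "falls", "struck_by", "struck_by", "electrocution"]

idf, centroids = fit_rocchio(texts, labels)
print(predict("laborer fell from a ladder", idf, centroids))
```

The abstract's third misclassification cause, divisions that share the same key feature words, is easy to see in a model like this: a narrative whose only known terms are shared across classes ("by", "a") gives nearly the same similarity to several centroids.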


Pages: 13
