Intelligent mining of safety hazard information from construction documents using semantic similarity and information entropy

被引:5
|
作者
Tian, Dan [1 ]
Li, Mingchao [1 ]
Shen, Yang [2 ]
Han, Shuai [1 ,3 ]
机构
[1] Tianjin Univ, State Key Lab Hydraul Engn Simulat & Safety, Tianjin 300350, Peoples R China
[2] China Three Gorges Corp, Beijing 100038, Peoples R China
[3] Hong Kong Polytech Univ, Dept Bldg & Real Estate, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Construction documents; Safety hazards; Information mining; Semantic similarity; Word2vec; Information entropy; MUTUAL INFORMATION; TF-IDF; IDENTIFICATION; EXTRACTION; SYSTEM; MODEL;
D O I
10.1016/j.engappai.2022.105742
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Project construction on-site is known to be very dangerous workplace environments due to large numbers of safety hazards. Analysis of construction safety hazards is essential to formulate rational safety management plans and prevent accidents. Construction documents contain large volumes of safety hazard information available for analysis. However, such analyses are challenging because the safety hazard information in the construction documents is presented in an unstructured or semi-structured format. This study proposes a method for intelligent mining of safety hazard information, which comprises safety hazard technical term recognition and safety hazard information analysis. The safety hazard technical term recognition model is developed based on semantic similarity and information correlation to build a safety hazard technical term library. The safety hazard information based on the technical term library is mined and analyzed using the term frequency-inverse document frequency method (TF-IDF). Finally, the proposed method is applied to build the safety hazard technical term library, which contains 2697 technical terms, and develop a hydraulic project construction safety hazard analysis system, which can realize the intelligent recognition and application of technical terms. Meanwhile, this system can automatically extract safety hazard information and provide a visualization interface to intuitively show the safety hazard analysis results, which improves the extraction efficiency of safety hazard information. The study provides a new approach for recognizing technical terms and mining safety hazard information, which can lead to enhancing management efficiency and practical knowledge discovery for safety management.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Impact of information presentation on interpretability of spatial hazard information: lessons from a study in avalanche safety
    Fisher, Kathryn C.
    Haegeli, Pascal
    Mair, Patrick
    NATURAL HAZARDS AND EARTH SYSTEM SCIENCES, 2021, 21 (10) : 3219 - 3242
  • [42] Improving pattern quality in web usage mining by using semantic information
    Senkul, Pinar
    Salin, Suleyman
    KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 30 (03) : 527 - 541
  • [43] Improving pattern quality in web usage mining by using semantic information
    Pinar Senkul
    Suleyman Salin
    Knowledge and Information Systems, 2012, 30 : 527 - 541
  • [44] Mining safety hazard management collaboration features from large construction projects
    Zhang, Dongcheng
    Qiang, Maoshan
    Jiang, Hanchen
    Huang, Yujie
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2022, 62 (02): : 208 - 214
  • [45] Intelligent Safety Information Gathering System Using a Smart Blackbox
    Kang, Chanjin
    Heo, Seo Weon
    2017 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2017,
  • [46] Mining protein interactions from biomedical literature using semantic similarity
    Schmitt, Charles
    Cox, Steven
    Christopherson, Laura
    Scott, Erick
    Firrincieli, Stephen
    Baker, Nancy
    Tutubalina, Elena
    Tropsha, Alexander
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 253
  • [47] Semantic similarity and mutual information predicting sentence comprehension: the case of dangling topic construction in Chinese
    Sun, Kun
    Wang, Rong
    JOURNAL OF COGNITIVE PSYCHOLOGY, 2023, 35 (02) : 142 - 165
  • [48] HιLεX: A system for semantic information extraction from web documents
    Ruffolo, Massimo
    Manna, Marco
    ENTERPRISE INFORMATION SYSTEMS-BOOK, 2008, 3 : 194 - +
  • [49] A deep learning based method for extracting semantic information from patent documents
    Liang Chen
    Shuo Xu
    Lijun Zhu
    Jing Zhang
    Xiaoping Lei
    Guancan Yang
    Scientometrics, 2020, 125 : 289 - 312
  • [50] A deep learning based method for extracting semantic information from patent documents
    Chen, Liang
    Xu, Shuo
    Zhu, Lijun
    Zhang, Jing
    Lei, Xiaoping
    Yang, Guancan
    SCIENTOMETRICS, 2020, 125 (01) : 289 - 312