Intelligent mining of safety hazard information from construction documents using semantic similarity and information entropy

Cited by: 5
Authors
Tian, Dan [1]
Li, Mingchao [1]
Shen, Yang [2]
Han, Shuai [1, 3]
Affiliations
[1] Tianjin Univ, State Key Lab Hydraul Engn Simulat & Safety, Tianjin 300350, Peoples R China
[2] China Three Gorges Corp, Beijing 100038, Peoples R China
[3] Hong Kong Polytech Univ, Dept Bldg & Real Estate, Hong Kong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Construction documents; Safety hazards; Information mining; Semantic similarity; Word2vec; Information entropy; MUTUAL INFORMATION; TF-IDF; IDENTIFICATION; EXTRACTION; SYSTEM; MODEL;
DOI
10.1016/j.engappai.2022.105742
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Construction sites are known to be very dangerous workplaces because of the large number of safety hazards present. Analysis of construction safety hazards is essential for formulating rational safety management plans and preventing accidents. Construction documents contain large volumes of safety hazard information available for analysis. However, such analyses are challenging because the safety hazard information in construction documents is presented in an unstructured or semi-structured format. This study proposes a method for intelligent mining of safety hazard information, which comprises safety hazard technical term recognition and safety hazard information analysis. The technical term recognition model is developed based on semantic similarity and information correlation and is used to build a safety hazard technical term library. The safety hazard information is then mined and analyzed on the basis of this library using the term frequency-inverse document frequency (TF-IDF) method. Finally, the proposed method is applied to build a safety hazard technical term library containing 2697 technical terms and to develop a hydraulic project construction safety hazard analysis system that supports intelligent recognition and application of technical terms. The system can also automatically extract safety hazard information and provides a visualization interface that shows the analysis results intuitively, which improves the efficiency of extracting safety hazard information. The study provides a new approach for recognizing technical terms and mining safety hazard information, which can enhance management efficiency and support practical knowledge discovery for safety management.
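As a rough illustration of the TF-IDF step mentioned in the abstract, the following is a minimal sketch, not the authors' implementation. It assumes documents are already tokenized and that a term library exists as a plain set; the function name `tf_idf`, the toy documents, and the toy library are all hypothetical, and the weighting is the textbook form tf × log(N / df), which may differ from the variant used in the paper.

```python
import math
from collections import Counter

def tf_idf(docs, term_library):
    """Score library terms in each document with plain TF-IDF (hypothetical helper)."""
    n_docs = len(docs)
    # Count only tokens that belong to the (assumed) technical term library.
    counts = [Counter(t for t in doc if t in term_library) for doc in docs]
    # Document frequency: in how many documents each library term occurs.
    df = Counter()
    for c in counts:
        df.update(c.keys())
    scores = []
    for doc, c in zip(docs, counts):
        total = len(doc) or 1  # guard against empty documents
        scores.append(
            {t: (n / total) * math.log(n_docs / df[t]) for t, n in c.items()}
        )
    return scores

# Toy, made-up data for illustration only.
docs = [
    ["scaffold", "collapse", "during", "scaffold", "removal"],
    ["crane", "overload", "near", "scaffold"],
    ["crane", "cable", "wear"],
]
library = {"scaffold", "collapse", "crane", "overload", "cable"}
scores = tf_idf(docs, library)
```

In this toy corpus, a term that is frequent in one document but also common across documents (here "scaffold") is weighted below a term that occurs in only one document ("collapse"), which is the behavior TF-IDF is meant to capture.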
Pages: 17
Related papers
50 in total
  • [31] Mining information from sentences through Semantic Web data and Information Extraction tasks
    Martinez-Rodriguez, Jose L.
    Lopez-Arevalo, Ivan
    Rios-Alvarado, Ana B.
    JOURNAL OF INFORMATION SCIENCE, 2022, 48 (01) : 3 - 20
  • [32] Using Semantic Information for Web Usage Mining Based Recommendation
    Salin, Suleyman
    Senkul, Pinar
    2009 24TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2009, : 235 - 240
  • [33] Measuring Semantic Similarity of Word Pairs Using Path and Information Content
    Meng, Lingling
    Huang, Runging
    Gu, Junzhong
    INTERNATIONAL JOURNAL OF FUTURE GENERATION COMMUNICATION AND NETWORKING, 2014, 7 (03) : 183 - 194
  • [34] An Information Entropy-based Risk (IER) Index of Mining Safety Using Clustering and Statistical Methods
    Eshwar, Dharmasai
    Chatterjee, Snehamoy
    Kaunda, Rennie
    Miller, Hugh
    Majdara, Aref
    MINING METALLURGY & EXPLORATION, 2024, : 1693 - 1708
  • [35] Measuring semantic similarity between words using multiple information sources
    Lei, Jingsheng
    Journal of Information and Computational Science, 2010, 7 (02) : 601 - 608
  • [36] Measure the Semantic Similarity of GO Terms Using Aggregate Information Content
    Song, Xuebo
    Li, Lin
    Srimani, Pradip K.
    Yu, Philip S.
    Wang, James Z.
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2014, 11 (03) : 468 - 476
  • [37] Construction of Clinical Pathway based on Similarity-based Mining in Hospital Information System
    Iwata, Haruko
    Hirano, Shoji
    Tsumoto, Shusaku
    2ND INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT, ITQM 2014, 2014, 31 : 1107 - 1115
  • [38] Intelligent information retrieval system using automatic thesaurus construction
    Song, Wei
    Yang, Jucheng
    Li, Chenghua
    Park, Sooncheol
    INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2011, 40 (04) : 395 - 415
  • [39] Intelligent Telecommunication System Using Semantic-Based Information Retrieval
    Jubilson, E. Ajith
    Dhanavanthini, P.
    Paul, P. Victer
    Pravinpathi, V.
    RamCoumare, M.
    Paranidharan, S.
    PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION TECHNOLOGIES, IC3T 2015, VOL 3, 2016, 381 : 137 - 143
  • [40] Analytical Study on Intelligent Information Retrieval System Using Semantic Network
    Sahu, Sanjib Kumar
    Mahapatra, P.
    Balabantaray, R. C.
    2016 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND AUTOMATION (ICCCA), 2016, : 704 - 710