Automatic bridge inspection database construction through hybrid information extraction and large language models

被引:0
|
作者
Zhang, Chenhong [1 ]
Lei, Xiaoming [2 ]
Xia, Ye [1 ,3 ]
Sun, Limin [1 ,3 ]
机构
[1] Tongji Univ, Dept Bridge Engn, Shanghai, Peoples R China
[2] Hong Kong Polytech Univ, Dept Civil & Environm Engn, Hong Kong, Peoples R China
[3] Shanghai Qi Zhi Inst, Shanghai, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
Bridge inspection data; Natural language processing; Information extraction; Large languge model; Pseudo label;
D O I
10.1016/j.dibe.2024.100549
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Regular bridge inspections generate extensive reports that, while critical for maintenance, often remain underutilized due to their unstructured format. Traditional information extraction methods depend on intricate labeling systems that commonly require time-consuming and labor-intensive labeling. This paper presents a novel bridge inspection database construction method leveraging LLM-assisted information extraction. First, we introduce the pseudo-labelling method using a closed-source LLM to generate high-quality data. Then we propose the hybrid extraction pipeline to extract relevant information segments and process them by a generation-based IE model, fine-tuned on pseudo-labeled data. Finally, the extracted data is used to construct the bridge inspection database. The proposed method, validated with real-world data, not only demonstrates higher extraction precision than the closed-source LLM used for pseudo-labeling but also outperforms traditional methods in both data preparation time and extraction accuracy. This approach provides a scalable solution for more proactive and data-driven bridge maintenance strategies.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Prompting Large Language Models for Automatic Question Tagging
    Xu, Nuojia
    Xue, Dizhan
    Qian, Shengsheng
    Fang, Quan
    Hu, Jun
    MACHINE INTELLIGENCE RESEARCH, 2025,
  • [32] Automatic Scoring of Metaphor Creativity with Large Language Models
    DiStefano, Paul V.
    Patterson, John D.
    Beaty, Roger E.
    CREATIVITY RESEARCH JOURNAL, 2024,
  • [33] Automatic Model Selection with Large Language Models for Reasoning
    Zhao, James Xu
    Xie, Yuxi
    Kawaguchi, Kenji
    He, Junxian
    Xie, Michael Qizhe
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 758 - 783
  • [34] Crack image classification and information extraction in steel bridges using multimodal large language models
    Wang, Xiao
    Yue, Qingrui
    Liu, Xiaogang
    AUTOMATION IN CONSTRUCTION, 2025, 171
  • [35] Probing the Consistency of Situational Information Extraction with Large Language Models: A Case Study on Crisis Computing
    Salfinger, Andrea
    Snidaro, Lauro
    2024 IEEE CONFERENCE ON COGNITIVE AND COMPUTATIONAL ASPECTS OF SITUATION MANAGEMENT, COGSIMA, 2024, : 91 - 98
  • [36] Leveraging Medical Knowledge Graphs and Large Language Models for Enhanced Mental Disorder Information Extraction
    Park, Chaelim
    Lee, Hayoung
    Jeong, Ok-ran
    FUTURE INTERNET, 2024, 16 (08)
  • [37] The HisClima database: historical weather logs for automatic transcription and information extraction
    Romero, Veronica
    Andreu Sanchez, Joan
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 10141 - 10148
  • [38] Automatic Information Extraction through Mobile Phones.
    Vijayalakshmi, I.
    Devi, Sobha Lalitha
    2012 THIRD INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION & NETWORKING TECHNOLOGIES (ICCCNT), 2012,
  • [39] Construction of a Japanese Financial Benchmark for Large Language Models
    Preferred Networks, Inc., Tokyo, Japan
    Jt. Workshop Financ. Technol. Nat. Lang. Process., Knowl. Discov. from Unstructured Data Financ. Serv. Econ. Nat. Lang. Process., FinNLP-KDF-ECONLP LREC-COLING - Workshop Proc., (1-9):
  • [40] Revisiting Relation Extraction in the era of Large Language Models
    Wadhwa, Somin
    Amir, Silvio
    Wallace, Byron C.
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 15566 - 15589