Safety compliance checking of construction behaviors using visual question answering

被引:15
|
作者
Ding, Yuexiong [1 ,2 ]
Liu, Muyang [1 ,2 ]
Luo, Xiaowei [1 ,2 ]
机构
[1] City Univ Hong Kong, Dept Architecture & Civil Engn, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Shenzhen Res Inst, Architecture & Civil Engn Res Ctr, Shenzhen, Peoples R China
关键词
Construction safety management; Safety compliance checking; Visual reasoning; Visual question answering; Cross -modal model; Vision -and -language Transformer; FALLS;
D O I
10.1016/j.autcon.2022.104580
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Unsafe construction behavior, one of the leading factors of accidents and casualties, can be reduced by strengthening construction inspection. However, current methods use either manual inspection or inefficient cross-modal models based on multiple backbone networks. To alleviate the problems, a "rule-question" trans-formation and annotation system is formulated, and the unsafe behavior detection is turned into a visual reasoning task: visual question answering (VQA). The VQA model is developed based on a vision-and-language Transformer, and the unsafe behavior could be identified based on the output answers. A dataset containing 16 safety rules and 2386 related construction images is used to fine-tune and validate the VQA model. The results show that the developed VQA model achieves an average recall of 0.81 at a faster reasoning speed. Finally, an applet for safety report generation is implemented to demonstrate the feasibility and practicability of the safety compliance checking based on VQA.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Scene Understanding for Autonomous Driving Using Visual Question Answering
    Wantiez, Adrien
    Qiu, Tianming
    Matthes, Stefan
    Shen, Hao
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [22] EXPLAINABLE FACT-CHECKING THROUGH QUESTION ANSWERING
    Yang, Jing
    Vega-Oliveros, Didier
    Seibt, Tais
    Rocha, Anderson
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8952 - 8956
  • [23] Explaining Disagreement in Visual Question Answering Using Eye Tracking
    Hindennach, Susanne
    Shi, Lei
    Bulling, Andreas
    PROCEEDINGS OF THE 2024 ACM SYMPOSIUM ON EYE TRACKING RESEARCH & APPLICATIONS, ETRA 2024, 2024,
  • [24] Empirical study on using adapters for debiased Visual Question Answering
    Cho, Jae Won
    Argaw, Dawit Mureja
    Oh, Youngtaek
    Kim, Dong-Jin
    Kweon, In So
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 237
  • [25] Visual Question Answering using Hierarchical Dynamic Memory Networks
    Shang, Jiayu
    Li, Shiren
    Duan, Zhikui
    Huang, Junwei
    NINTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2017), 2018, 10615
  • [26] Safety compliance question
    Vetier, Terry
    Titus, J. B.
    CONTROL ENGINEERING, 2008, 55 (09) : 16 - 16
  • [27] An Improved Attention for Visual Question Answering
    Rahman, Tanzila
    Chou, Shih-Han
    Sigal, Leonid
    Carenini, Giuseppe
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1653 - 1662
  • [28] Robust Explanations for Visual Question Answering
    Patro, Badri N.
    Patel, Shivansh
    Namboodiri, Vinay P.
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 1566 - 1575
  • [29] Visual Question Answering for Cultural Heritage
    Bongini, Pietro
    Becattini, Federico
    Bagdanov, Andrew D.
    Del Bimbo, Alberto
    INTERNATIONAL CONFERENCE FLORENCE HERI-TECH: THE FUTURE OF HERITAGE SCIENCE AND TECHNOLOGIES, 2020, 949
  • [30] Question -Led object attention for visual question answering
    Gao, Lianli
    Cao, Liangfu
    Xu, Xing
    Shao, Jie
    Song, Jingkuan
    NEUROCOMPUTING, 2020, 391 : 227 - 233