Deep Learning Models for Content-Based Retrieval of Construction Visual Data

被引:0
|
作者
Nath, Nipun D. [1 ]
Behzadan, Amir H. [2 ]
机构
[1] Texas A&M Univ, Zachry Dept Civil Engn, College Stn, TX 77843 USA
[2] Texas A&M Univ, Dept Construct Sci, College Stn, TX 77843 USA
基金
美国国家科学基金会;
关键词
CONVOLUTIONAL NEURAL-NETWORKS;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Deep learning (DL) algorithms such as convolutional neural networks (CNNs) can assist in tasks such as content search and retrieval, image tagging and captioning, scene description, motion prediction, and language processing. This paper presents research that aims at designing and validating DL models for automated content-based retrieval of daily construction images and videos. Information retrieval from visual data is key to labor-intensive tasks such as safety inspection, crew activity logging, and work progress documentation. In order to train deep neural networks (DNNs), large repositories of high-quality annotated visual data are needed. However, generating such labeled datasets in construction is non-trivial and resource intensive, and requires specific skillset. To overcome this challenge, we present a methodology for fast object detection and tagging in visual data using DNNs trained with a relatively small dataset. Two state-of-the-art object detection algorithms, i.e., you-only-look-once (YOLO) and mask region-based CNN (a.k.a., Mask R-CNN) are investigated. Training data is obtained via web mining (the Internet) and crowdsourcing. Results show that training on data from both sources yields the best classification accuracy. Testing the model on new data reveals that the fully-tuned model can achieve a minimum mean average precision (mAP) of 79% when tested on different image subsets.
引用
收藏
页码:66 / 73
页数:8
相关论文
共 50 条
  • [1] Learning visual keywords for content-based retrieval
    Lim, JH
    [J]. IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS, PROCEEDINGS VOL 2, 1999, : 169 - 173
  • [2] Weakly supervised learning of visual models and its application to content-based retrieval
    Schmid, C
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 56 (1-2) : 7 - 16
  • [3] Weakly Supervised Learning of Visual Models and Its Application to Content-Based Retrieval
    Cordelia Schmid
    [J]. International Journal of Computer Vision, 2004, 56 : 7 - 16
  • [4] Content-Based Image Retrieval Using Multi-deep Learning Models
    Bui Thanh Hung
    [J]. NEXT GENERATION OF INTERNET OF THINGS, 2023, 445 : 347 - 357
  • [5] Content-Based Video Big Data Retrieval with Extensive Features and Deep Learning
    Thuong-Cang Phan
    Anh-Cang Phan
    Hung-Phi Cao
    Thanh-Ngoan Trieu
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (13):
  • [6] Construction and Implementation of Content-Based National Music Retrieval Model Under Deep Learning
    Shi, Jing
    Liu, Lei
    [J]. INTERNATIONAL JOURNAL OF INFORMATION SYSTEM MODELING AND DESIGN, 2024, 15 (01) : 1 - 17
  • [7] Deep Learning for Plant Classification and Content-Based Image Retrieval
    Gyires-Toth, Balint Pal
    Osvath, Marton
    Papp, David
    Szucs, Gabor
    [J]. CYBERNETICS AND INFORMATION TECHNOLOGIES, 2019, 19 (01) : 88 - 100
  • [8] Deep Learning for Content-Based Image Retrieval: A Comprehensive Study
    Wan, Ji
    Wang, Dayong
    Hoi, Steven C. H.
    Wu, Pengcheng
    Zhu, Jianke
    Zhang, Yongdong
    Li, Jintao
    [J]. PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 157 - 166
  • [9] Deep learning for content-based image retrieval in FHE algorithms
    Abdullah, Sura Mahmood
    Jaber, Mustafa Musa
    [J]. JOURNAL OF INTELLIGENT SYSTEMS, 2023, 32 (01)
  • [10] Using Deep Learning for Content-Based Medical Image Retrieval
    Sun, Qinpei
    Yang, Yuanyuan
    Sun, Jianyong
    Yang, Zhiming
    Zhang, Jianguo
    [J]. MEDICAL IMAGING 2017: IMAGING INFORMATICS FOR HEALTHCARE, RESEARCH, AND APPLICATIONS, 2017, 10138