Deep Learning Models for Content-Based Retrieval of Construction Visual Data

被引：0

作者：

Nath, Nipun D. ^{[1
]}

Behzadan, Amir H. ^{[2
]}

机构：

[1] Texas A&M Univ, Zachry Dept Civil Engn, College Stn, TX 77843 USA

[2] Texas A&M Univ, Dept Construct Sci, College Stn, TX 77843 USA

来源：

COMPUTING IN CIVIL ENGINEERING 2019: DATA, SENSING, AND ANALYTICS | 2019年

基金：

美国国家科学基金会;

关键词：

CONVOLUTIONAL NEURAL-NETWORKS;

D O I：

暂无

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Deep learning (DL) algorithms such as convolutional neural networks (CNNs) can assist in tasks such as content search and retrieval, image tagging and captioning, scene description, motion prediction, and language processing. This paper presents research that aims at designing and validating DL models for automated content-based retrieval of daily construction images and videos. Information retrieval from visual data is key to labor-intensive tasks such as safety inspection, crew activity logging, and work progress documentation. In order to train deep neural networks (DNNs), large repositories of high-quality annotated visual data are needed. However, generating such labeled datasets in construction is non-trivial and resource intensive, and requires specific skillset. To overcome this challenge, we present a methodology for fast object detection and tagging in visual data using DNNs trained with a relatively small dataset. Two state-of-the-art object detection algorithms, i.e., you-only-look-once (YOLO) and mask region-based CNN (a.k.a., Mask R-CNN) are investigated. Training data is obtained via web mining (the Internet) and crowdsourcing. Results show that training on data from both sources yields the best classification accuracy. Testing the model on new data reveals that the fully-tuned model can achieve a minimum mean average precision (mAP) of 79% when tested on different image subsets.

引用

页码：66 / 73

页数：8

共 50 条

[1] Learning visual keywords for content-based retrieval
Lim, JH
[J]. IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS, PROCEEDINGS VOL 2, 1999, : 169 - 173
[2] Weakly supervised learning of visual models and its application to content-based retrieval
Schmid, C
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 56 (1-2) : 7 - 16
[3] Weakly Supervised Learning of Visual Models and Its Application to Content-Based Retrieval
Cordelia Schmid
[J]. International Journal of Computer Vision, 2004, 56 : 7 - 16
[4] Content-Based Image Retrieval Using Multi-deep Learning Models
Bui Thanh Hung
[J]. NEXT GENERATION OF INTERNET OF THINGS, 2023, 445 : 347 - 357
[5] Content-Based Video Big Data Retrieval with Extensive Features and Deep Learning
Thuong-Cang Phan
Anh-Cang Phan
Hung-Phi Cao
Thanh-Ngoan Trieu
[J]. APPLIED SCIENCES-BASEL, 2022, 12 (13):
[6] Construction and Implementation of Content-Based National Music Retrieval Model Under Deep Learning
Shi, Jing
Liu, Lei
[J]. INTERNATIONAL JOURNAL OF INFORMATION SYSTEM MODELING AND DESIGN, 2024, 15 (01) : 1 - 17
[7] Deep Learning for Plant Classification and Content-Based Image Retrieval
Gyires-Toth, Balint Pal
Osvath, Marton
Papp, David
Szucs, Gabor
[J]. CYBERNETICS AND INFORMATION TECHNOLOGIES, 2019, 19 (01) : 88 - 100
[8] Deep Learning for Content-Based Image Retrieval: A Comprehensive Study
Wan, Ji
Wang, Dayong
Hoi, Steven C. H.
Wu, Pengcheng
Zhu, Jianke
Zhang, Yongdong
Li, Jintao
[J]. PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 157 - 166
[9] Deep learning for content-based image retrieval in FHE algorithms
Abdullah, Sura Mahmood
Jaber, Mustafa Musa
[J]. JOURNAL OF INTELLIGENT SYSTEMS, 2023, 32 (01)
[10] Using Deep Learning for Content-Based Medical Image Retrieval
Sun, Qinpei
Yang, Yuanyuan
Sun, Jianyong
Yang, Zhiming
Zhang, Jianguo
[J]. MEDICAL IMAGING 2017: IMAGING INFORMATICS FOR HEALTHCARE, RESEARCH, AND APPLICATIONS, 2017, 10138

← 1 2 3 4 5 →