Large-scale semantic web image retrieval using bimodal deep learning techniques

被引:12
|
作者
Huang, Changqin [1 ,2 ]
Xu, Haijiao [1 ]
Xie, Liang [3 ]
Zhu, Jia [2 ]
Xu, Chunyan [4 ]
Tang, Yong [2 ]
机构
[1] South China Normal Univ, Sch Informat Technol Educ, Guangzhou, Guangdong, Peoples R China
[2] South China Normal Univ, Guangdong Engn Res Ctr Smart Learning, Guangzhou, Guangdong, Peoples R China
[3] Wuhan Univ Technol, Sch Sci, Wuhan, Hubei, Peoples R China
[4] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing, Jiangsu, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Convolutional neural networks; Multi-concept scene classifiers; Concept based image retrieval; Bimodal learning; SIMILARITY;
D O I
10.1016/j.ins.2017.11.043
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semantic web image retrieval is useful to end-users for semantic image searches over the Internet. This paper aims to develop image retrieval techniques for large-scale web image databases. An advanced retrieval system, termed Multi-concept Retrieval using Bimodal Deep Learning (MRBDL), is proposed and implemented using Convolutional Neural Networks (CNNs) which can effectively capture semantic correlations between a visual image and its free contextual tags. Different from existing approaches using multiple and independent concepts in a query, MRBDL considers multiple concepts as a holistic scene for retrieval model learning. In particular, we first use a bimodal CNN to train a holistic scene classifier in two modalities, and then semantic correlations of the sub-concepts included in the images are leveraged to boost holistic scene recognition. The predicted semantic scores obtained from holistic scene classifier are combined with complementary information on web images to improve the retrieval performance. Experiments have been carried out over two publicly available web image databases. The results show that our proposed approach performs favorably compared with several other state-of-the-art methods. (C) 2017 Elsevier Inc. All rights reserved.
引用
收藏
页码:331 / 348
页数:18
相关论文
共 50 条
  • [41] Deep image retrieval of large-scale vessels images based on BoW model
    Chi Tian
    Jinfeng Xia
    Ji Tang
    Hui Yin
    [J]. Multimedia Tools and Applications, 2020, 79 : 9387 - 9401
  • [42] Deep image retrieval of large-scale vessels images based on BoW model
    Tian, Chi
    Xia, Jinfeng
    Tang, Ji
    Yin, Hui
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (13-14) : 9387 - 9401
  • [44] Large-scale information retrieval with latent semantic indexing
    Letsche, TA
    Berry, MW
    [J]. INFORMATION SCIENCES, 1997, 100 (1-4) : 105 - 137
  • [45] Hierarchical Semantic Indexing for Large Scale Image Retrieval
    Jia Deng
    Berg, Alexander C.
    Li Fei-Fei
    [J]. 2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, : 785 - 792
  • [46] Deep Neighborhood Structure-Preserving Hashing for Large-Scale Image Retrieval
    Qin, Qibing
    Xie, Kezhen
    Zhang, Wenfeng
    Wang, Chengduan
    Huang, Lei
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1881 - 1893
  • [47] Large-Scale Remote Sensing Image Retrieval by Deep Hashing Neural Networks
    Li, Yansheng
    Zhang, Yongjun
    Huang, Xin
    Zhu, Hu
    Ma, Jiayi
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (02): : 950 - 965
  • [48] Deep Adaptive Quadruplet Hashing With Probability Sampling for Large-Scale Image Retrieval
    Qin, Qibing
    Huang, Lei
    Xie, Kezhen
    Wei, Zhiqiang
    Wang, Chengduan
    Zhang, Wenfeng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (12) : 7914 - 7927
  • [49] Implementation of a Large-Scale Retinal Image Curation Workflow Using Deep Learning Framework
    Balaji, Rohit
    Heathcote, Jen
    Slater, Robert
    Barrett, Nancy
    Voland, Rick
    Tomic, Vesna
    McDonald, Jared
    Blodi, Barbara A.
    Domalpally, Amitha
    [J]. INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2022, 63 (07)
  • [50] Large-Scale JPEG Image Steganalysis Using Hybrid Deep-Learning Framework
    Zeng, Jishen
    Tan, Shunquan
    Li, Bin
    Huang, Jiwu
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2018, 13 (05) : 1200 - 1214