Large-scale semantic web image retrieval using bimodal deep learning techniques

被引:12
|
作者
Huang, Changqin [1 ,2 ]
Xu, Haijiao [1 ]
Xie, Liang [3 ]
Zhu, Jia [2 ]
Xu, Chunyan [4 ]
Tang, Yong [2 ]
机构
[1] South China Normal Univ, Sch Informat Technol Educ, Guangzhou, Guangdong, Peoples R China
[2] South China Normal Univ, Guangdong Engn Res Ctr Smart Learning, Guangzhou, Guangdong, Peoples R China
[3] Wuhan Univ Technol, Sch Sci, Wuhan, Hubei, Peoples R China
[4] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing, Jiangsu, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Convolutional neural networks; Multi-concept scene classifiers; Concept based image retrieval; Bimodal learning; SIMILARITY;
D O I
10.1016/j.ins.2017.11.043
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semantic web image retrieval is useful to end-users for semantic image searches over the Internet. This paper aims to develop image retrieval techniques for large-scale web image databases. An advanced retrieval system, termed Multi-concept Retrieval using Bimodal Deep Learning (MRBDL), is proposed and implemented using Convolutional Neural Networks (CNNs) which can effectively capture semantic correlations between a visual image and its free contextual tags. Different from existing approaches using multiple and independent concepts in a query, MRBDL considers multiple concepts as a holistic scene for retrieval model learning. In particular, we first use a bimodal CNN to train a holistic scene classifier in two modalities, and then semantic correlations of the sub-concepts included in the images are leveraged to boost holistic scene recognition. The predicted semantic scores obtained from holistic scene classifier are combined with complementary information on web images to improve the retrieval performance. Experiments have been carried out over two publicly available web image databases. The results show that our proposed approach performs favorably compared with several other state-of-the-art methods. (C) 2017 Elsevier Inc. All rights reserved.
引用
收藏
页码:331 / 348
页数:18
相关论文
共 50 条
  • [1] Semantic Hierarchy Preserving Deep Hashing for Large-Scale Image Retrieval
    Ming Zhang
    Zhe, Xuefei
    Le Ou-Yang
    Chen, Shifeng
    Hong Yan
    [J]. PROCEEDINGS OF 17TH INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA 2021), 2021,
  • [2] LARGE-SCALE FACE IMAGE RETRIEVAL BASED ON HADOOP AND DEEP LEARNING
    Huang Yuanyuan
    Tang Yuan
    Xiong Taisong
    [J]. 2020 17TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2020, : 326 - 329
  • [3] Deep Hashing for Large-scale Image Retrieval
    Li Mengting
    Liu Jun
    [J]. PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 10940 - 10944
  • [4] Joint learning based deep supervised hashing for large-scale image retrieval
    Gu, Guanghua
    Liu, Jiangtao
    Li, Zhuoyi
    Huo, Wenhua
    Zhao, Yao
    [J]. NEUROCOMPUTING, 2020, 385 : 348 - 357
  • [5] Learning Multilevel Semantic Similarity for Large-Scale Multi-Label Image Retrieval
    Song, Ge
    Tan, Xiaoyang
    [J]. ICMR '18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2018, : 64 - 72
  • [6] Deep semantic preserving hashing for large scale image retrieval
    Masoumeh Zareapoor
    Jie Yang
    Deepak Kumar Jain
    Pourya Shamsolmoali
    Neha Jain
    Surya Kant
    [J]. Multimedia Tools and Applications, 2019, 78 : 23831 - 23846
  • [7] Deep semantic preserving hashing for large scale image retrieval
    Zareapoor, Masoumeh
    Yang, Jie
    Jain, Deepak Kumar
    Shamsolmoali, Pourya
    Jain, Neha
    Kant, Surya
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (17) : 23831 - 23846
  • [8] Deep Product Quantization for Large-Scale Image Retrieval
    Zhai, Qi
    Jiang, Mingyan
    [J]. 2019 4TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS (ICBDA 2019), 2019, : 198 - 202
  • [9] Cascaded Deep Hashing for Large-Scale Image Retrieval
    Lu, Jun
    Zhang, Li
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2018), PT VI, 2018, 11306 : 419 - 429
  • [10] Semantic Video Retrieval using Deep Learning Techniques
    Yasin, Danish
    Sohail, Ashbal
    Siddiqi, Imran
    [J]. PROCEEDINGS OF 2020 17TH INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGY (IBCAST), 2020, : 338 - 343