Deep Convolutional Neural Network for Bidirectional Image-Sentence Mapping

被引:1
|
作者
Yu, Tianyuan [1 ]
Bai, Liang [1 ]
Guo, Jinlin [1 ]
Yang, Zheng [1 ]
Xie, Yuxiang [1 ]
机构
[1] Natl Univ Def Technol, Coll Informat Syst & Management, Changsha 410073, Hunan, Peoples R China
来源
关键词
D O I
10.1007/978-3-319-51814-5_12
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rapid development of the Internet and the explosion of data volume, it is important to access the cross-media big data including text, image, audio, and video, etc., efficiently and accurately. However, the content heterogeneity and semantic gap make it challenging to retrieve such cross-media archives. The existing approaches try to learn the connection between multiple modalities by direct utilization of hand-crafted low-level features, and the learned correlations are merely constructed with high-level feature representations without considering semantic information. To further exploit the intrinsic structures of multimodal data representations, it is essential to build up an interpretable correlation between these heterogeneous representations. In this paper, a deep model is proposed to first learn the high-level feature representation shared by different modalities like texts and images, with convolutional neural network (CNN). Moreover, the learned CNN features can reflect the salient objects as well as the details in the images and sentences. Experimental results demonstrate that proposed approach outperforms the current state-of-the-art base methods on public dataset of Flickr8K.
引用
收藏
页码:136 / 147
页数:12
相关论文
共 50 条
  • [21] Rocket Image Classification Based on Deep Convolutional Neural Network
    Zhang, Liang
    Chen, Zhenhua
    Wang, Jian
    Huang, Zhaodun
    [J]. 2018 10TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS (ICCCAS 2018), 2018, : 383 - 386
  • [22] Stereoscopic image quality assessment by deep convolutional neural network
    Fang, Yuming
    Yan, Jiebin
    Liu, Xuelin
    Wang, Jiheng
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 58 : 400 - 406
  • [23] Image Denoising using Deep Learning: Convolutional Neural Network
    Ghose, Shreyasi
    Singh, Nishi
    Singh, Prabhishek
    [J]. PROCEEDINGS OF THE CONFLUENCE 2020: 10TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING, 2020, : 511 - 517
  • [24] Deep Convolutional Neural Network for Microscopic Bacteria Image Classification
    Wahid, Md Ferdous
    Hasan, Md Jahid
    Alom, Md Shahin
    [J]. 2019 5TH INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL ENGINEERING (ICAEE), 2019, : 866 - 869
  • [25] Image Super-Resolution With Deep Convolutional Neural Network
    Ji, Xiancai
    Lu, Yao
    Guo, Li
    [J]. 2016 IEEE FIRST INTERNATIONAL CONFERENCE ON DATA SCIENCE IN CYBERSPACE (DSC 2016), 2016, : 626 - 630
  • [26] Remote Sensing Image Fusion With Deep Convolutional Neural Network
    Shao, Zhenfeng
    Cai, Jiajun
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2018, 11 (05) : 1656 - 1669
  • [27] PolSAR image classification based on deep convolutional neural network
    Wang, Yunyan
    Wang, Gaihua
    Lan, Yihua
    [J]. Metallurgical and Mining Industry, 2015, 7 (08): : 366 - 371
  • [28] Medical image retrieval using deep convolutional neural network
    Qayyum, Adnan
    Anwar, Syed Muhammad
    Awais, Muhammad
    Majid, Muhammad
    [J]. NEUROCOMPUTING, 2017, 266 : 8 - 20
  • [29] Hyperspectral image reconstruction by deep convolutional neural network for classification
    Li, Yunsong
    Xie, Weiying
    Li, Huaqing
    [J]. PATTERN RECOGNITION, 2017, 63 : 371 - 383
  • [30] Robust deep convolutional neural network against image distortions
    Wang, Liang-Yao
    Chen, Sau-Gee
    Chien, Feng-Tsun
    [J]. APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2021, 10