Deep neural network based image annotation

被引:14
|
作者
Zhu, Songhao [1 ]
Shi, Zhe [1 ]
Sun, Chengjian [1 ]
Shen, Shuhan [2 ]
机构
[1] Nanjing Univ Posts & Telecommun, Sch Automat, Nanjing 210046, Jiangsu, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Beijing 110093, Peoples R China
关键词
Deep learning; Multi-label; Multi-modal; Image annotation;
D O I
10.1016/j.patrec.2015.07.037
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multilabel image annotation is one of the most important open problems in computer vision field. Unlike existing works that usually use conventional visual features to annotate images, features based on deep learning have shown potential to achieve outstanding performance. In this work, we propose a multimodal deep learning framework, which aims to optimally integrate multiple deep neural networks pretrained with convolutional neural networks. In particular, the proposed framework explores a unified two stage learning scheme that consists of (i) learning to fine-tune the parameters of deep neural network with respect to each individual modality, and (ii) learning to find the optimal combination of diverse modalities simultaneously in a coherent process. Experiments conducted on a variety of public datasets evaluate the performance of the proposed framework for multilabel image annotation, in which the encouraging results validate the effectiveness of the proposed algorithms. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:103 / 108
页数:6
相关论文
共 50 条
  • [1] Rock Image Automatic Annotation based on Deep Neural Network
    Zhao, Huitao
    Zhao, Xiaohui
    Liu, Wenxiang
    Wei, Junjie
    Cheng, Guojian
    Qiang, Xinjian
    [J]. PROCEEDINGS OF 2020 IEEE 5TH INFORMATION TECHNOLOGY AND MECHATRONICS ENGINEERING CONFERENCE (ITOEC 2020), 2020, : 1824 - 1828
  • [2] Image Annotation Via Deep Neural Network
    Sun Chengjian
    Zhu, Songhao
    Shi, Zhe
    [J]. 2015 14TH IAPR INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA), 2015, : 518 - 521
  • [3] IMAGE STRUCTURED ANNOTATION BASED ON DEEP NEURAL NETWORK NATURAL LANGUAGE PROCESSING
    Jia, Jing
    Hua, Jing
    [J]. COMPUTING AND INFORMATICS, 2024, 43 (04) : 926 - 943
  • [4] A NOVEL IMAGE ANNOTATION SCHEME BASED ON NEURAL NETWORK
    Zhao, Yufeng
    Zhao, Yao
    Zhu, Zhenfeng
    Pan, Jeng-Shyang
    [J]. ISDA 2008: EIGHTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 3, PROCEEDINGS, 2008, : 644 - +
  • [5] Deep Convolutional Neural Network with KNN Regression for Automatic Image Annotation
    Bensaci, Ramla
    Khaldi, Belal
    Aiadi, Oussama
    Benchabana, Ayoub
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (21):
  • [6] Multimodal deep network learning-based image annotation
    Zhu, Songhao
    Li, Xiangxiang
    Shen, Shuhan
    [J]. ELECTRONICS LETTERS, 2015, 51 (12) : 905 - 906
  • [7] Image Annotation Based on Convolutional Neural Network and Topic Model
    Zhang Lei
    Cai Ming
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (20)
  • [8] Multi-modal multi-concept-based deep neural network for automatic image annotation
    Haijiao Xu
    Changqin Huang
    Xiaodi Huang
    Muxiong Huang
    [J]. Multimedia Tools and Applications, 2019, 78 : 30651 - 30675
  • [9] Multi-modal multi-concept-based deep neural network for automatic image annotation
    Xu, Haijiao
    Huang, Changqin
    Huang, Xiaodi
    Huang, Muxiong
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (21) : 30651 - 30675
  • [10] DeEPn: a deep neural network based tool for enzyme functional annotation
    Semwal, Rahul
    Aier, Imlimaong
    Tyagi, Pankaj
    Varadwaj, Pritish Kumar
    [J]. JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2021, 39 (08): : 2733 - 2743