Semantic Regularisation for Recurrent Image Annotation

被引:64
|
作者
Liu, Feng [1 ,2 ]
Xiang, Tao [2 ]
Hospedales, Timothy M. [3 ]
Yang, Wankou [1 ]
Sun, Changyin [1 ]
机构
[1] Southeast Univ, Nanjing, Jiangsu, Peoples R China
[2] Queen Mary Univ London, London, England
[3] Univ Edinburgh, Edinburgh, Midlothian, Scotland
关键词
D O I
10.1109/CVPR.2017.443
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The "CNN-RNN" design pattern is increasingly widely applied in a variety of image annotation tasks including multi-label classification and captioning. Existing models use the weakly semantic CNN hidden layer or its transform as the image embedding that provides the interface between the CNN and RNN. This leaves the RNN overstretched with two jobs: predicting the visual concepts and modelling their correlations for generating structured annotation output. Importantly this makes the end-to-end training of the CNN and RNN slow and ineffective due to the difficulty of back propagating gradients through the RNN to train the CNN. We propose a simple modification to the design pattern that makes learning more effective and efficient. Specifically, we propose to use a semantically regularised embedding layer as the interface between the CNN and RNN. Regularising the interface can partially or completely decouple the learning problems, allowing each to be more effectively trained and jointly training much more efficient. Extensive experiments show that state-of-the art performance is achieved on multi-label classification as well as image captioning.
引用
收藏
页码:4160 / 4168
页数:9
相关论文
共 50 条
  • [1] Semantic Fusion of Image Annotation
    Wu, Xiaoying
    Liang, Yunjuan
    Li, Li
    Ma, Lijuan
    [J]. COMPUTATIONAL MATERIALS SCIENCE, PTS 1-3, 2011, 268-270 : 1386 - 1389
  • [2] Image Semantic Description and Automatic Semantic Annotation
    Liang Meiyu
    Du Junping
    Jia Yingmin
    Sun Zengqi
    [J]. INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2010), 2010, : 1192 - 1195
  • [3] Automatic image annotation for semantic image retrieval
    Shao, Wenbin
    Naghdy, Golshah
    Phung, Son Lam
    [J]. ADVANCES IN VISUAL INFORMATION SYSTEMS, 2007, 4781 : 369 - 378
  • [4] Semantic annotation and retrieval of image collections
    Osman, Taha
    Thakker, Dhavalkumar
    Schaefer, Gerald
    Leroy, Maxime
    Fournier, Alain
    [J]. 21ST EUROPEAN CONFERENCE ON MODELLING AND SIMULATION ECMS 2007: SIMULATIONS IN UNITED EUROPE, 2007, : 324 - +
  • [5] A Semantic Annotation Method For Network Image
    Zhang Zeqing
    [J]. 2012 INTERNATIONAL CONFERENCE ON INDUSTRIAL CONTROL AND ELECTRONICS ENGINEERING (ICICEE), 2012, : 1807 - 1810
  • [6] A Hybrid Approach for Semantic Image Annotation
    Sezen, Arda
    Turhan, Cigdem
    Sengul, Gokhan
    [J]. IEEE ACCESS, 2021, 9 : 131977 - 131994
  • [7] Image Annotation Based on Semantic Rules
    Ion, A. L.
    [J]. HUMAN-COMPUTER SYSTEMS INTERACTION: BACKGROUNDS AND APPLICATIONS, 2009, 60 : 83 - 94
  • [8] A semantic approach for automatic image annotation
    Oujaoura, Mustapha
    Minaouf, Brahim
    Fakir, Mohammed
    [J]. 2013 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS: THEORIES AND APPLICATIONS (SITA), 2013,
  • [9] Semantic hierarchies for image annotation: A survey
    Tousch, Anne-Marie
    Herbin, Stephane
    Audibert, Jean-Yves
    [J]. PATTERN RECOGNITION, 2012, 45 (01) : 333 - 345
  • [10] Image Annotation Using a Semantic Hierarchy
    Bouzaieni, Abdessalem
    Tabbone, Salvatore
    [J]. STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2018, 2018, 11004 : 3 - 13