Semantic Regularisation for Recurrent Image Annotation

Cited by: 64
Authors
Liu, Feng [1, 2]
Xiang, Tao [2]
Hospedales, Timothy M. [3]
Yang, Wankou [1]
Sun, Changyin [1]
Affiliations
[1] Southeast Univ, Nanjing, Jiangsu, Peoples R China
[2] Queen Mary Univ London, London, England
[3] Univ Edinburgh, Edinburgh, Midlothian, Scotland
DOI
10.1109/CVPR.2017.443
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The "CNN-RNN" design pattern is increasingly widely applied in a variety of image annotation tasks, including multi-label classification and captioning. Existing models use the weakly semantic CNN hidden layer, or a transform of it, as the image embedding that provides the interface between the CNN and RNN. This leaves the RNN overstretched with two jobs: predicting the visual concepts and modelling their correlations to generate structured annotation output. Importantly, this makes end-to-end training of the CNN and RNN slow and ineffective, because it is difficult to back-propagate gradients through the RNN to train the CNN. We propose a simple modification to the design pattern that makes learning more effective and efficient. Specifically, we propose to use a semantically regularised embedding layer as the interface between the CNN and RNN. Regularising the interface can partially or completely decouple the two learning problems, allowing each to be trained more effectively and making joint training much more efficient. Extensive experiments show that state-of-the-art performance is achieved on multi-label classification as well as image captioning.
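The idea of a semantically regularised interface can be illustrated with a small sketch. It is not the authors' implementation: it assumes the regulariser is a per-concept sigmoid cross-entropy against ground-truth concept labels, and that the RNN's own annotation loss is computed elsewhere and passed in as a number. All function and parameter names (`semantic_embedding`, `joint_loss`, `reg_weight`) are hypothetical.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def semantic_embedding(cnn_features, weights, biases):
    """Map CNN features to per-concept probabilities.

    This vector plays the role of the interface handed to the RNN
    (assumption: one sigmoid unit per visual concept).
    """
    return [
        sigmoid(sum(w * f for w, f in zip(row, cnn_features)) + b)
        for row, b in zip(weights, biases)
    ]


def binary_cross_entropy(probs, targets, eps=1e-12):
    """Mean binary cross-entropy; used here as the assumed regulariser."""
    total = sum(
        t * math.log(p + eps) + (1 - t) * math.log(1 - p + eps)
        for p, t in zip(probs, targets)
    )
    return -total / len(probs)


def joint_loss(embedding, concept_targets, rnn_loss, reg_weight=1.0):
    """RNN annotation loss plus the weighted regulariser on the interface."""
    return rnn_loss + reg_weight * binary_cross_entropy(embedding, concept_targets)
```

Because the regulariser supervises the embedding directly, the CNN side receives a training signal that does not have to pass through the RNN, which is the decoupling the abstract describes. Setting `reg_weight` to zero recovers the unregularised interface under this sketch.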
Pages: 4160-4168
Page count: 9
Related papers
50 in total
  • [21] Incorporating Ontology and SPARQL for Semantic Image Annotation
    Kanimozhi, T.
    Christy, A.
    2013 IEEE CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES (ICT 2013), 2013, : 26 - 31
  • [22] Image Semantic Automatic Annotation by Relevance Feedback
    张同珍
    申瑞民
    Journal of Donghua University (English Edition), 2007, (05) : 662 - 666
  • [23] Coherent image annotation by learning semantic distance
    Mei, Tao
    Wang, Yong
    Hua, Xian-Sheng
    Gong, Shaogang
    Li, Shipeng
    2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 349 - 356
  • [24] Semantic Image Annotation via Hierarchical Classification
    Tsapatsoulis, Nicolas
    Ntalianis, Klimis
    MATHEMATICAL METHODS, COMPUTATIONAL TECHNIQUES, NON-LINEAR SYSTEMS, INTELLIGENT SYSTEMS, 2008, : 469 - +
  • [25] Medical Image Semantic Annotation Based on MIL
    Gang, Jia
    Yuan, Feng
    Bing, Zheng
    2013 ICME INTERNATIONAL CONFERENCE ON COMPLEX MEDICAL ENGINEERING (CME), 2013, : 85 - 90
  • [26] Research on graphical annotation and retrieval of image semantic
    Li, Qian-Qian
    Yang, Ai-Min
    PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 1565 - 1569
  • [27] Hierarchical Classification Scheme for Semantic Image Annotation
    Tsapatsoulis, Nicolas
    2009 FIRST INTERNATIONAL CONFERENCE ON ADVANCES IN MULTIMEDIA, 2009, : 194 - 200
  • [28] A Semantic Context Model for Automatic Image Annotation
    Fu, Xin
    Wang, Dong
    Niu, Sijie
    Zhang, Hengcai
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, PT II, 2018, 10955 : 536 - 542
  • [29] Image modeling with combined optimization techniques for image semantic annotation
    Dong Yang
    Ping Guo
    Neural Computing and Applications, 2011, 20 : 1001 - 1015
  • [30] Image modeling with combined optimization techniques for image semantic annotation
    Yang, Dong
    Guo, Ping
    NEURAL COMPUTING & APPLICATIONS, 2011, 20 (07): : 1001 - 1015