Automatic Image Annotation using Deep Learning Representations

被引:93
|
作者
Murthy, Venkatesh N. [1 ]
Maji, Subhransu [1 ]
Manmatha, R. [1 ]
机构
[1] Univ Massachusetts, Sch Comp Sci, Amherst, MA 01003 USA
关键词
Image annotation; deep learning; word embeddings; CCA;
D O I
10.1145/2671188.2749391
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose simple and effective models for the image annotation that make use of Convolutional Neural Network (CNN) features extracted from an image and word embedding vectors to represent their associated tags. Our first set of models is based on the Canonical Correlation Analysis (CCA) framework that helps in modeling both views visual features (CNN feature) and textual features (word embedding vectors) of the data. Results on all three variants of the CCA models, namely linear CCA, kernel CCA and CCA with k-nearest neighbor (CCA-KNN) clustering, are reported. The best results are obtained using CCA-KNN which outperforms previous results on the Corel-5k and the ESP-Game datasets and achieves comparable results on the IAPRTC-12 dataset. In our experiments we evaluate CNN features in the existing models which bring out the advantages of it over dozens of handcrafted features. We also demonstrate that word embedding vectors perform better than binary vectors as a representation of the tags associated with an image. In addition we compare the CCA model to a simple CNN based linear regression model, which allows the CNN layers to be trained using back-propagation.
引用
收藏
页码:603 / 606
页数:4
相关论文
共 50 条
  • [1] Automatic Image Annotation Using Adaptive Convolutional Deep Learning Model
    Jayaraj, R.
    Lokesh, S.
    [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 36 (01): : 481 - 497
  • [2] Automatic Image Annotation and Deep Learning for Tooth CT Image Segmentation
    Gou, Miao
    Rao, Yunbo
    Zhang, Minglu
    Sun, Jianxun
    Cheng, Keyang
    [J]. IMAGE AND GRAPHICS, ICIG 2019, PT II, 2019, 11902 : 519 - 528
  • [3] Semantics automatic annotation in medical image based on deep learning
    Yin, Shoulin
    Li, Hang
    Teng, Lin
    [J]. BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2018, 124 : 19 - 20
  • [4] Image Annotation Using Deep Learning: A Review
    Ojha, Utkarsh
    Adhikari, Utsav
    Singh, Dushyant Kumar
    [J]. PROCEEDINGS OF 2017 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL (I2C2), 2017,
  • [5] Automatic Annotation of Subsea Pipelines Using Deep Learning
    Stamoulakatos, Anastasios
    Cardona, Javier
    McCaig, Chris
    Murray, David
    Filius, Hein
    Atkinson, Robert
    Bellekens, Xavier
    Michie, Craig
    Andonovic, Ivan
    Lazaridis, Pavlos
    Hamilton, Andrew
    Hossain, Md Moinul
    Di Caterina, Gaetano
    Tachtatzis, Christos
    [J]. SENSORS, 2020, 20 (03)
  • [6] Automatic Annotation of Coral Reefs using Deep Learning
    Mahmood, A.
    Bennamoun, M.
    An, S.
    Sohel, F.
    Boussaid, F.
    Hovey, R.
    Kendrick, G.
    Fisher, R. B.
    [J]. OCEANS 2016 MTS/IEEE MONTEREY, 2016,
  • [7] Automatic Image Annotation using Word Embedding Learning
    Chen, Qi
    Yip, Andy M.
    Tan, Chew Lim
    [J]. 2012 IEEE 24TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2012), VOL 1, 2012, : 269 - 276
  • [8] Annotation-efficient deep learning for automatic medical image segmentation
    Shanshan Wang
    Cheng Li
    Rongpin Wang
    Zaiyi Liu
    Meiyun Wang
    Hongna Tan
    Yaping Wu
    Xinfeng Liu
    Hui Sun
    Rui Yang
    Xin Liu
    Jie Chen
    Huihui Zhou
    Ismail Ben Ayed
    Hairong Zheng
    [J]. Nature Communications, 12
  • [9] Annotation-efficient deep learning for automatic medical image segmentation
    Wang, Shanshan
    Li, Cheng
    Wang, Rongpin
    Liu, Zaiyi
    Wang, Meiyun
    Tan, Hongna
    Wu, Yaping
    Liu, Xinfeng
    Sun, Hui
    Yang, Rui
    Liu, Xin
    Chen, Jie
    Zhou, Huihui
    Ben Ayed, Ismail
    Zheng, Hairong
    [J]. NATURE COMMUNICATIONS, 2021, 12 (01)
  • [10] An Automatic Annotation Algorithm for Deep Learning Image Datasets Based on HOG Features
    Ye, Fei
    Zhang, Xiao-guo
    [J]. 2018 2ND INTERNATIONAL CONFERENCE ON APPLIED MATHEMATICS, MODELING AND SIMULATION (AMMS 2018), 2018, 305 : 128 - 135