A novel image annotation model based on content representation with multi-layer segmentation

被引:9
|
作者
Zhang, Jing [1 ,2 ]
Zhao, Yaxin [1 ]
Li, Da [1 ]
Chen, Zhihua [1 ]
Yuan, Yubo [1 ]
机构
[1] E China Univ Sci & Technol, Dept Comp Sci & Engn, Shanghai 200237, Peoples R China
[2] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210008, Jiangsu, Peoples R China
来源
NEURAL COMPUTING & APPLICATIONS | 2015年 / 26卷 / 06期
关键词
Multi-label; Image content representation; Multi-layer segmentation; Bag-of-words; Conditional random fields; Image annotation; BAG-OF-WORDS; CLASSIFICATION; SVMS;
D O I
10.1007/s00521-014-1815-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image automatic annotation is an important issue of semantic-based image retrieval, and it is still a challenging problem for the reason of semantic gap. In this paper, a novel model with three parts is proposed. The first one is multi-layer image segmentation, in which saliency analysis and normalized cut are combined to segment images into semantic regions in the first layer. While in the second layer, the semantic regions are segmented into grids further . The second one is image content representation by region-based bag-of-words (RBoW) model, which is the variant of BoW model. Considering the correlations of labels, we adopt second-order CRFs as the third part of our model to ensure the accuracy of automatic image annotation. Experimental results show that our multi-layer segmentation-based image annotation model can achieve promising performance for multi-labeling and outperform the model based on single-layer segmentation and previous algorithm on Corel 5K and Pascal VOC 2007 datasets .
引用
收藏
页码:1407 / 1422
页数:16
相关论文
共 50 条
  • [21] Multi-layer Representation Learning for Robust OOD Image Classification
    Ballas, Aristotelis
    Diou, Christos
    [J]. PROCEEDINGS OF THE 12TH HELLENIC CONFERENCE ON ARTIFICIAL INTELLIGENCE, SETN 2022, 2022,
  • [22] A multi-layer MRF model for video object segmentation
    Kato, Z
    Pong, TC
    [J]. COMPUTER VISION - ACCV 2006, PT II, 2006, 3852 : 953 - 962
  • [23] Ship detection in SAR image based on multi-layer saliency model
    Hu, Qi
    Hu, Shaohai
    Liu, Shuaiqi
    [J]. Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2024, 46 (02): : 478 - 487
  • [24] Designing a symmetric classifier for image annotation using multi-layer sparse coding
    Tariq, Amara
    Foroosh, Hassan
    [J]. IMAGE AND VISION COMPUTING, 2018, 69 : 33 - 43
  • [25] Unsupervised RGB-D Image Segmentation by Multi-layer Clustering
    Khan, Mahfuzur Rahman
    Rahman, A. B. M. Muhitur
    Rahaman, G. M. Atiqur
    Hasnat, Md Abul
    [J]. 2016 5TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS AND VISION (ICIEV), 2016, : 719 - 724
  • [26] Multi-Layer Deep Sparse Representation for Biological Slice Image Inpainting
    Hu, Haitao
    Ma, Hongmei
    Mei, Shuli
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 76 (03): : 3813 - 3832
  • [27] A Novel Model for Multi-label Image Annotation
    Wu, Xinjian
    Zhang, Li
    Li, Fanzhang
    Wang, Bangjun
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1953 - 1958
  • [28] A Multi-layer Representation Model for Multimedia Web Page and Topic
    Shi, Peng
    Hu, Changjun
    Ni, Yuemin
    Ding, Lianhong
    [J]. PACIIA: 2008 PACIFIC-ASIA WORKSHOP ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION, VOLS 1-3, PROCEEDINGS, 2008, : 1200 - +
  • [29] A multi-layer text classification framework based on two-level representation model
    Yun, Jiali
    Jing, Liping
    Yu, Jian
    Huang, Houkuan
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (02) : 2035 - 2046
  • [30] Hybrid CNN-Transformer model for medical image segmentation with pyramid convolution and multi-layer perceptron
    Liu, Xiaowei
    Hu, Yikun
    Chen, Jianguo
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 86