Fine-grained Image Classification by Visual-Semantic Embedding

Cited by: 0
Authors
Xu, Huapeng [1]
Qi, Guilin [1]
Li, Jingjing [2]
Wang, Meng [3]
Xu, Kang [4]
Gao, Huan [1]
Affiliations
[1] Southeast Univ, Nanjing, Peoples R China
[2] Univ Elect Sci & Technol China, Chengdu, Peoples R China
[3] Xi An Jiao Tong Univ, Xian, Peoples R China
[4] Nanjing Univ Posts & Telecommun, Nanjing, Peoples R China
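Funding
National Key Research and Development Program of China; China Postdoctoral Science Foundation; National Natural Science Foundation of China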
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper investigates the challenging problem of fine-grained image classification (FGIC). Unlike conventional computer vision problems, FGIC suffers from large intra-class diversity and subtle inter-class differences. Existing FGIC approaches are limited to exploring only the visual information embedded in the images. In this paper, we present a novel approach that uses readily available prior knowledge from either structured knowledge bases or unstructured text to facilitate FGIC. Specifically, we propose a visual-semantic embedding model that explores semantic embeddings from knowledge bases and text, and further trains a novel end-to-end CNN framework to linearly map image features into a rich semantic embedding space. Experimental results on the challenging large-scale UCSD Bird-200-2011 dataset verify that our approach outperforms several state-of-the-art methods by a significant margin.
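The central idea in the abstract, linearly mapping image features into a semantic embedding space and classifying there, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the abstract does not state the scoring function, the loss, or the dimensions, so cosine similarity, the toy class names, the matrix `W`, and the feature values are all hypothetical and are not the authors' implementation.

```python
import math

def project(W, x):
    """Linearly map an image feature vector x into the semantic space (W @ x)."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def classify(W, x, class_embeddings):
    """Pick the class whose semantic embedding is closest to the projected feature."""
    z = project(W, x)
    return max(class_embeddings, key=lambda name: cosine(z, class_embeddings[name]))

# Hypothetical 2-D semantic embeddings, standing in for vectors that would
# come from a knowledge base or a text corpus.
class_embeddings = {
    "laysan_albatross": [1.0, 0.0],
    "sooty_albatross": [0.0, 1.0],
}

# Hypothetical linear map (semantic_dim x feature_dim); in a real system this
# would be learned jointly with the CNN rather than fixed by hand.
W = [
    [1.0, 0.0, 0.5],
    [0.0, 1.0, 0.5],
]

# Hypothetical 3-D image feature, standing in for a CNN output.
image_feature = [0.9, 0.1, 0.2]

predicted = classify(W, image_feature, class_embeddings)
print(predicted)
```

Because the class representations live in the semantic space rather than in a fixed softmax layer, prior knowledge about how classes relate to one another can influence which label is nearest to a projected image.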
Pages: 1043-1049
Page count: 7