Zero-shot Image Categorization by Image Correlation Exploration

被引:10
|
作者
Gao, LianLi [1 ]
Song, Jingkuan [2 ]
Shao, Junming [1 ]
Zhu, Xiaofeng [3 ]
Shen, Heng Tao [4 ]
机构
[1] Univ Elect Sci & Technol China, Chengdu, Peoples R China
[2] Univ Trento, Trento, Italy
[3] Guangxi Normal Univ, Guilin, Guangxi, Peoples R China
[4] Univ Queensland, Brisbane, Qld 4072, Australia
关键词
attributes; image categorization; zero-shot learning;
D O I
10.1145/2671188.2749309
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of image categorization from zero or only a few training examples, called zero-shot learning, occurs frequently, but it has hardly been studied in computer vision research. To tackle this problem, mid-level semantic attributes are introduced to identify image categories. For example, one can construct a classifier for the giant panda category by enumerating its attributes (e.g., black, white and four-footed) even without providing giant panda training images. Recently, several studies have investigated to learn attribute classifiers, based on which new classes can be detected. However, an often-encountered problem is the limited number of training data due to the time-consuming manual annotation of the attributes. Also, using single feature is hard to detect some attributes, e.g., the HSV feature is not robust enough to predict 'tusk' or 'flies' attributes. In this paper, we propose a unified semi-supervised learning (SSL) framework that learns the attribute classifiers by utilizing multiple feature and exploring the correlations between images. Specifically, we learn an optimal graph which embeds the relationships among the data points more accurately. Then, this graph is used to generate a geometrical regularizers for a semi-supervised learning model to learn the attribute classifier by utilizing both labeled and unlabeled images. Afterward, new classes can be detected based on their attribute representation. The use of SSL can boost the performances of attribute classifiers with very few training examples, and the adoption of multiple features makes the attribute prediction more robust. Experimental results on a series of real benchmark data sets suggest that semi-supervised learning do enhance the performances of attribute prediction and zero-shot categorization, compared with state-of-the-art methods.
引用
收藏
页码:487 / 490
页数:4
相关论文
共 50 条
  • [21] SEMANTIC AUGMENTATION HASHING FOR ZERO-SHOT IMAGE RETRIEVAL
    Zhong, Fangming
    Chen, Zhikui
    Min, Geyong
    Xia, Feng
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1943 - 1947
  • [22] Enhanced VAEGAN: a zero-shot image classification method
    Bo Ding
    Yufei Fan
    Yongjun He
    Jing Zhao
    Applied Intelligence, 2023, 53 : 9235 - 9246
  • [23] Dual triplet network for image zero-shot learning
    Ji, Zhong
    Wang, Hai
    Pang, Yanwei
    Shao, Ling
    NEUROCOMPUTING, 2020, 373 : 90 - 97
  • [24] Zero-shot Image Tagging by Hierarchical Semantic Embedding
    Li, Xirong
    Liao, Shuai
    Lan, Weiyu
    Du, Xiaoyong
    Yang, Gang
    SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 879 - 882
  • [25] Zero-shot Hashing with orthogonal projection for image retrieval
    Zhang, Haofeng
    Long, Yang
    Shao, Ling
    PATTERN RECOGNITION LETTERS, 2019, 117 : 201 - 209
  • [26] Zero-Shot Audio Classification using Image Embeddings
    Dogan, Duygu
    Xie, Huang
    Heittola, Toni
    Virtanen, Tuomas
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 1 - 5
  • [27] Transductive Zero-Shot Hashing for Multilabel Image Retrieval
    Zou, Qin
    Cao, Ling
    Zhang, Zheng
    Chen, Long
    Wang, Song
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (04) : 1673 - 1687
  • [28] Zero-Shot Food Image Detection Based on Transformer
    Song, Jingru
    Min, Weiqing
    Zhou, Pengfei
    Rao, Quanrui
    Sheng, Guorui
    Yang, Yancun
    Wang, Lili
    Jiang, Shuqiang
    Science and Technology of Food Industry, 2024, 45 (22) : 18 - 26
  • [29] Ref-ZSSR: Zero-Shot Single Image Superresolution with Reference Image
    Han, Xianjun
    Wang, Xue
    Wang, Huabin
    Li, Xuejun
    Yang, Hongyu
    COMPUTER GRAPHICS FORUM, 2022, 41 (07) : 311 - 322
  • [30] Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning
    Luol, Jianjie
    Chen, Jingwen
    Li, Yehao
    Pan, Yingwei
    Feng, Jianlin
    Cha, Hongyang
    Yao, Ting
    COMPUTER VISION-ECCV 2024, PT LVII, 2025, 15115 : 237 - 254