Zero-shot Image Categorization by Image Correlation Exploration

被引:10
|
作者
Gao, LianLi [1 ]
Song, Jingkuan [2 ]
Shao, Junming [1 ]
Zhu, Xiaofeng [3 ]
Shen, Heng Tao [4 ]
机构
[1] Univ Elect Sci & Technol China, Chengdu, Peoples R China
[2] Univ Trento, Trento, Italy
[3] Guangxi Normal Univ, Guilin, Guangxi, Peoples R China
[4] Univ Queensland, Brisbane, Qld 4072, Australia
关键词
attributes; image categorization; zero-shot learning;
D O I
10.1145/2671188.2749309
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of image categorization from zero or only a few training examples, called zero-shot learning, occurs frequently, but it has hardly been studied in computer vision research. To tackle this problem, mid-level semantic attributes are introduced to identify image categories. For example, one can construct a classifier for the giant panda category by enumerating its attributes (e.g., black, white and four-footed) even without providing giant panda training images. Recently, several studies have investigated to learn attribute classifiers, based on which new classes can be detected. However, an often-encountered problem is the limited number of training data due to the time-consuming manual annotation of the attributes. Also, using single feature is hard to detect some attributes, e.g., the HSV feature is not robust enough to predict 'tusk' or 'flies' attributes. In this paper, we propose a unified semi-supervised learning (SSL) framework that learns the attribute classifiers by utilizing multiple feature and exploring the correlations between images. Specifically, we learn an optimal graph which embeds the relationships among the data points more accurately. Then, this graph is used to generate a geometrical regularizers for a semi-supervised learning model to learn the attribute classifier by utilizing both labeled and unlabeled images. Afterward, new classes can be detected based on their attribute representation. The use of SSL can boost the performances of attribute classifiers with very few training examples, and the adoption of multiple features makes the attribute prediction more robust. Experimental results on a series of real benchmark data sets suggest that semi-supervised learning do enhance the performances of attribute prediction and zero-shot categorization, compared with state-of-the-art methods.
引用
收藏
页码:487 / 490
页数:4
相关论文
共 50 条
  • [41] ZMFF: Zero-shot multi-focus image fusion
    Hu, Xingyu
    Jiang, Junjun
    Liu, Xianming
    Ma, Jiayi
    INFORMATION FUSION, 2023, 92 : 127 - 138
  • [42] Transferable Decoding with Visual Entities for Zero-Shot Image Captioning
    Fei, Junjie
    Wang, Teng
    Zhang, Jinrui
    He, Zhenyu
    Wang, Chengjie
    Zheng, Feng
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 3113 - 3123
  • [43] AnyDoor: Zero-shot Object-level Image Customization
    Chen, Xi
    Huang, Lianghua
    Liu, Yu
    Shen, Yujun
    Zhao, Deli
    Zhao, Hengshuang
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 6593 - 6602
  • [44] Review of Zero-Shot Remote Sensing Image Scene Classification
    Tan, Xiaomeng
    Xi, Bobo
    Li, Jiaojiao
    Zheng, Tie
    Li, Yunsong
    Xue, Changbin
    Chanussot, Jocelyn
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 11274 - 11289
  • [45] Inductive Zero-Shot Image Annotation via Embedding Graph
    Wang, Fangxin
    Liu, Jie
    Zhang, Shuwu
    Zhang, Guixuan
    Li, Yuejun
    Yuan, Fei
    IEEE ACCESS, 2019, 7 : 107816 - 107830
  • [46] Zero-shot unsupervised image-to-image translation via exploiting semantic attributes
    Chen, Yuanqi
    Yu, Xiaoming
    Liu, Shan
    Gao, Wei
    Li, Ge
    IMAGE AND VISION COMPUTING, 2022, 124
  • [47] Zero-shot unsupervised image-to-image translation via exploiting semantic attributes
    Chen, Yuanqi
    Yu, Xiaoming
    Liu, Shan
    Gao, Wei
    Li, Ge
    Image and Vision Computing, 2022, 124
  • [48] A Cross-Modal Alignment for Zero-Shot Image Classification
    Wu, Lu
    Wu, Chenyu
    Guo, Han
    Zhao, Zhihao
    IEEE ACCESS, 2023, 11 : 9067 - 9073
  • [49] Zero-shot image classification based on generative adversarial network
    Wei H.
    Zhang Y.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2019, 45 (12): : 2345 - 2350
  • [50] FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior
    Chen, Zhekai
    Wang, Wen
    Yang, Zhen
    Yuan, Zeqing
    Chen, Hao
    Shen, Chunhua
    COMPUTER VISION - ECCV 2024, PT XVII, 2025, 15075 : 70 - 87