Gaze Embeddings for Zero-Shot Image Classification

被引:49
|
作者
Karessli, Nour [1 ,3 ]
Akata, Zeynep [1 ,2 ]
Schiele, Bernt [1 ]
Bulling, Andreas [1 ]
机构
[1] Max Planck Inst Informat, Saarland Informat Campus, Saarbrucken, Germany
[2] Univ Amsterdam, Amsterdam Machine Learning Lab, Amsterdam, Netherlands
[3] Eyeem, Berlin, Germany
关键词
D O I
10.1109/CVPR.2017.679
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero-shot image classification using auxiliary information, such as attributes describing discriminative object properties, requires time-consuming annotation by domain experts. We instead propose a method that relies on human gaze as auxiliary information, exploiting that even nonexpert users have a natural ability to judge class membership. We present a data collection paradigm that involves a discrimination task to increase the information content obtained from gaze data. Our method extracts discriminative descriptors from the data and learns a compatibility function between image and gaze using three novel gaze embeddings: Gaze Histograms (GH), Gaze Features with Grid (GFG) and Gaze Features with Sequence (GFS). We introduce two new gaze-annotated datasets for fine-grained image classification and show that human gaze data is indeed class discriminative, provides a competitive alternative to expert-annotated attributes, and outperforms other baselines for zero-shot image classification.
引用
收藏
页码:6412 / 6421
页数:10
相关论文
共 50 条
  • [1] Zero-Shot Audio Classification using Image Embeddings
    Dogan, Duygu
    Xie, Huang
    Heittola, Toni
    Virtanen, Tuomas
    [J]. 2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 1 - 5
  • [2] Latent Embeddings for Zero-shot Classification
    Xian, Yongqin
    Akata, Zeynep
    Sharma, Gaurav
    Nguyen, Quynh
    Hein, Matthias
    Schiele, Bernt
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 69 - 77
  • [3] Zero-Shot Audio Classification Via Semantic Embeddings
    Xie, Huang
    Virtanen, Tuomas
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 1233 - 1242
  • [4] ZERO-SHOT AUDIO CLASSIFICATION BASED ON CLASS LABEL EMBEDDINGS
    Xie, Huang
    Virtanen, Tuomas
    [J]. 2019 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2019, : 264 - 267
  • [5] Multimodal Ensembling for Zero-Shot Image Classification
    Hickmon, Javon
    [J]. THIRTY-EIGTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024, : 23747 - 23749
  • [6] Zero-Shot Image Classification Based on Attribute
    Zhang, Wei
    Chen, Wenbai
    Chen, Xiangfeng
    Han, Hu
    [J]. 2017 INTERNATIONAL CONFERENCE ON SECURITY, PATTERN ANALYSIS, AND CYBERNETICS (SPAC), 2017, : 25 - 30
  • [7] Method for improving zero-shot image classification
    Chen, Xiangfeng
    Chen, Wenbai
    Zhang, Chong
    Lv, Mengyao
    Han, Hu
    [J]. JOURNAL OF ENGINEERING-JOE, 2018, (16): : 1688 - 1691
  • [8] Enhanced VAEGAN: a zero-shot image classification method
    Ding, Bo
    Fan, Yufei
    He, Yongjun
    Zhao, Jing
    [J]. APPLIED INTELLIGENCE, 2023, 53 (08) : 9235 - 9246
  • [9] Zero-shot image classification based on factor space
    Guan, Shijie
    Guan, Qixue
    Yin, Anqi
    [J]. Guan, Shijie (shijieguan@sylu.edu.cn), 1600, Inderscience Publishers (16): : 1 - 29
  • [10] Enhanced VAEGAN: a zero-shot image classification method
    Bo Ding
    Yufei Fan
    Yongjun He
    Jing Zhao
    [J]. Applied Intelligence, 2023, 53 : 9235 - 9246