Label Propagation for Zero-shot Classification with Vision-Language Models

被引:0
|
作者
Stojnic, Vladan [1 ]
Kalantidis, Yannis [2 ]
Tolias, Giorgos [1 ]
机构
[1] Czech Tech Univ, FEE, VRG, Prague, Czech Republic
[2] NAVER LABS Europe, Meylan, France
关键词
D O I
10.1109/CVPR52733.2024.02190
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Vision-Language Models (VLMs) have demonstrated impressive performance on zero-shot classification, i.e. classification when provided merely with a list of class names. In this paper, we tackle the case of zero-shot classification in the presence of unlabeled data. We leverage the graph structure of the unlabeled data and introduce ZLaP, a method based on label propagation (LP) that utilizes geodesic distances for classification. We tailor LP to graphs containing both text and image features and further propose an efficient method for performing inductive inference based on a dual solution and a sparsification step. We perform extensive experiments to evaluate the effectiveness of our method on 14 common datasets and show that ZLaP outperforms the latest related works. Code: https://github.com/vladan-stojnic/ZLaP
引用
收藏
页码:23209 / 23218
页数:10
相关论文
共 50 条
  • [31] ZERO-SHOT AUDIO CLASSIFICATION BASED ON CLASS LABEL EMBEDDINGS
    Xie, Huang
    Virtanen, Tuomas
    2019 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2019, : 264 - 267
  • [32] Large Language Models as Zero-Shot Conversational Recommenders
    He, Zhankui
    Xie, Zhouhang
    Jha, Rahul
    Steck, Harald
    Liang, Dawen
    Feng, Yesu
    Majumder, Bodhisattwa Prasad
    Kallus, Nathan
    McAuley, Julian
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 720 - 730
  • [33] Few-Shot Adaptation of Medical Vision-Language Models
    Shakeri, Fereshteh
    Huang, Yunshi
    Silva-Rodriguez, Julio
    Bahig, Houda
    Tang, An
    Dolz, Jose
    Ben Ayed, Ismail
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT XII, 2024, 15012 : 553 - 563
  • [34] Zero-Shot Classification by Logical Reasoning on Natural Language Explanations
    Han, Chi
    Pei, Hengzhi
    Du, Xinya
    Ji, Heng
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 8967 - 8981
  • [35] Extensible Prompts for Language Models on Zero-shot Language Style Customization
    Ge, Tao
    Hu, Jing
    Dong, Li
    Mao, Shaoguang
    Xia, Yan
    Wang, Xun
    Chen, Si-Qing
    Wei, Furu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [36] MULTI-LABEL ZERO-SHOT AUDIO CLASSIFICATION WITH TEMPORAL ATTENTION
    Dogan, Duygu
    Xie, Huang
    Heittola, Toni
    Virtanen, Tuomas
    2024 18TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT, IWAENC 2024, 2024, : 250 - 254
  • [37] Deep Ranking for Image Zero-Shot Multi-Label Classification
    Ji, Zhong
    Cui, Biying
    Li, Huihui
    Jiang, Yu-Gang
    Xiang, Tao
    Hospedales, Timothy
    Fu, Yanwei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 6549 - 6560
  • [38] Semantic Diversity Learning for Zero-Shot Multi-label Classification
    Ben-Cohen, Avi
    Zamir, Nadav
    Ben Baruch, Emanuel
    Friedman, Itamar
    Zelnik-Manor, Lihi
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 620 - 630
  • [39] The Benefits of Label-Description Training for Zero-Shot Text Classification
    Gao, Lingyu
    Ghosh, Debanjan
    Gimpel, Kevin
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 13823 - 13844
  • [40] Semi-Supervised Zero-Shot Classification with Label Representation Learning
    Li, Xin
    Guo, Yuhong
    Schuurmans, Dale
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4211 - 4219