Few-Shot Learning on Edge Devices Using CLIP: A Resource-Efficient Approach for Image Classification

被引:0
|
作者
Lu, Jin [1 ]
机构
[1] Shenzhen Polytech Univ, Guangdong Key Lab Big Data Intelligence Vocat Educ, Shenzhen 518055, Guangdong, Peoples R China
来源
INFORMATION TECHNOLOGY AND CONTROL | 2024年 / 53卷 / 03期
关键词
Few-shot learning; CLIP model; image classification; edge devices; deep learnig;
D O I
10.5755/j01.itc.53.3.36943
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the field of deep learning, traditional image classification tasks typically require extensive annotated data-sets and complex model training processes, which pose significant challenges for deployment on resource-con-strained edge devices. To address these challenges, this study introduces a few-shot learning method based on OpenAI's CLIP model that significantly reduces computational demands by eliminating the need to run a text encoder at the inference stage. By pre-computing the embedding centers of classification text with a small set of image-text data, our approach enables the direct use of CLIP's image encoder and pre-calculated text embeddings for efficient image classification. This adaptation not only allows for high-precision classification tasks on edge devices with limited computing capabilities but also achieves accuracy and recall rates that close-ly approximate those of the pre-trained ResNet approach while using far less data. Furthermore, our method halves the memory usage compared to other large-scale visual models of similar capacity by avoiding the use of a text encoder during inference, making it particularly suitable for low-resource environments. This com-parative advantage underscores the efficiency of our approach in handling few-shot image classification tasks, demonstrating both competitive accuracy and practical viability in resource-limited settings. The outcomes of this research not only highlight the potential of the CLIP model in few-shot learning scenarios but also pave a new path for efficient, low-resource deep learning applications in edge computing environments
引用
收藏
页数:324
相关论文
共 50 条
  • [31] A Deep few-shot learning algorithm for hyperspectral image classification
    Liu B.
    Zuo X.
    Tan X.
    Yu A.
    Guo W.
    Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2020, 49 (10): : 1331 - 1342
  • [32] SELF-SUPERVISED LEARNING FOR FEW-SHOT IMAGE CLASSIFICATION
    Chen, Da
    Chen, Yuefeng
    Li, Yuhong
    Mao, Feng
    He, Yuan
    Xue, Hui
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 1745 - 1749
  • [33] Dual class representation learning for few-shot image classification
    Singh, Pravendra
    Mazumder, Pratik
    KNOWLEDGE-BASED SYSTEMS, 2022, 238
  • [34] MPPCANet: A feedforward learning strategy for few-shot image classification
    Song, Yu
    Chen, Changsheng
    PATTERN RECOGNITION, 2021, 113
  • [35] Efficient few-shot machine learning for classification of EBSD patterns
    Kevin Kaufmann
    Hobson Lane
    Xiao Liu
    Kenneth S. Vecchio
    Scientific Reports, 11
  • [36] A Differentiable Architecture Search Approach for Few-Shot Image Classification
    He, Chunmao
    Zhang, Lingyun
    Huang, Songqing
    Zhang, Pingjian
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 : 521 - 532
  • [37] Proto-Adapter: Efficient Training-Free CLIP-Adapter for Few-Shot Image Classification
    Kato, Naoki
    Nota, Yoshiki
    Aoki, Yoshimitsu
    SENSORS, 2024, 24 (11)
  • [38] Efficient few-shot machine learning for classification of EBSD patterns
    Kaufmann, Kevin
    Lane, Hobson
    Liu, Xiao
    Vecchio, Kenneth S.
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [39] PROTODA: EFFICIENT TRANSFER LEARNING FOR FEW-SHOT INTENT CLASSIFICATION
    Kumar, Manoj
    Kumar, Varun
    Glaude, Hadrien
    Delichy, Cyprien
    Alok, Aman
    Gupta, Rahul
    2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 966 - 972
  • [40] Complementing Representation Deficiency in Few-shot Image Classification: A Meta-Learning Approach
    Zhong, Xian
    Gu, Cheng
    Huang, Wenxin
    Li, Lin
    Chen, Shuqin
    Lin, Chia-Wen
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 2677 - 2684