Few-Shot Learning on Edge Devices Using CLIP: A Resource-Efficient Approach for Image Classification

被引：0

作者：

Lu, Jin ^{[1
]}

机构：

[1] Shenzhen Polytech Univ, Guangdong Key Lab Big Data Intelligence Vocat Educ, Shenzhen 518055, Guangdong, Peoples R China

来源：

INFORMATION TECHNOLOGY AND CONTROL | 2024年 / 53卷 / 03期

关键词：

Few-shot learning; CLIP model; image classification; edge devices; deep learnig;

D O I：

10.5755/j01.itc.53.3.36943

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In the field of deep learning, traditional image classification tasks typically require extensive annotated data-sets and complex model training processes, which pose significant challenges for deployment on resource-con-strained edge devices. To address these challenges, this study introduces a few-shot learning method based on OpenAI's CLIP model that significantly reduces computational demands by eliminating the need to run a text encoder at the inference stage. By pre-computing the embedding centers of classification text with a small set of image-text data, our approach enables the direct use of CLIP's image encoder and pre-calculated text embeddings for efficient image classification. This adaptation not only allows for high-precision classification tasks on edge devices with limited computing capabilities but also achieves accuracy and recall rates that close-ly approximate those of the pre-trained ResNet approach while using far less data. Furthermore, our method halves the memory usage compared to other large-scale visual models of similar capacity by avoiding the use of a text encoder during inference, making it particularly suitable for low-resource environments. This com-parative advantage underscores the efficiency of our approach in handling few-shot image classification tasks, demonstrating both competitive accuracy and practical viability in resource-limited settings. The outcomes of this research not only highlight the potential of the CLIP model in few-shot learning scenarios but also pave a new path for efficient, low-resource deep learning applications in edge computing environments

引用

页数：324

共 50 条

[31] A Deep few-shot learning algorithm for hyperspectral image classification
Liu B.
Zuo X.
Tan X.
Yu A.
Guo W.
Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2020, 49 (10): : 1331 - 1342
[32] SELF-SUPERVISED LEARNING FOR FEW-SHOT IMAGE CLASSIFICATION
Chen, Da
Chen, Yuefeng
Li, Yuhong
Mao, Feng
He, Yuan
Xue, Hui
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 1745 - 1749
[33] Dual class representation learning for few-shot image classification
Singh, Pravendra
Mazumder, Pratik
KNOWLEDGE-BASED SYSTEMS, 2022, 238
[34] MPPCANet: A feedforward learning strategy for few-shot image classification
Song, Yu
Chen, Changsheng
PATTERN RECOGNITION, 2021, 113
[35] Efficient few-shot machine learning for classification of EBSD patterns
Kevin Kaufmann
Hobson Lane
Xiao Liu
Kenneth S. Vecchio
Scientific Reports, 11
[36] A Differentiable Architecture Search Approach for Few-Shot Image Classification
He, Chunmao
Zhang, Lingyun
Huang, Songqing
Zhang, Pingjian
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 : 521 - 532
[37] Proto-Adapter: Efficient Training-Free CLIP-Adapter for Few-Shot Image Classification
Kato, Naoki
Nota, Yoshiki
Aoki, Yoshimitsu
SENSORS, 2024, 24 (11)
[38] Efficient few-shot machine learning for classification of EBSD patterns
Kaufmann, Kevin
Lane, Hobson
Liu, Xiao
Vecchio, Kenneth S.
SCIENTIFIC REPORTS, 2021, 11 (01)
[39] PROTODA: EFFICIENT TRANSFER LEARNING FOR FEW-SHOT INTENT CLASSIFICATION
Kumar, Manoj
Kumar, Varun
Glaude, Hadrien
Delichy, Cyprien
Alok, Aman
Gupta, Rahul
2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 966 - 972
[40] Complementing Representation Deficiency in Few-shot Image Classification: A Meta-Learning Approach
Zhong, Xian
Gu, Cheng
Huang, Wenxin
Li, Lin
Chen, Shuqin
Lin, Chia-Wen
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 2677 - 2684

← 1 2 3 4 5 →