Multi-feature language-image model for fruit quality image classification

被引:0
|
作者
Duan, Jie-li [1 ]
Lai, Li-qian [1 ,3 ]
Yang, Zhou [1 ,2 ]
Luo, Zhi-jian [3 ]
Yuan, Hao-tian [1 ]
机构
[1] South China Agr Univ, Coll Engn, Guangzhou 510642, Guangdong, Peoples R China
[2] Guangdong Ocean Univ, Sch Mech Engn, Zhanjiang 524088, Guangdong, Peoples R China
[3] Jiaying Univ, Coll Comp Sci, Meizhou 514015, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Artificial intelligence; Fruit quality classification; Language-image model; Few-shot learning;
D O I
10.1016/j.compag.2024.109462
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Fruit quality classification has a great impact on the modern fruit industry. However, deep learning methods for fruit quality classification often demand a substantial number of labeled samples, which are hard and expensive to collect in many real-world applications, resulting in overfitting and low generalization. The Contrastive Language-Image Pre-Training (CLIP) model, which fuses image and text features, has demonstrated excellent performance in zero-shot classification. Inspired by CLIP, in this paper, we propose a multi-feature language-image (MFLI) model for fruit quality classification, where the fruit image and feature text are fused to enhance feature extraction. Furthermore, we construct a pomelo quality dataset containing first- and secondgrade pomelo. Based on the zero-shot learning results of CLIP on this dataset, we provide recommendations for pre-prompt and multi-feature text. Experimental results show that in both zero-shot, few-shot, and conventional learning sceneries, our MFLI model outperforms state-of-the-art models on seven types of fruits, demonstrating excellent generalization capabilities.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] EPLL image denoising with multi-feature dictionaries
    Yu, Qiqiong
    Cao, Guo
    Shi, Hao
    Zhang, Youqiang
    Fu, Peng
    DIGITAL SIGNAL PROCESSING, 2023, 137
  • [32] Complex matrix and multi-feature collaborative learning for polarimetric SAR image classification
    Shi, Junfei
    Wang, Wei
    Jin, Haiyan
    He, Tiansheng
    APPLIED SOFT COMPUTING, 2023, 134
  • [33] Multi-feature Late Fusion for Image Tagging
    Liu, Xi
    Liu, Rujie
    Cao, Qiong
    Li, Fei
    2013 SECOND IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR 2013), 2013, : 34 - 37
  • [34] Multi-Feature Fusion with SVM Classification for Crime Scene Investigation Image Retrieval
    Liu, Ying
    Wang, Fuping
    Hu, Dan
    Fan, Jiulun
    2017 IEEE 2ND INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP), 2017, : 160 - 165
  • [35] Adaptive Multi-Feature Fusion Graph Convolutional Network for Hyperspectral Image Classification
    Liu, Jie
    Guan, Renxiang
    Li, Zihao
    Zhang, Jiaxuan
    Hu, Yaowen
    Wang, Xueyong
    REMOTE SENSING, 2023, 15 (23)
  • [36] The language-image as textual interaction
    Lerchner, G
    ZEITSCHRIFT FUR GERMANISTIK, 2000, 10 (03): : 706 - 708
  • [37] MULTI-FEATURE MUTUAL INFORMATION IMAGE REGISTRATION
    Tomazevic, Dejan
    Likar, Bostjan
    Pernus, Franjo
    IMAGE ANALYSIS & STEREOLOGY, 2012, 31 (01): : 43 - 53
  • [38] Image retrieval based on multi-feature fusion
    Dong Wenfei
    Yu Shuchun
    Liu Songyu
    Zhang Zhiqiang
    Gu Wenbo
    2014 FOURTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION AND MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC), 2014, : 240 - 243
  • [39] A Multi-Feature Fusion Model Based on Denoising Convolutional Neural Network and Attention Mechanism for Image Classification
    Zhang, Jingsi
    Yu, Xiaosheng
    Lei, Xiaoliang
    Wu, Chengdong
    INTERNATIONAL JOURNAL OF SWARM INTELLIGENCE RESEARCH, 2023, 14 (02)
  • [40] Multi-feature contrastive learning for unpaired image-to-image translation
    Gou, Yao
    Li, Min
    Song, Yu
    He, Yujie
    Wang, Litao
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (04) : 4111 - 4122