Lightweight Vision Transformer for damaged wheat detection and classification using spectrograms

Authors
Lin, Hao [1]
Guo, Min [1]
Ma, Miao [1]
Affiliations
[1] Shaanxi Normal Univ, Sch Comp Sci, Key Lab Modern Teaching Technol, Minist Educ, Xian, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
neural architecture search; auto machine learning; wheat kernels; classification; spectrogram
DOI
10.1117/1.JEI.33.5.053063
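Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809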
Abstract
Grain is a basic human necessity, and its quality and safety directly affect dietary health. Several problems arise during grain storage, chiefly mold and pest infestation. With the development of artificial intelligence, a growing number of techniques are being applied to grain detection and classification, and transformer-based models are becoming popular for this task. Although transformer models perform well, they are often large and cumbersome, which limits practical applications. We propose a framework named KD-ASF, based on intermediate-layer knowledge distillation and one-shot neural architecture search, to optimize the hyperparameters of the vision transformer (ViT) for detecting and classifying molded wheat kernels (MDK), insect-damaged wheat kernels (IDK), and undamaged wheat kernels (UDK). In KD-ASF, we use the ViT model as the teacher network. We then design a search space containing the adjustable hyperparameters of the transformer building blocks. The super-network stacks the maximum number of transformer building blocks and is trained under the guidance of the teacher network. The trained super-network then undergoes evolutionary search, and the resulting networks are used to classify the different wheat kernels. Experiments with five-fold cross-validation yielded an F1 score of 97.13%, and the final model has only 5.94M parameters. The results show that the method outperforms the majority of neural networks in performance while being significantly smaller than most network models. Its lightweight nature makes it easy to deploy and apply. These findings indicate that the structure of KD-ASF is feasible and effective. (c) 2024 SPIE and IS&T
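The evolutionary search step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the search space values, the parameter-count estimate, the parameter budget, and the `proxy_fitness` function are all assumptions made for illustration. In a one-shot setting, the fitness would normally be the validation score of a sub-network that inherits its weights from the trained super-network; here a toy stand-in is used so the example runs on its own.

```python
import itertools
import random

# Hypothetical search space; the abstract does not list the actual choices.
SEARCH_SPACE = {
    "depth": [4, 6, 8],
    "embed_dim": [128, 192, 256],
    "num_heads": [2, 4],
    "mlp_ratio": [2, 3, 4],
}


def param_count(cfg):
    """Rough weight count of the transformer blocks (biases, norms, embeddings ignored)."""
    d = cfg["embed_dim"]
    attn = 4 * d * d                    # Q, K, V and output projections
    mlp = 2 * d * d * cfg["mlp_ratio"]  # two linear layers of the MLP
    return cfg["depth"] * (attn + mlp)


def sample(rng):
    """Draw one random configuration from the search space."""
    return {k: rng.choice(v) for k, v in SEARCH_SPACE.items()}


def mutate(cfg, rng, prob=0.3):
    """Resample each hyperparameter independently with probability `prob`."""
    child = dict(cfg)
    for key, choices in SEARCH_SPACE.items():
        if rng.random() < prob:
            child[key] = rng.choice(choices)
    return child


def crossover(a, b, rng):
    """Pick each hyperparameter from one of the two parents."""
    return {k: rng.choice([a[k], b[k]]) for k in SEARCH_SPACE}


def evolutionary_search(fitness, max_params, generations=10,
                        pop_size=20, top_k=5, seed=0):
    """Return the best configuration found under the parameter budget."""
    rng = random.Random(seed)

    def feasible(cfg):
        return param_count(cfg) <= max_params

    # Guard against a budget that no configuration can satisfy.
    keys = list(SEARCH_SPACE)
    all_cfgs = (dict(zip(keys, vals))
                for vals in itertools.product(*SEARCH_SPACE.values()))
    if not any(feasible(c) for c in all_cfgs):
        raise ValueError("no configuration fits the parameter budget")

    population = []
    while len(population) < pop_size:
        cfg = sample(rng)
        if feasible(cfg):
            population.append(cfg)

    for _ in range(generations):
        # Keep the top_k candidates and refill the population from them.
        parents = sorted(population, key=fitness, reverse=True)[:top_k]
        children = list(parents)
        while len(children) < pop_size:
            if rng.random() < 0.5:
                child = mutate(rng.choice(parents), rng)
            else:
                child = crossover(rng.choice(parents), rng.choice(parents), rng)
            if feasible(child):
                children.append(child)
        population = children

    return max(population, key=fitness)


def proxy_fitness(cfg):
    """Toy stand-in for sub-network validation accuracy (assumption, not from the paper)."""
    return cfg["depth"] * cfg["embed_dim"]


if __name__ == "__main__":
    best = evolutionary_search(proxy_fitness, max_params=6_000_000)
    print(best, param_count(best))
```

The budget of 6,000,000 parameters is an arbitrary illustrative value, not a constraint stated in the abstract. Replacing `proxy_fitness` with an evaluation of weight-sharing sub-networks would bring the sketch closer to a one-shot search.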
Pages: 16