Lightweight Vision Transformer for damaged wheat detection and classification using spectrograms

被引:0
|
作者
Lin, Hao [1 ]
Guo, Min [1 ]
Ma, Miao [1 ]
机构
[1] Shaanxi Normal Univ, Sch Comp Sci, Key Lab Modern Teaching Technol, Minist Educ, Xian, Peoples R China
基金
中国国家自然科学基金;
关键词
neural architecture search; auto machine learning; wheat kernels; classification; spectrogram;
D O I
10.1117/1.JEI.33.5.053063
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Grain is one of the basic human necessities, and its quality and safety directly impact human dietary health. Various issues occur during grain storage, primarily mold and pest infestation. With the development of artificial intelligence, increasingly, more technologies are applied to grain detection and classification. Transformer-based models are becoming popular in grain detection. Although transformer models exhibit excellent performance, they are often large and cumbersome, limiting practical applications. We propose a framework named KD-ASF based on intermediate layer knowledge distillation and one-shot neural architecture search, to optimize the hyperparameters of vision transformer (ViT) for detecting and classifying molded wheat kernels (MDK), Insect-Damaged wheat kernels (IDK), and undamaged wheat kernels (UDK). In KD-ASF, we use the ViT model as our teacher network. Next, we design a search space containing adjustable hyperparameters of transformer building blocks. The super-network stacks maximum transformer building blocks and is trained under the guidance of the teacher network. Subsequently, the trained super-network undergoes evolutionary search, and the resulting networks are used for classifying different wheat kernels. We conducted experiments using a five-fold cross-validation approach and obtained an F1 score of 97.13%, and the last model parameter size is only 5.94M. The results demonstrate that this method not only outperforms the majority of neural networks in terms of performance but also has a significantly smaller model size than most network models. Its lightweight nature facilitates easy deployment and application. These findings indicate that the structure of KD-ASF is feasible and effective. (c) 2024 SPIE and IS&T
引用
收藏
页数:16
相关论文
共 50 条
  • [1] SkinDistilViT: Lightweight Vision Transformer for Skin Lesion Classification
    Lungu-Stan, Vlad-Constantin
    Cercel, Dumitru-Clementin
    Pop, Florin
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT I, 2023, 14254 : 268 - 280
  • [2] Driver distraction detection using semi-supervised lightweight vision transformer
    Mohammed, Adam A. Q.
    Geng, Xin
    Wang, Jing
    Ali, Zafar
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 129
  • [3] Efficient identification and classification of apple leaf diseases using lightweight vision transformer (ViT)
    Ullah, Wasi
    Javed, Kashif
    Khan, Muhammad Attique
    Alghayadh, Faisal Yousef
    Bhatt, Mohammed Wasim
    Al Naimi, Imad Saud
    Ofori, Isaac
    DISCOVER SUSTAINABILITY, 2024, 5 (01):
  • [4] Underwater image enhancement using lightweight vision transformer
    Daud, Muneeba
    Afzal, Hammad
    Mahmood, Khawir
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (31) : 75603 - 75625
  • [5] DETECTION, ESTIMATION, AND CLASSIFICATION WITH SPECTROGRAMS
    ALTES, RA
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1980, 67 (04): : 1232 - 1246
  • [6] Detection and Classification of Mental Stress Using In-Ear Plethysmography and a Vision Transformer
    Barki, Hika
    Nkenyereye, Lionel
    Chung, Wan-Young
    IEEE SENSORS JOURNAL, 2025, 25 (02) : 4015 - 4027
  • [7] Lightweight Low-Rank Adaptation Vision Transformer Framework for Cervical Cancer Detection and Cervix Type Classification
    Hong, Zhenchen
    Xiong, Jingwei
    Yang, Han
    Mo, Yu K.
    BIOENGINEERING-BASEL, 2024, 11 (05):
  • [8] An Intelligent System for Outfall Detection in UAV Images Using Lightweight Convolutional Vision Transformer Network
    Yu, Mingxin
    Zhang, Ji
    Zhu, Lianqing
    Liang, Shengjun
    Lu, Wenshuai
    Ji, Xinglong
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 6265 - 6277
  • [9] Glaucoma Classification using Light Vision Transformer
    Singh P.B.
    Singh P.
    Dev H.
    Tiwari A.
    Batra D.
    Chaurasia B.K.
    EAI Endorsed Transactions on Pervasive Health and Technology, 2023, 9
  • [10] Diabetic Retinopathy Classification using Vision Transformer
    Mutawa, A. M.
    Sruthi, Sai
    2022 6TH EUROPEAN CONFERENCE ON ELECTRICAL ENGINEERING & COMPUTER SCIENCE, ELECS, 2022, : 25 - 30