A Multitask Learning-Based Vision Transformer for Plant Disease Localization and Classification

Authors
Hemalatha, S. [1]
Jayachandran, Jai Jaganath Babu [2]
Affiliations
[1] Rajalakshmi Engn Coll, Dept Artificial Intelligence & Machine Learning, Chennai 602105, India
[2] Chennai Inst Technol, Dept Biomed Engn, Chennai 600069, India
Keywords
Plant disease; Classification; Localization; Vision transformer; Multi-task learning
DOI
10.1007/s44196-024-00597-3
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Plant disease detection is a critical task in agriculture, essential for ensuring crop health and productivity. Traditional methods are often labor-intensive and error-prone, highlighting the need for automated solutions. Computer vision-based solutions have been successfully deployed in recent years for plant disease identification and localization, but the two tasks are often handled independently, leading to suboptimal performance. An integrated solution that combines them is needed for improved efficiency and accuracy. This research proposes the Plant Disease Localization and Classification model based on Vision Transformer (PDLC-ViT), which integrates co-scale, co-attention, and cross-attention mechanisms with a ViT within a Multi-Task Learning (MTL) framework. The model was trained and evaluated on the Plant Village dataset. Key hyperparameters, including learning rate, batch size, dropout ratio, and regularization factor, were optimized through a grid search. Early stopping based on validation loss was employed to prevent overfitting. The integration of co-scale, co-attention, and cross-attention mechanisms allowed the model to capture multi-scale dependencies and enhance feature learning, leading to superior performance compared to existing models in both localization and classification. Evaluated on two public datasets, the PDLC-ViT model achieved an accuracy of 99.97%, a Mean Average Precision (MAP) of 99.18%, and a Mean Average Recall (MAR) of 99.11%. These results underscore the model's precision and recall, highlighting its robustness and reliability in detecting and classifying plant diseases. The PDLC-ViT model sets a new benchmark in plant disease detection, offering a reliable tool for agricultural applications. By integrating localization and classification within an MTL framework, it promotes timely and accurate disease management, contributing to sustainable agriculture and food security.
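The abstract mentions two training procedures without giving details: a grid search over learning rate, batch size, dropout ratio, and regularization factor, and early stopping on validation loss. The following is a minimal, framework-free sketch of how such procedures are commonly written. It is not the authors' implementation. The names `EarlyStopping`, `grid_search`, `run_with_early_stopping`, and `SEARCH_SPACE`, the patience value, and every candidate value in the search space are assumptions made for illustration.

```python
from itertools import product

# Hypothetical search space: the abstract names these four hyperparameters
# but does not report the candidate values that were tried.
SEARCH_SPACE = {
    "learning_rate": [1e-4, 1e-3],
    "batch_size": [16, 32],
    "dropout": [0.1, 0.3],
    "weight_decay": [0.0, 1e-4],  # stands in for the "regularization factor"
}


class EarlyStopping:
    """Signal a stop once validation loss fails to improve for `patience` epochs."""

    def __init__(self, patience=3, min_delta=0.0):
        self.patience = patience
        self.min_delta = min_delta
        self.best_loss = float("inf")
        self.bad_epochs = 0

    def step(self, val_loss):
        """Record one epoch's validation loss; return True when training should stop."""
        if val_loss < self.best_loss - self.min_delta:
            self.best_loss = val_loss
            self.bad_epochs = 0
        else:
            self.bad_epochs += 1
        return self.bad_epochs >= self.patience


def run_with_early_stopping(val_losses, patience=3):
    """Replay per-epoch validation losses; return (epochs_run, best_loss)."""
    stopper = EarlyStopping(patience=patience)
    epochs_run = 0
    for loss in val_losses:
        epochs_run += 1
        if stopper.step(loss):
            break
    return epochs_run, stopper.best_loss


def grid_search(grid, train_fn):
    """Evaluate every combination in `grid`; return (best_config, best_val_loss).

    `train_fn` is a caller-supplied function that trains a model with the
    given configuration and returns its validation loss.
    """
    names = list(grid)
    best_config, best_loss = None, float("inf")
    for values in product(*(grid[name] for name in names)):
        config = dict(zip(names, values))
        val_loss = train_fn(config)
        if val_loss < best_loss:  # ties keep the first configuration found
            best_config, best_loss = config, val_loss
    return best_config, best_loss
```

In practice `train_fn` would train the model for one configuration, using `EarlyStopping` inside its epoch loop, and return the best validation loss it observed. The grid above has 2 x 2 x 2 x 2 = 16 combinations, so the cost of the search grows multiplicatively with each added hyperparameter value.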
Pages: 21