A Multitask Learning-Based Vision Transformer for Plant Disease Localization and Classification

被引:0
|
作者
Hemalatha, S. [1 ]
Jayachandran, Jai Jaganath Babu [2 ]
机构
[1] Rajalakshmi Engn Coll, Dept Artificial Intelligence & Machine Learning, Chennai 602105, India
[2] Chennai Inst Technol, Dept Biomed Engn, Chennai 600069, India
关键词
Plant disease; Classification; Localization; Vision transformer; Multi-task learning;
D O I
10.1007/s44196-024-00597-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Plant disease detection is a critical task in agriculture, essential for ensuring crop health and productivity. Traditional methods in this context are often labor-intensive and prone to errors, highlighting the need for automated solutions. While computer vision-based solutions have been successfully deployed in recent years for plant disease identification and localization tasks, these often operate independently, leading to suboptimal performance. It is essential to develop an integrated solution combining these two tasks for improved efficiency and accuracy. This research proposes the innovative Plant Disease Localization and Classification model based on Vision Transformer (PDLC-ViT), which integrates co-scale, co-attention, and cross-attention mechanisms and a ViT, within a Multi-Task Learning (MTL) framework. The model was trained and evaluated on the Plant Village dataset. Key hyperparameters, including learning rate, batch size, dropout ratio, and regularization factor, were optimized through a thorough grid search. Early stopping based on validation loss was employed to prevent overfitting. The PDLC-ViT model demonstrated significant improvements in plant disease localization and classification tasks. The integration of co-scale, co-attention, and cross-attention mechanisms allowed the model to capture multi-scale dependencies and enhance feature learning, leading to superior performance compared to existing models. The PDLC-ViT model evaluated on two public datasets achieved an accuracy of 99.97%, a Mean Average Precision (MAP) of 99.18%, and a Mean Average Recall (MAR) of 99.11%. These results underscore the model's exceptional precision and recall, highlighting its robustness and reliability in detecting and classifying plant diseases. The PDLC-ViT model sets a new benchmark in plant disease detection, offering a reliable and advanced tool for agricultural applications. Its ability to integrate localization and classification tasks within an MTL framework promotes timely and accurate disease management, contributing to sustainable agriculture and food security.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] Multitask Learning-Based Reliability Analysis for Hyperspectral Target Detection
    Zhang, Yuxiang
    Wu, Ke
    Du, Bo
    Hu, Xiangyun
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2019, 12 (07) : 2135 - 2147
  • [42] Deep learning-based common skin disease image classification
    Nath, Sudarshan
    Das Gupta, Suparna
    Saha, Soumyabrata
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (05) : 7483 - 7499
  • [43] Deep Learning-Based Segmentation in Classification of Alzheimer’s Disease
    P. R. Buvaneswari
    R. Gayathri
    [J]. Arabian Journal for Science and Engineering, 2021, 46 : 5373 - 5383
  • [44] Deep Learning-Based Segmentation in Classification of Alzheimer's Disease
    Buvaneswari, P. R.
    Gayathri, R.
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2021, 46 (06) : 5373 - 5383
  • [45] Manifold Regularized Multitask Feature Learning for Multimodality Disease Classification
    Jie, Biao
    Zhang, Daoqiang
    Cheng, Bo
    Shen, Dinggang
    [J]. HUMAN BRAIN MAPPING, 2015, 36 (02) : 489 - 507
  • [46] Machine learning-based intelligent localization technique for channel classification in massive MIMO
    Ghrabat, Fadhil
    Zhu, Huiling
    Wang, Jiangzhou
    [J]. Discover Internet of Things, 2024, 4 (01):
  • [47] Deep Learning-Based Regression and Classification for Automatic Landmark Localization in Medical Images
    Noothout, Julia M. H.
    de Vos, Bob D.
    Wolterink, Jelmer M.
    Postma, Elbrich M.
    Smeets, Paul A. M.
    Takx, Richard A. P.
    Leiner, Tim
    Viergever, Max A.
    Isgum, Ivana
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (12) : 4011 - 4022
  • [48] Deep learning-based classification of protein subcellular localization from immunohistochemistry images
    Hu, Jin-Xian
    Xu, Ying-Ying
    Yang-Yang
    Shen, Hong-Bin
    [J]. PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2017, : 599 - 604
  • [49] A vision and learning-based indoor localization and semantic mapping framework for facility operations and management
    Wei, Yujie
    Akinci, Burcu
    [J]. AUTOMATION IN CONSTRUCTION, 2019, 107
  • [50] Transfer Learning-Based Deep Learning Model for Corn Leaf Disease Classification
    An, Justin
    Zhang, Nian
    Mahmoud, Wagdy H.
    [J]. ADVANCES IN NEURAL NETWORKS-ISNN 2024, 2024, 14827 : 163 - 173