A Multitask Learning-Based Vision Transformer for Plant Disease Localization and Classification

被引:0
|
作者
Hemalatha, S. [1 ]
Jayachandran, Jai Jaganath Babu [2 ]
机构
[1] Rajalakshmi Engn Coll, Dept Artificial Intelligence & Machine Learning, Chennai 602105, India
[2] Chennai Inst Technol, Dept Biomed Engn, Chennai 600069, India
关键词
Plant disease; Classification; Localization; Vision transformer; Multi-task learning;
D O I
10.1007/s44196-024-00597-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Plant disease detection is a critical task in agriculture, essential for ensuring crop health and productivity. Traditional methods in this context are often labor-intensive and prone to errors, highlighting the need for automated solutions. While computer vision-based solutions have been successfully deployed in recent years for plant disease identification and localization tasks, these often operate independently, leading to suboptimal performance. It is essential to develop an integrated solution combining these two tasks for improved efficiency and accuracy. This research proposes the innovative Plant Disease Localization and Classification model based on Vision Transformer (PDLC-ViT), which integrates co-scale, co-attention, and cross-attention mechanisms and a ViT, within a Multi-Task Learning (MTL) framework. The model was trained and evaluated on the Plant Village dataset. Key hyperparameters, including learning rate, batch size, dropout ratio, and regularization factor, were optimized through a thorough grid search. Early stopping based on validation loss was employed to prevent overfitting. The PDLC-ViT model demonstrated significant improvements in plant disease localization and classification tasks. The integration of co-scale, co-attention, and cross-attention mechanisms allowed the model to capture multi-scale dependencies and enhance feature learning, leading to superior performance compared to existing models. The PDLC-ViT model evaluated on two public datasets achieved an accuracy of 99.97%, a Mean Average Precision (MAP) of 99.18%, and a Mean Average Recall (MAR) of 99.11%. These results underscore the model's exceptional precision and recall, highlighting its robustness and reliability in detecting and classifying plant diseases. The PDLC-ViT model sets a new benchmark in plant disease detection, offering a reliable and advanced tool for agricultural applications. Its ability to integrate localization and classification tasks within an MTL framework promotes timely and accurate disease management, contributing to sustainable agriculture and food security.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] A deep learning based approach for automated plant disease classification using vision transformer
    Borhani, Yasamin
    Khoramdel, Javad
    Najafi, Esmaeil
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [2] A deep learning based approach for automated plant disease classification using vision transformer
    Yasamin Borhani
    Javad Khoramdel
    Esmaeil Najafi
    [J]. Scientific Reports, 12
  • [3] Vision Transformer Adapters for Generalizable Multitask Learning
    Bhattacharjee, Deblina
    Susstrunk, Sabine
    Salzmann, Mathieu
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 18969 - 18980
  • [4] Satellite Images Analysis and Classification using Deep Learning-based Vision Transformer Model
    Adegun, Adekanmi Adeyinka
    Viriri, Serestina
    [J]. 2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 1275 - 1279
  • [5] A multitask classification framework based on vision transformer for predicting molecular expressions of glioma
    Xu, Qian
    Xu, Qian Qian
    Shi, Nian
    Dong, Li Na
    Zhu, Hong
    Xu, Kai
    [J]. EUROPEAN JOURNAL OF RADIOLOGY, 2022, 157
  • [6] A Multitask Learning-Based Model for Gas Classification and Concentration Prediction
    Dai, Yang
    Xiong, Yin
    Lin, He
    Li, Yunlong
    Feng, Yunhao
    Luo, Wan
    Zhong, Xiaojiang
    [J]. IEEE SENSORS JOURNAL, 2024, 24 (07) : 11639 - 11650
  • [7] End-to-End Multitask Learning With Vision Transformer
    Tian, Yingjie
    Bai, Kunlong
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (07) : 9579 - 9590
  • [8] A Deep Features Extraction Model Based on the Transfer Learning Model and Vision Transformer "TLMViT" for Plant Disease Classification
    Tabbakh, Amer
    Barpanda, Soubhagya Sankar
    [J]. IEEE ACCESS, 2023, 11 : 45377 - 45392
  • [9] Deep learning-based plant classification and crop disease classification by thermal camera
    Batchuluun, Ganbayar
    Nam, Se Hyun
    Park, Kang Ryoung
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (10) : 10474 - 10486
  • [10] Systematic study on deep learning-based plant disease detection or classification
    Sunil, C. K.
    Jaidhar, C. D.
    Patil, Nagamma
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (12) : 14955 - 15052