Weeds Classification with Deep Learning: An Investigation Using CNN, Vision Transformers, Pyramid Vision Transformers, and Ensemble Strategy

被引:1
|
作者
Rozendo, Guilherme Botazzo [1 ,4 ]
Roberto, Guilherme Freire [2 ]
Zanchetta do Nascimento, Marcelo [3 ]
Neves, Leandro Alves [4 ]
Lumini, Alessandra [1 ]
机构
[1] Univ Bologna, Dept Comp Sci & Engn DISI, Bologna, Italy
[2] Univ Porto FEUP, Fac Engn, Porto, Portugal
[3] Fed Univ Uberlandia UFU, Fac Comp Sci FACOM, Uberlandia, MG, Brazil
[4] Sao Paulo State Univ, Dept Comp Sci & Stat DCCE, Sao Paulo, Brazil
关键词
Weeds classification; CNN; Pyramid Vision Transformers; Vision transformers; Ensemble;
D O I
10.1007/978-3-031-49018-7_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weeds are a significant threat to agricultural production. Weed classification systems based on image analysis have offered innovative solutions to agricultural problems, with convolutional neural networks (CNNs) playing a pivotal role in this task. However, CNNs are limited in their ability to capture global relationships in images due to their localized convolutional operation. Vision Transformers (ViT) and Pyramid Vision Transformers (PVT) have emerged as viable solutions to overcome this limitation. Our study aims to determine the effectiveness of CNN, PVT, and ViT in classifying weeds in image datasets. We also examine if combining these methods in an ensemble can enhance classification performance. Our tests were conducted on significant agricultural datasets, including DeepWeeds and CottonWeedID15. The results indicate that a maximum of 3 methods in an ensemble, with only 15 epochs in training, can achieve high accuracy rates of up to 99.17%. This study demonstrates that high accuracies can be achieved with ease of implementation and only a few epochs.
引用
下载
收藏
页码:229 / 243
页数:15
相关论文
共 50 条
  • [21] Learning Expressive Prompting With Residuals for Vision Transformers
    Das, Rajshekhar
    Dukler, Yonatan
    Ravichandran, Avinash
    Swarninathan, Ashwin
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 3366 - 3377
  • [22] A survey of the vision transformers and their CNN-transformer based variants
    Khan, Asifullah
    Raufu, Zunaira
    Sohail, Anabia
    Khan, Abdul Rehman
    Asif, Hifsa
    Asif, Aqsa
    Farooq, Umair
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (SUPPL3) : S2917 - S2970
  • [23] A survey of the vision transformers and their CNN-transformer based variants
    Asifullah Khan
    Zunaira Rauf
    Anabia Sohail
    Abdul Rehman Khan
    Hifsa Asif
    Aqsa Asif
    Umair Farooq
    Artificial Intelligence Review, 2023, 56 : 2917 - 2970
  • [24] A Random Ensemble of Encrypted Vision Transformers for Adversarially Robust Defense
    Iijima, Ryota
    Shiota, Sayaka
    Kiya, Hitoshi
    IEEE ACCESS, 2024, 12 : 69206 - 69216
  • [25] Vision Transformers for Breast Cancer Histology Image Classification
    Baroni, Giulia L.
    Rasotto, Laura
    Roitero, Kevin
    Siraj, Ameer Hamza
    Della Mea, Vincenzo
    IMAGE ANALYSIS AND PROCESSING - ICIAP 2023 WORKSHOPS, PT II, 2024, 14366 : 15 - 26
  • [26] CellViT: Vision Transformers for precise cell segmentation and classification
    Hoerst, Fabian
    Rempe, Moritz
    Heine, Lukas
    Seibold, Constantin
    Keyl, Julius
    Baldini, Giulia
    Ugurel, Selma
    Siveke, Jens
    Gruenwald, Barbara
    Egger, Jan
    Kleesiek, Jens
    MEDICAL IMAGE ANALYSIS, 2024, 94
  • [27] Quantum Vision Transformers for Quark-Gluon Classification
    Comajoan Cara, Marcal
    Dahale, Gopal Ramesh
    Dong, Zhongtian
    Forestano, Roy T.
    Gleyzer, Sergei
    Justice, Daniel
    Kong, Kyoungchul
    Magorsch, Tom
    Matchev, Konstantin T.
    Matcheva, Katia
    Unlu, Eyup B.
    AXIOMS, 2024, 13 (05)
  • [28] The classification of the bladder cancer based on Vision Transformers (ViT)
    Ola S. Khedr
    Mohamed E. Wahed
    Al-Sayed R. Al-Attar
    E. A. Abdel-Rehim
    Scientific Reports, 13
  • [29] Image forgery classification and localization through vision transformers
    Digambar Pawar
    Raghavendra Gowda
    Krishna Chandra
    International Journal of Multimedia Information Retrieval, 2025, 14 (1)
  • [30] Vision Transformers Based Classification for Glaucomatous Eye Condition
    Wassel, Moustafa
    Hamdi, Ahmed M.
    Adly, Noha
    Torki, Marwan
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 5082 - 5088