Weeds Classification with Deep Learning: An Investigation Using CNN, Vision Transformers, Pyramid Vision Transformers, and Ensemble Strategy

被引:1
|
作者
Rozendo, Guilherme Botazzo [1 ,4 ]
Roberto, Guilherme Freire [2 ]
Zanchetta do Nascimento, Marcelo [3 ]
Neves, Leandro Alves [4 ]
Lumini, Alessandra [1 ]
机构
[1] Univ Bologna, Dept Comp Sci & Engn DISI, Bologna, Italy
[2] Univ Porto FEUP, Fac Engn, Porto, Portugal
[3] Fed Univ Uberlandia UFU, Fac Comp Sci FACOM, Uberlandia, MG, Brazil
[4] Sao Paulo State Univ, Dept Comp Sci & Stat DCCE, Sao Paulo, Brazil
关键词
Weeds classification; CNN; Pyramid Vision Transformers; Vision transformers; Ensemble;
D O I
10.1007/978-3-031-49018-7_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weeds are a significant threat to agricultural production. Weed classification systems based on image analysis have offered innovative solutions to agricultural problems, with convolutional neural networks (CNNs) playing a pivotal role in this task. However, CNNs are limited in their ability to capture global relationships in images due to their localized convolutional operation. Vision Transformers (ViT) and Pyramid Vision Transformers (PVT) have emerged as viable solutions to overcome this limitation. Our study aims to determine the effectiveness of CNN, PVT, and ViT in classifying weeds in image datasets. We also examine if combining these methods in an ensemble can enhance classification performance. Our tests were conducted on significant agricultural datasets, including DeepWeeds and CottonWeedID15. The results indicate that a maximum of 3 methods in an ensemble, with only 15 epochs in training, can achieve high accuracy rates of up to 99.17%. This study demonstrates that high accuracies can be achieved with ease of implementation and only a few epochs.
引用
下载
收藏
页码:229 / 243
页数:15
相关论文
共 50 条
  • [41] GRAPEVINE VARIETIES IDENTIFICATION USING VISION TRANSFORMERS
    Carneiro, Gabriel Antonio
    Padua, Luis
    Peres, Emanuel
    Morais, Raul
    Sousa, Joaquim J.
    Cunha, Antonio
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 5866 - 5869
  • [42] From modern CNNs to vision transformers: Assessing the performance, robustness, and classification strategies of deep learning models in histopathology
    Springenberg, Maximilian
    Frommholz, Annika
    Wenzel, Markus
    Weicken, Eva
    Ma, Jackie
    Strodthoff, Nils
    MEDICAL IMAGE ANALYSIS, 2023, 87
  • [43] An arabic visual speech recognition framework with CNN and vision transformers for lipreading
    Baaloul, Ali
    Benblidia, Nadjia
    Reguieg, Fatma Zohra
    Bouakkaz, Mustapha
    Felouat, Hisham
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (27) : 69989 - 70023
  • [44] Hyperbolic Vision Transformers: Combining Improvements in Metric Learning
    Ermolov, Aleksandr
    Mirvakhabova, Leyla
    Khrulkov, Valentin
    Sebe, Nicu
    Oseledets, Ivan
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 7399 - 7409
  • [45] Automated Progressive Learning for Efficient Training of Vision Transformers
    Li, Changlin
    Zhuang, Bohan
    Wang, Guangrun
    Liang, Xiaodan
    Chang, Xiaojun
    Yang, Yi
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 12476 - 12486
  • [46] Preserving Locality in Vision Transformers for Class Incremental Learning
    Zheng, Bowen
    Zhou, Wei
    Ye, Han-Jia
    Zhan, De-Chuan
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1157 - 1162
  • [47] Learning of Generic Vision Features using Deep CNN
    Nithin, Kanishka D.
    Sivakumar, Bagavathi P.
    2015 FIFTH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING AND COMMUNICATIONS (ICACC), 2015, : 54 - 57
  • [48] On the Effectiveness of LayerNorm Tuning for Continual Learning in Vision Transformers
    De Min, Thomas
    Mancini, Massimiliano
    Alahari, Karteek
    Alameda-Pineda, Xavier
    Ricci, Elisa
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 3577 - 3586
  • [49] Automatic Detection and Classification of Cardiovascular Disorders Using Phonocardiogram and Convolutional Vision Transformers
    Abbas, Qaisar
    Hussain, Ayyaz
    Baig, Abdul Rauf
    DIAGNOSTICS, 2022, 12 (12)
  • [50] GA-based weighted ensemble learning for multi-label aerial image classification using convolutional neural networks and vision transformers
    Tseng, Ming-Hseng
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2023, 4 (04):