Weeds Classification with Deep Learning: An Investigation Using CNN, Vision Transformers, Pyramid Vision Transformers, and Ensemble Strategy

被引：1

作者：

Rozendo, Guilherme Botazzo ^{[1
,4
]}

Roberto, Guilherme Freire ^{[2
]}

Zanchetta do Nascimento, Marcelo ^{[3
]}

Neves, Leandro Alves ^{[4
]}

Lumini, Alessandra ^{[1
]}

机构：

[1] Univ Bologna, Dept Comp Sci & Engn DISI, Bologna, Italy

[2] Univ Porto FEUP, Fac Engn, Porto, Portugal

[3] Fed Univ Uberlandia UFU, Fac Comp Sci FACOM, Uberlandia, MG, Brazil

[4] Sao Paulo State Univ, Dept Comp Sci & Stat DCCE, Sao Paulo, Brazil

来源：

PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2023, PT I | 2024年 / 14469卷

关键词：

Weeds classification; CNN; Pyramid Vision Transformers; Vision transformers; Ensemble;

D O I：

10.1007/978-3-031-49018-7_17

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Weeds are a significant threat to agricultural production. Weed classification systems based on image analysis have offered innovative solutions to agricultural problems, with convolutional neural networks (CNNs) playing a pivotal role in this task. However, CNNs are limited in their ability to capture global relationships in images due to their localized convolutional operation. Vision Transformers (ViT) and Pyramid Vision Transformers (PVT) have emerged as viable solutions to overcome this limitation. Our study aims to determine the effectiveness of CNN, PVT, and ViT in classifying weeds in image datasets. We also examine if combining these methods in an ensemble can enhance classification performance. Our tests were conducted on significant agricultural datasets, including DeepWeeds and CottonWeedID15. The results indicate that a maximum of 3 methods in an ensemble, with only 15 epochs in training, can achieve high accuracy rates of up to 99.17%. This study demonstrates that high accuracies can be achieved with ease of implementation and only a few epochs.

引用

下载

页码：229 / 243

页数：15

共 50 条

[21] Learning Expressive Prompting With Residuals for Vision Transformers
Das, Rajshekhar
Dukler, Yonatan
Ravichandran, Avinash
Swarninathan, Ashwin
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 3366 - 3377
[22] A survey of the vision transformers and their CNN-transformer based variants
Khan, Asifullah
Raufu, Zunaira
Sohail, Anabia
Khan, Abdul Rehman
Asif, Hifsa
Asif, Aqsa
Farooq, Umair
ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (SUPPL3) : S2917 - S2970
[23] A survey of the vision transformers and their CNN-transformer based variants
Asifullah Khan
Zunaira Rauf
Anabia Sohail
Abdul Rehman Khan
Hifsa Asif
Aqsa Asif
Umair Farooq
Artificial Intelligence Review, 2023, 56 : 2917 - 2970
[24] A Random Ensemble of Encrypted Vision Transformers for Adversarially Robust Defense
Iijima, Ryota
Shiota, Sayaka
Kiya, Hitoshi
IEEE ACCESS, 2024, 12 : 69206 - 69216
[25] Vision Transformers for Breast Cancer Histology Image Classification
Baroni, Giulia L.
Rasotto, Laura
Roitero, Kevin
Siraj, Ameer Hamza
Della Mea, Vincenzo
IMAGE ANALYSIS AND PROCESSING - ICIAP 2023 WORKSHOPS, PT II, 2024, 14366 : 15 - 26
[26] CellViT: Vision Transformers for precise cell segmentation and classification
Hoerst, Fabian
Rempe, Moritz
Heine, Lukas
Seibold, Constantin
Keyl, Julius
Baldini, Giulia
Ugurel, Selma
Siveke, Jens
Gruenwald, Barbara
Egger, Jan
Kleesiek, Jens
MEDICAL IMAGE ANALYSIS, 2024, 94
[27] Quantum Vision Transformers for Quark-Gluon Classification
Comajoan Cara, Marcal
Dahale, Gopal Ramesh
Dong, Zhongtian
Forestano, Roy T.
Gleyzer, Sergei
Justice, Daniel
Kong, Kyoungchul
Magorsch, Tom
Matchev, Konstantin T.
Matcheva, Katia
Unlu, Eyup B.
AXIOMS, 2024, 13 (05)
[28] The classification of the bladder cancer based on Vision Transformers (ViT)
Ola S. Khedr
Mohamed E. Wahed
Al-Sayed R. Al-Attar
E. A. Abdel-Rehim
Scientific Reports, 13
[29] Image forgery classification and localization through vision transformers
Digambar Pawar
Raghavendra Gowda
Krishna Chandra
International Journal of Multimedia Information Retrieval, 2025, 14 (1)
[30] Vision Transformers Based Classification for Glaucomatous Eye Condition
Wassel, Moustafa
Hamdi, Ahmed M.
Adly, Noha
Torki, Marwan
2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 5082 - 5088

← 1 2 3 4 5 →