SpemNet: A Cotton Disease and Pest Identification Method Based on Efficient Multi-Scale Attention and Stacking Patch Embedding

被引:1
|
作者
Qiu, Keyuan [1 ]
Zhang, Yingjie [1 ]
Ren, Zekai [1 ]
Li, Meng [2 ]
Wang, Qian [3 ]
Feng, Yiqiang [4 ,5 ]
Chen, Feng [1 ]
机构
[1] Shihezi Univ, Coll Informat Sci & Technol, Shihezi 832003, Peoples R China
[2] Univ York, Dept Comp Sci, Heslington YO10 5DD, England
[3] Univ Durham, Dept Comp Sci, Durham DH1 3LE, England
[4] Cent Univ Finance & Econ, Sch Informat, Beijing 100081, Peoples R China
[5] Univ Toronto, Inst Hlth Policy Management & Evaluat, Toronto, ON M5S 2E8, Canada
关键词
cotton pest recognition; image classification; attention mechanism; transformer; efficient multi-scale attention; feature fusion; deep learning; NETWORK;
D O I
10.3390/insects15090667
中图分类号
Q96 [昆虫学];
学科分类号
摘要
Simple Summary Cotton is a crucial economic crop, but it is often threatened by various pests and diseases during its growth, significantly impacting its yield and quality. Earlier image classification methods often suffer from low accuracy and struggle to perform effectively in complex real-world environments. This paper proposes a novel image classification network named SpemNet, specifically designed for cotton pest and disease recognition. By introducing the Efficient Multi-Scale Attention (EMA) module and the Stacking Patch Embedding (SPE) module, the network enhances the ability to learn local features and integrate multi-scale information, thereby significantly improving the accuracy and efficiency of cotton pest and disease recognition. Extensive experiments conducted on the publicly available CottonInsect and IP102 datasets, as well as a self-collected cotton leaf disease dataset, demonstrate that SpemNet exhibits significant advantages in key metrics such as precision, recall, and F1 score, confirming its effectiveness and superiority in the task of cotton pest and disease recognition.Abstract We propose a cotton pest and disease recognition method, SpemNet, based on efficient multi-scale attention and stacking patch embedding. By introducing the SPE module and the EMA module, we successfully solve the problems of local feature learning difficulty and insufficient multi-scale feature integration in the traditional Vision Transformer model, which significantly improve the performance and efficiency of the model. In our experiments, we comprehensively validate the SpemNet model on the CottonInsect dataset, and the results show that SpemNet performs well in the cotton pest recognition task, with significant effectiveness and superiority. The SpemNet model excels in key metrics such as precision and F1 score, demonstrating significant potential and superiority in the cotton pest and disease recognition task. This study provides an efficient and reliable solution in the field of cotton pest and disease identification, which is of great theoretical and applied significance.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] A Study on Maize Leaf Pest and Disease Detection Model Based on Attention and Multi-Scale Features
    Kang, Jie
    Zhang, Wanhu
    Xia, Yu
    Liu, Wenbo
    APPLIED SCIENCES-BASEL, 2023, 13 (18):
  • [2] A CNNA-Based Lightweight Multi-Scale Tomato Pest and Disease Classification Method
    Xu, Yanlei
    Gao, Zhiyuan
    Zhai, Yuting
    Wang, Qi
    Gao, Zongmei
    Xu, Zhao
    Zhou, Yang
    SUSTAINABILITY, 2023, 15 (11)
  • [3] AAANE: Attention-Based Adversarial Autoencoder for Multi-scale Network Embedding
    Sang, Lei
    Xu, Min
    Qian, Shengsheng
    Wu, Xindong
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2019, PT III, 2019, 11441 : 3 - 14
  • [4] Multi-Scale and Multi-Factor ViT Attention Model for Classification and Detection of Pest and Disease in Agriculture
    Xie, Mingyao
    Ye, Ning
    APPLIED SCIENCES-BASEL, 2024, 14 (13):
  • [5] Research on Multi-Scale Pest Detection and Identification Method in Granary Based on Improved YOLOv5
    Chu, Jinyu
    Li, Yane
    Feng, Hailin
    Weng, Xiang
    Ruan, Yaoping
    AGRICULTURE-BASEL, 2023, 13 (02):
  • [6] An efficient multi-scale channel attention network for person re-identification
    Qian Luo
    Jie Shao
    Wanli Dang
    Long Geng
    Huaiyu Zheng
    Chang Liu
    The Visual Computer, 2024, 40 : 3515 - 3527
  • [7] An Efficient Multi-Scale Focusing Attention Network for Person Re-Identification
    Huang, Wei
    Li, Yongying
    Zhang, Kunlin
    Hou, Xiaoyu
    Xu, Jihui
    Su, Ruidan
    Xu, Huaiyu
    APPLIED SCIENCES-BASEL, 2021, 11 (05): : 1 - 16
  • [8] An efficient multi-scale channel attention network for person re-identification
    Luo, Qian
    Shao, Jie
    Dang, Wanli
    Geng, Long
    Zheng, Huaiyu
    Liu, Chang
    VISUAL COMPUTER, 2024, 40 (05): : 3515 - 3527
  • [9] Multi-scale hand segmentation method based on attention mechanism
    Zhou, Wenqing
    Dai, Sumin
    Wang, Yangpin
    Wang, Wenrun
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2024, 39 (11) : 1506 - 1518
  • [10] MULTI-SCALE SPEAKER EMBEDDING-BASED GRAPH ATTENTION NETWORKS FOR SPEAKER DIARISATION
    Kwon, Youngki
    Heo, Hee-Soo
    Jung, Jee-Weon
    Kim, You Jin
    Lee, Bong-Jin
    Chung, Joon Son
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8367 - 8371