Local Reversible Transformer for semantic segmentation of grape leaf diseases

Cited by: 8
Authors
Zhang, Xinxin [1 ,2 ]
Li, Fei [1 ]
Jin, Haibin [1 ,2 ]
Mu, Weisong [1 ,2 ,3 ]
Affiliations
[1] China Agr Univ, Coll Informat & Elect Engn, Beijing 100083, Peoples R China
[2] Minist Agr, Key Lab Viticulture & Enol, Beijing 100083, Peoples R China
[3] China Agr Univ, POB 121,17 Tsinghua East Rd, Beijing 100083, Peoples R China
Keywords
Local learning bottleneck; Reversible downsampling; Grape leaf diseases; Semantic segmentation
DOI
10.1016/j.asoc.2023.110392
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Segmenting grape leaf diseases is an essential basis for precise disease diagnosis and identification. However, complex backgrounds make it difficult to segment small disease areas precisely. Existing Transformers mainly rely on key and value downsampling to improve model performance, while neglecting that this downsampling is irreversible and loses contextual information. To this end, this paper proposes a novel Locally Reversible Transformer (LRT) segmentation model for grape leaf diseases in natural scene images, whose representation is learned in a reversible downsampling manner. Specifically, a Local Learning Bottleneck (LLB) is developed to enhance local perception and extract richer semantic information about grape leaf diseases via inverted residual convolution. Furthermore, motivated by wavelet theory, Reversible Attention (RA) is designed to replace the original downsampling operation by introducing the wavelet transform into multi-head attention, addressing the difficulty of detecting and segmenting small disease targets against complex backgrounds. Extensive experiments demonstrate that LRT outperforms state-of-the-art models in segmentation performance with comparable GFLOPs and parameters. Moreover, LRT retains more multi-grained information and enlarges the receptive field to focus on small disease regions with complex backgrounds.
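The contrast the abstract draws between irreversible and reversible downsampling can be illustrated with a minimal NumPy sketch. This is not the authors' RA module: the abstract does not say which wavelet is used, so the Haar wavelet is an assumption here, and the function names `haar_dwt2` and `haar_idwt2` are hypothetical helpers introduced only for illustration. The sketch shows that a one-level Haar transform halves the spatial resolution while remaining exactly invertible, whereas average pooling at the same rate cannot be undone.

```python
import numpy as np


def haar_dwt2(x):
    """One-level 2D Haar transform of an (H, W) array with even H and W.

    Returns four (H/2, W/2) subbands: one low-frequency approximation
    and three detail subbands. Haar is an assumed choice of wavelet.
    """
    a = x[0::2, 0::2]  # top-left pixel of each 2x2 block
    b = x[0::2, 1::2]  # top-right
    c = x[1::2, 0::2]  # bottom-left
    d = x[1::2, 1::2]  # bottom-right
    approx = (a + b + c + d) / 2.0
    detail_1 = (a - b + c - d) / 2.0
    detail_2 = (a + b - c - d) / 2.0
    detail_3 = (a - b - c + d) / 2.0
    return approx, detail_1, detail_2, detail_3


def haar_idwt2(approx, detail_1, detail_2, detail_3):
    """Inverse of haar_dwt2: rebuild the full-resolution array."""
    h, w = approx.shape
    x = np.empty((2 * h, 2 * w), dtype=approx.dtype)
    x[0::2, 0::2] = (approx + detail_1 + detail_2 + detail_3) / 2.0
    x[0::2, 1::2] = (approx - detail_1 + detail_2 - detail_3) / 2.0
    x[1::2, 0::2] = (approx + detail_1 - detail_2 - detail_3) / 2.0
    x[1::2, 1::2] = (approx - detail_1 - detail_2 + detail_3) / 2.0
    return x


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feature_map = rng.standard_normal((8, 8))

    # Reversible path: downsample into four subbands, then invert.
    subbands = haar_dwt2(feature_map)
    restored = haar_idwt2(*subbands)
    print("wavelet round trip exact:", np.allclose(restored, feature_map))

    # Irreversible path: 2x2 average pooling keeps only the approximation
    # (up to scale), so upsampling cannot recover the discarded detail.
    pooled = subbands[0] / 2.0
    upsampled = np.repeat(np.repeat(pooled, 2, axis=0), 2, axis=1)
    print("pooling round trip exact:", np.allclose(upsampled, feature_map))
```

In an attention layer, the four subbands would presumably be stacked along the channel dimension so that keys and values are computed on the lower-resolution map without discarding detail; how LRT actually does this is described in the paper itself, not in this record.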
Pages: 13