Local Reversible Transformer for semantic segmentation of grape leaf diseases

Cited by: 8
Authors
Zhang, Xinxin [1, 2]
Li, Fei [1]
Jin, Haibin [1, 2]
Mu, Weisong [1, 2, 3]
Affiliations
[1] China Agr Univ, Coll Informat & Elect Engn, Beijing 100083, Peoples R China
[2] Minist Agr, Key Lab Viticulture & Enol, Beijing 100083, Peoples R China
[3] China Agr Univ, POB 121,17 Tsinghua East Rd, Beijing 100083, Peoples R China
Keywords
Local learning bottleneck; Reversible downsampling; Grape leaf diseases; Semantic segmentation;
DOI
10.1016/j.asoc.2023.110392
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Grape leaf disease segmentation is an essential basis for the precise diagnosis and identification of diseases. However, complex backgrounds make it difficult to segment small disease areas precisely. Existing Transformers mainly downsample the keys and values to improve model performance, neglecting that this downsampling is irreversible and loses contextual information. To this end, this paper proposes a novel Locally Reversible Transformer (LRT) segmentation model for grape leaf diseases in natural scene images, whose representation is learned in a reversible downsampling manner. Specifically, a Local Learning Bottleneck (LLB) is developed to enhance local perception and extract richer semantic information about grape leaf diseases via inverted residual convolution. Furthermore, motivated by wavelet theory, Reversible Attention (RA) is designed to replace the original downsampling operation by introducing the wavelet transform into multi-head attention, which addresses the difficulty of detecting and segmenting small disease targets against complex backgrounds. Extensive experiments demonstrate that LRT outperforms state-of-the-art models in segmentation performance with comparable GFLOPs and parameters. Moreover, LRT retains more multi-grain information and enlarges the receptive field to focus on small disease regions with complex backgrounds.
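The abstract's notion of reversible downsampling can be illustrated with a single-level 2D Haar wavelet transform, which halves the spatial resolution while keeping enough sub-bands to reconstruct the input exactly. The sketch below is a minimal pure-Python illustration under that assumption; the function names `haar_downsample` and `haar_upsample` are hypothetical, and the record does not state which wavelet or implementation the authors use.

```python
# Illustrative sketch only: a single-level 2D Haar transform as one example of
# reversible downsampling. Names are hypothetical, not taken from the LRT paper.

def haar_downsample(x):
    """Split an H x W grid (H and W even) into four H/2 x W/2 sub-bands."""
    ll, lh, hl, hh = [], [], [], []
    for i in range(0, len(x), 2):
        row_ll, row_lh, row_hl, row_hh = [], [], [], []
        for j in range(0, len(x[0]), 2):
            a, b = x[i][j], x[i][j + 1]          # top row of the 2x2 block
            c, d = x[i + 1][j], x[i + 1][j + 1]  # bottom row of the 2x2 block
            row_ll.append((a + b + c + d) / 2)   # low-pass (scaled block sum)
            row_lh.append((a - b + c - d) / 2)   # difference across columns
            row_hl.append((a + b - c - d) / 2)   # difference across rows
            row_hh.append((a - b - c + d) / 2)   # diagonal difference
        ll.append(row_ll)
        lh.append(row_lh)
        hl.append(row_hl)
        hh.append(row_hh)
    return ll, lh, hl, hh


def haar_upsample(ll, lh, hl, hh):
    """Invert haar_downsample and rebuild the full-resolution grid."""
    h2, w2 = len(ll), len(ll[0])
    x = [[0.0] * (2 * w2) for _ in range(2 * h2)]
    for i in range(h2):
        for j in range(w2):
            s, p, q, r = ll[i][j], lh[i][j], hl[i][j], hh[i][j]
            x[2 * i][2 * j] = (s + p + q + r) / 2
            x[2 * i][2 * j + 1] = (s - p + q - r) / 2
            x[2 * i + 1][2 * j] = (s + p - q - r) / 2
            x[2 * i + 1][2 * j + 1] = (s - p - q + r) / 2
    return x


# Toy 4 x 4 "feature map": downsample to four 2 x 2 sub-bands, then restore it.
img = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]]
bands = haar_downsample(img)
restored = haar_upsample(*bands)
```

By contrast, plain average pooling keeps only a quantity like the low-pass band and discards the three difference bands, so the original grid cannot be recovered from its output.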
Pages: 13
Related papers
50 in total
  • [21] CoT: Contourlet Transformer for Hierarchical Semantic Segmentation
    Shao, Yilin
    Sun, Long
    Jiao, Licheng
    Liu, Xu
    Liu, Fang
    Li, Lingling
    Yang, Shuyuan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 132 - 146
  • [22] MarsFormer: Martian Rock Semantic Segmentation With Transformer
    Xiong, Yonggang
    Xiao, Xueming
    Yao, Meibao
    Liu, Haiqiang
    Yang, Hong
    Fu, Yuegang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [23] Channel selection and local attention transformer model for semantic segmentation on UAV remote sensing scene
    Liu, Da
    Long, Hao
    Liu, Zhenbao
    IET IMAGE PROCESSING, 2025, 19 (01)
  • [24] Swin-Conv-Dspp and Global Local Transformer for Remote Sensing Image Semantic Segmentation
    Mo, Youda
    Li, Huihui
    Xiao, Xiangling
    Zhao, Huimin
    Liu, Xiaoyong
    Zhan, Jin
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 5284 - 5296
  • [25] Laformer: Vision Transformer for Panoramic Image Semantic Segmentation
    Yuan, Zheng
    Wang, Junhua
    Lv, Yuxin
    Wang, Ding
    Fang, Yi
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 1792 - 1796
  • [26] TransRSS: Transformer-based Radar Semantic Segmentation
    Zou, Hao
    Xie, Zhen
    Ou, Jiarong
    Gao, Yutao
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 6965 - 6972
  • [27] HSPFormer: Hierarchical Spatial Perception Transformer for Semantic Segmentation
    Chen, Siyu
    Han, Ting
    Zhang, Changshe
    Su, Jinhe
    Wang, Ruisheng
    Chen, Yiping
    Wang, Zongyue
    Cai, Guorong
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025,
  • [28] Video Semantic Segmentation via Sparse Temporal Transformer
    Li, Jiangtong
    Wang, Wentao
    Chen, Junjie
    Niu, Li
    Si, Jianlou
    Qian, Chen
    Zhang, Liqing
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 59 - 68
  • [29] Class-Prompting Transformer for Incremental Semantic Segmentation
    Song, Zichen
    Shi, Zhaofeng
    Shang, Chao
    Meng, Fanman
    Xu, Linfeng
    IEEE ACCESS, 2023, 11 : 100154 - 100164
  • [30] TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation
    Zhang, Wenqiang
    Huang, Zilong
    Luo, Guozhong
    Chen, Tao
    Wang, Xinggang
    Liu, Wenyu
    Yu, Gang
    Shen, Chunhua
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 12073 - 12083