DHT: Deformable Hybrid Transformer for Aerial Image Segmentation

被引:7
|
作者
Zhang, Yan [1 ,2 ]
Gao, Xiyuan [1 ]
Duan, Qingyan [1 ]
Yuan, Lin [1 ]
Gao, Xinbo [1 ]
机构
[1] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Chongqing 400065, Peoples R China
[2] Xidian Univ, State Key Lab Integrated Serv Networks, Xian 710071, Peoples R China
基金
中国国家自然科学基金;
关键词
Deformable models; Image segmentation; Computational modeling; Buildings; Object segmentation; Transformers; Sensors; Aerial image segmentation; deep learning; remote sensing; self-attention; transformer;
D O I
10.1109/LGRS.2022.3222916
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Due to the strong ability to model global information, the transformer-based methods have shown remarkable improvements in the image segmentation tasks. However, the self-attention mechanism in the transformer is computationally expensive and relies on pretrained parameters. Moreover, the transformer method is weak in modeling local information, which is unfavorable for accurately segmenting objects from high-resolution aerial images. To this end, an efficient deformable orientational self-attention (DoA) is proposed to simultaneously extract the global information and the local information. Besides, for parameter efficiency, we design a depthwise channel self-attention (DcA) to model the contextual information among channels. Combining with the DoA and DcA, we propose the deformable hybrid transformer (DHT) to perform high-quality object segmentation on aerial images. Experiments on the International Society for Photogrammetry and Remote Sensing (ISPRS) Potsdam and Wuhan University (WHU) building datasets illustrate that the proposed DHT can not only achieve the state-of-the-art (SOTA) results but also markedly reduce the dependence of the transformer on pretrained parameters.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Hybrid Transformer and Convolution for Medical Image Segmentation
    Wang, Fan
    Wang, Bo
    [J]. 2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 156 - 159
  • [2] AerialFormer: Multi-Resolution Transformer for Aerial Image Segmentation
    Hanyu, Taisei
    Yamazaki, Kashu
    Tran, Minh
    McCann, Roy A.
    Liao, Haitao
    Rainwater, Chase
    Adkins, Meredith
    Cothren, Jackson
    Le, Ngan
    [J]. REMOTE SENSING, 2024, 16 (16)
  • [3] Efficient Depth Fusion Transformer for Aerial Image Semantic Segmentation
    Yan, Li
    Huang, Jianming
    Xie, Hong
    Wei, Pengcheng
    Gao, Zhao
    [J]. REMOTE SENSING, 2022, 14 (05)
  • [4] A combined deformable model and medical transformer algorithm for medical image segmentation
    Tang, Zhixian
    Duan, Jintao
    Sun, Yanming
    Zeng, Yanan
    Zhang, Yile
    Yao, Xufeng
    [J]. MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2023, 61 (01) : 129 - 137
  • [5] A combined deformable model and medical transformer algorithm for medical image segmentation
    Zhixian Tang
    Jintao Duan
    Yanming Sun
    Yanan Zeng
    Yile Zhang
    Xufeng Yao
    [J]. Medical & Biological Engineering & Computing, 2023, 61 : 129 - 137
  • [6] UTNet: A Hybrid Transformer Architecture for Medical Image Segmentation
    Gao, Yunhe
    Zhou, Mu
    Metaxas, Dimitris N.
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 : 61 - 71
  • [7] UDT: U-shaped deformable transformer for subarachnoid haemorrhage image segmentation
    Xie, Wei
    Jin, Lianghao
    Hua, Shiqi
    Sun, Hao
    Sun, Bo
    Tu, Zhigang
    Liu, Jun
    [J]. CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2024, 9 (03) : 756 - 768
  • [8] SEGTRANSVAE: HYBRID CNN - TRANSFORMER WITH REGULARIZATION FOR MEDICAL IMAGE SEGMENTATION
    Quan-Dung Pham
    Hai Nguyen-Truong
    Nam Nguyen Phuong
    Nguyen, Khoa N. A.
    Nguyen, Chanh D. T.
    Bui, Trung
    Truong, Steven Q. H.
    [J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (IEEE ISBI 2022), 2022,
  • [9] DPNET: END-TO-END AERIAL IMAGE SEGMENTATION VIA DEFORMABLE POINT NETWORK
    Guo, Yiyou
    Qin, Zheyun
    Liu, Yikun
    Yang, Yongtai
    Lu, Xiankai
    Xie, Huan
    Tong, Xiaohua
    [J]. 2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 3484 - 3487
  • [10] Slimmable transformer with hybrid axial-attention for medical image segmentation
    Hu, Yiyue
    Mu, Nan
    Liu, Lei
    Zhang, Lei
    Jiang, Jingfeng
    Li, Xiaoning
    [J]. Computers in Biology and Medicine, 2024, 173