Few-Shot Aerial Image Semantic Segmentation Leveraging Pyramid Correlation Fusion

Cited by: 1
Authors
Ao, Wei [1]
Zheng, Shunyi [1]
Meng, Yan [2]
Gao, Zhi [1]
Affiliations
[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430079, Peoples R China
[2] Hubei Univ, Sch Artificial Intelligence, Wuhan 430062, Peoples R China
Keywords
Distance correlation; few-shot semantic segmentation (FSS); meta-learning; remote-sensing image processing; semantic correspondence; DEEP; NETWORK; CLASSIFICATION; AGGREGATION
DOI
10.1109/TGRS.2023.3328339
Chinese Library Classification (CLC) number
P3 [Geophysics]; P59 [Geochemistry]
Discipline classification code
0708; 070902
Abstract
Few-shot semantic segmentation (FSS) has gained significant attention because it can segment novel objects using only a limited number of labeled samples, thereby addressing the overfitting caused by a lack of training data. Although this technique is widely studied in computer vision, few methods target remote-sensing images. Prevalent FSS methods achieve remarkable results on natural images, but they are difficult to apply to remote-sensing image processing because they rarely account for the large scale and resolution differences among remote-sensing images. Consequently, they struggle to obtain correct semantic guidance from a few annotated remote-sensing images. To tackle these problems, this article proposes the pyramid correlation fusion network (PCFNet), which improves the ability to mine helpful information by calculating multiscale pixel-wise semantic correspondence. In particular, the dual-distance correlation (DDC) module is designed to simultaneously compute the cosine similarity and the Euclidean distance between query features and support features, producing adequate guidance information to determine the category of each pixel. Moreover, to improve segmentation accuracy for small objects, the scale-aware cross-entropy loss (SACELoss) is introduced to dynamically assign loss weights according to the actual sizes of objects, so that smaller objects are assigned larger weights and thus receive more attention during training. Comprehensive experiments on both the iSAID-5^i and DLRSD-5^i datasets demonstrate that the method outperforms state-of-the-art FSS methods. The code is available at https://github.com/TinyAway/PCFNet.
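The two ideas named in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation (see the repository above for that). It assumes features are given as flat lists of per-pixel vectors, and the function names, the weighting formula `1 + (1 - object_area / image_area)`, and the binary form of the loss are all hypothetical choices made only for illustration; the abstract does not specify them.

```python
import math


def cosine_similarity(u, v, eps=1e-8):
    """Cosine of the angle between two feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v + eps)


def euclidean_distance(u, v):
    """Straight-line distance between two feature vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def dual_distance_correlation(query_feats, support_feats):
    """Pair every query pixel with every support pixel.

    Returns a nested list where entry [i][j] is the tuple
    (cosine similarity, Euclidean distance) between query pixel i
    and support pixel j.
    """
    return [
        [(cosine_similarity(q, s), euclidean_distance(q, s)) for s in support_feats]
        for q in query_feats
    ]


def scale_aware_weight(object_area, image_area):
    """Assumed weighting rule: smaller objects get weights closer to 2,
    objects filling the image get a weight of 1."""
    return 1.0 + (1.0 - object_area / image_area)


def scale_aware_cross_entropy(probs, labels, weights, eps=1e-12):
    """Per-pixel binary cross-entropy, scaled by a per-pixel weight
    and averaged over all pixels."""
    total = 0.0
    for p, y, w in zip(probs, labels, weights):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total += -w * (y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / len(probs)


if __name__ == "__main__":
    query = [[1.0, 0.0], [0.5, 0.5]]
    support = [[1.0, 0.0], [0.0, 1.0]]
    print(dual_distance_correlation(query, support))

    # A 100-pixel object in a 100x100 image gets a larger weight
    # than a 5000-pixel object.
    print(scale_aware_weight(100, 10_000), scale_aware_weight(5_000, 10_000))
```

In a pyramid setting, the same pairwise computation would be repeated on feature maps at several resolutions and the resulting correlation maps fused; that step is omitted here.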
Pages: 1-12
Number of pages: 12