Coarse-to-fine semantic segmentation of satellite images

被引:2
|
作者
Chen, Hao [1 ]
Yang, Wen [2 ]
Liu, Li [3 ]
Xia, Gui-Song [1 ,4 ,5 ]
机构
[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430079, Peoples R China
[2] Wuhan Univ, Sch Elect Informat, Wuhan 430079, Peoples R China
[3] Natl Univ Def Technol, Coll Elect Sci & Technol, Changsha 410073, Peoples R China
[4] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Sch Comp Sci, Wuhan 430072, Peoples R China
[5] Wuhan Univ, Inst Artificial Intelligence, Wuhan 430072, Peoples R China
基金
中国国家自然科学基金;
关键词
Local-softmax; Multi-prototype learning; Semantic segmentation; COVER; AREA;
D O I
10.1016/j.isprsjprs.2024.07.028
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Training deep neural networks for semantic segmentation of aerial images relies heavily on obtaining a large number of precise pixel-level annotations, which can cause significant annotation expenses. Given the fact that acquiring fine-class annotations is considerably more challenging than obtaining coarse-class annotations, we present a novel semi-supervised learning framework, which utilizes high spatial resolution images annotated with coarse-class labels alongside a very small set of fine-grained annotated images as the training set, thereby achieving classification results that are refined in both spatial resolution and categorical granularity. Specifically, this framework adopts Mix Transformer (MiT) as the backbone architecture to accommodate both local feature extraction and long-range dependency modeling capabilities and utilizes multi-prototype learning to model each class as multiple sub-prototypes, preserving the intrinsic variance characteristics within classes. We propose a dedicated co-training approach tailored for extracting fine-grained pseudo-labels from coarse- grained samples. In this approach, a local-softmax pseudo-labeling strategy is developed to ensure a harmonious balance between the efficiency and accuracy of the pseudo-labeling, and four losses are formulated for both single-level class and cross-category granularity supervised learning. We evaluate the proposed framework on the Gaofen Image Dataset (GID) and Five-Billion-Pixels (FBP) dataset, confirming its feasibility and superior results. In particular, based on coarse-class annotations, the performance achieved using only 5% of fineclass labels, in terms of the four metrics, namely mIoU, mean UA, mean F1-score, and OA, reached 91%, 96%, 89%, and 93% of the fully-supervised baseline performance respectively. The code is available at https://github.com/chenhaocs/C2F.
引用
收藏
页码:1 / 17
页数:17
相关论文
共 50 条
  • [1] Coarse-to-Fine Annotation Enrichment for Semantic Segmentation Learning
    Luo, Yadan
    Wang, Ziwei
    Huang, Zi
    Yang, Yang
    Zhao, Cong
    CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 237 - 246
  • [2] CasNet:A Cascade Coarse-to-Fine Network for Semantic Segmentation
    Zhenyang Wang
    Zhidong Deng
    Shiyao Wang
    Tsinghua Science and Technology, 2019, 24 (02) : 207 - 215
  • [3] Continual coarse-to-fine domain adaptation in semantic segmentation
    Shenaj, Donald
    Barbato, Francesco
    Michieli, Umberto
    Zanuttigh, Pietro
    IMAGE AND VISION COMPUTING, 2022, 121
  • [4] CasNet: A Cascade Coarse-to-Fine Network for Semantic Segmentation
    Wang, Zhenyang
    Deng, Zhidong
    Wang, Shiyao
    TSINGHUA SCIENCE AND TECHNOLOGY, 2019, 24 (02) : 207 - 215
  • [5] Coarse-to-Fine Feature Mining for Video Semantic Segmentation
    Sun, Guolei
    Liu, Yun
    Ding, Henghui
    Probst, Thomas
    Van Gool, Luc
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 3116 - 3127
  • [6] Coarse-to-Fine Particle Segmentation in Microscopic Urinary Images
    Qian, Jiye
    Fang, Bin
    Li, Chunyan
    Chen, Lin
    2009 3RD INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING, VOLS 1-11, 2009, : 1978 - 1981
  • [7] Coarse-to-Fine Lung Segmentation in Computed Tomography Images
    Qiang, Yan
    Ji, Guohua
    Han, Xiaohong
    Zhao, Juanjuan
    Liao, Xiaolei
    Cui, Zhihua
    JOURNAL OF COMPUTATIONAL AND THEORETICAL NANOSCIENCE, 2015, 12 (02) : 330 - 334
  • [8] CFNet: A Coarse-to-Fine Network for Few Shot Semantic Segmentation
    Liu, Jiade
    Jung, Cheolkon
    2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2022,
  • [9] Coarse-to-fine Semantic Video Segmentation using Supervoxel Trees
    Jain, Aastha
    Chatterjee, Shaunak
    Vidal, Rene
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 1865 - 1872
  • [10] A coarse-to-fine approach to prostate boundary segmentation in ultrasound images
    Sahba, Farhang
    Tizhoosh, Hamid R.
    Salama, Magdy M.
    BIOMEDICAL ENGINEERING ONLINE, 2005, 4 (1)