Semantic Segmentation of Synthetic Aperture Radar Images Based on U-Net and Capsule Network

被引:0
|
作者
Jing Shaodi [1 ]
Yu Lingjuan [1 ]
Hu Yuehong [2 ]
Yang Zezhou [1 ]
Lu Zhongliang [1 ]
Xie Xiaochun [3 ]
机构
[1] Jiangxi Univ Sci & Technol, Sch Informat Engn, Ganzhou 341000, Jiangxi, Peoples R China
[2] Guangzhou Wayful Technol Dev Co Ltd, Guangzhou 510200, Guangdong, Peoples R China
[3] Gannan Normal Univ, Sch Phys & Elect Informat, Ganzhou 341000, Jiangxi, Peoples R China
关键词
image processing; synthetic aperture radar; image semantic segmentation; U-Net; capsule network; transfer learning;
D O I
10.3788/LOP202158.2010009
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As a pixel-level classification technique, image semantic segmentation has been employed in the field of synthetic aperture radar (SAR) image interpretations. U-Net is an end-to--end image semantic segmentation network with a typical encoder-decoder architecture. Among them, the coding part mainly comprises a convolutional layer and a pooling layer, which can effectively extract the features of a target image; however, extracting information such as the target position and direction is difficult. Capsule network is a type of neural network that can obtain the target pose (position, size, and direction) and other information. Therefore, this study proposes an SAR image semantic segmentation method based on the U-Net and capsule network. Moreover, considering the small data set of SAR images, the U-Net encoder is designed to be identical to the visual geometry group (VGG16) to allow the trained VGG16 model to be directly transferred to the encoder. The effectiveness of the method is verified by conducting a segmentation experiment of building targets on two polarimetric SAR image data sets. Results show that the method can achieve improved precision, recall, F1-score, and intersection over union as well as reduce the training time of the network model when compared with the U-Net.
引用
收藏
页数:10
相关论文
共 27 条
  • [1] Recurrent residual U-Net for medical image segmentation
    Alom, Md Zahangir
    Yakopcic, Chris
    Hasan, Mahmudul
    Taha, Tarek M.
    Asari, Vijayan K.
    [J]. JOURNAL OF MEDICAL IMAGING, 2019, 6 (01)
  • [2] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [3] An Active Deep Learning Approach for Minimally Supervised PolSAR Image Classification
    Bi, Haixia
    Xu, Feng
    Wei, Zhiqiang
    Xue, Yong
    Xu, Zongben
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (11): : 9378 - 9395
  • [4] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
    Chen, Liang-Chieh
    Zhu, Yukun
    Papandreou, George
    Schroff, Florian
    Adam, Hartwig
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
  • [5] ChenJ K, 2019, SCI SINICA INFORM, V49, P1606
  • [6] Gao M Y, 2016, SAR IMAGE SEGMENTATI
  • [7] Guo YH, 2015, INT GEOSCI REMOTE SE, P1841, DOI 10.1109/IGARSS.2015.7326150
  • [8] Hu T, 2019, CHIN J LASERS, V46, DOI DOI 10.3788/CJL201916.0210001
  • [9] [黄刚 Huang Gang], 2019, [测绘科学, Science of Surveying and Mapping], V44, P166
  • [10] Wishart Deep Stacking Network for Fast POLSAR Image Classification
    Jiao, Licheng
    Liu, Fang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (07) : 3273 - 3286