WaterFormer: A coupled transformer and CNN network for waterbody detection in optical remotely-sensed imagery

Cited by: 11
Authors
Kang, Jian [1]
Guan, Haiyan [1]
Ma, Lingfei [2]
Wang, Lanying [3]
Xu, Zhengsen [1]
Li, Jonathan [3]
Affiliations
[1] Nanjing Univ Informat Sci & Technol, Sch Remote Sensing & Geomat Engn, Nanjing 210044, Peoples R China
[2] Cent Univ Finance & Econ, Sch Stat & Math, Beijing 102206, Peoples R China
[3] Univ Waterloo, Dept Geog & Environm Management, Waterloo, ON N2L 3G1, Canada
Funding
National Natural Science Foundation of China;
Keywords
Optical remotely-sensed imagery; Convolutional neural networks (CNNs); Visual Transformer; Waterbody detection (WD); Multi-scale feature; Long-range dependency; SURFACE-WATER; BODY EXTRACTION; LONG-TERM; CLASSIFICATION; NET;
DOI
10.1016/j.isprsjprs.2023.11.006
Chinese Library Classification
P9 [Physical Geography];
Discipline classification code
0705; 070501;
Abstract
As one of the most significant components of the ecosystem, waterbodies need to be closely monitored at different spatial and temporal scales. Nevertheless, variations in waterbody shape, size, and reflectivity, complicated and varied land-cover types, and diverse environmental scenes present considerable challenges to accurate waterbody detection (WD). In this paper, we propose a novel network that couples the Transformer with a convolutional neural network (CNN), termed WaterFormer, to automatically, efficiently, and accurately delineate waterbodies from optical high-resolution remotely sensed (HR-RS) images. The network mainly comprises a dual-stream CNN, a cross-level Vision Transformer, a light-weight attention module, and a sub-pixel up-sampling module. First, the dual-stream network abstracts waterbody features from multiple views and at different levels. Then, to exploit the long-range dependencies between low-level spatial information and high-order semantic features, the cross-level Vision Transformer is embedded into the dual-stream network, aiming to improve WD accuracy. Afterwards, the light-weight attention module is adopted to provide semantically strong feature abstractions by enhancing discriminative neurons, and the sub-pixel up-sampling module is employed to further generate high-resolution and high-quality class-specific representations. Quantitative and qualitative evaluations demonstrated that WaterFormer provides a promising means of detecting waterbody areas in satellite images under complex scene conditions. Moreover, comparative analyses with state-of-the-art (SOTA) alternatives, e.g., MSFENet, MSAFNet, and BiSeNet, also verified the generalization and superiority of WaterFormer in WD tasks. The assessment results showed that WaterFormer achieved an average accuracy of 97.24%, an average precision of 94.59%, an average recall of 91.95%, an average F1-score of 93.24%, and an average Kappa index of 0.9133.
Additionally, we present an open-access HR satellite imagery waterbody dataset, a mesoscale dataset with high-quality, high-precision waterbody annotations, to facilitate future research in this field. The dataset has been released at https://github.com/NJdeuK/WD_Dataset.
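The abstract reports accuracy, precision, recall, F1-score, and Kappa index for WD. As a minimal sketch of how these five scores are commonly computed for a single binary water mask, the Python below uses the standard confusion-matrix definitions. The function name `binary_wd_metrics` and the toy masks are hypothetical, and the abstract does not state how the authors averaged scores across images, so this is an illustration under those assumptions rather than the paper's evaluation code.

```python
import numpy as np


def binary_wd_metrics(pred, truth):
    """Standard binary-mask scores (hypothetical helper, not from the paper).

    pred, truth: array-likes of the same shape; nonzero/True marks water.
    """
    pred = np.asarray(pred, dtype=bool)
    truth = np.asarray(truth, dtype=bool)
    if pred.shape != truth.shape:
        raise ValueError("pred and truth must have the same shape")

    # Confusion-matrix counts for the water class.
    tp = int(np.sum(pred & truth))
    fp = int(np.sum(pred & ~truth))
    fn = int(np.sum(~pred & truth))
    tn = int(np.sum(~pred & ~truth))
    n = tp + fp + fn + tn

    accuracy = (tp + tn) / n
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)

    # Cohen's kappa: observed agreement corrected for chance agreement.
    p_e = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / (n * n)
    kappa = (accuracy - p_e) / (1 - p_e) if p_e < 1 else 0.0

    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1, "kappa": kappa}


# Toy 2x4 masks (made-up values, for illustration only).
pred_mask = [[1, 1, 0, 0],
             [1, 0, 0, 0]]
true_mask = [[1, 0, 0, 0],
             [1, 1, 0, 0]]
scores = binary_wd_metrics(pred_mask, true_mask)
print(scores)
```

Kappa is worth reporting next to accuracy because water often covers a small share of a scene, and a mask that predicts almost no water can still score high accuracy while Kappa stays low.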
Pages: 222-241
Number of pages: 20