WaterFormer: A coupled transformer and CNN network for waterbody detection in optical remotely-sensed imagery

Cited by: 5
Authors
Kang, Jian [1]
Guan, Haiyan [1]
Ma, Lingfei [2]
Wang, Lanying [3]
Xu, Zhengsen [1]
Li, Jonathan [3]
Affiliations
[1] Nanjing Univ Informat Sci & Technol, Sch Remote Sensing & Geomat Engn, Nanjing 210044, Peoples R China
[2] Cent Univ Finance & Econ, Sch Stat & Math, Beijing 102206, Peoples R China
[3] Univ Waterloo, Dept Geog & Environm Management, Waterloo, ON N2L 3G1, Canada
Funding
National Natural Science Foundation of China
Keywords
Optical remotely-sensed imagery; Convolutional neural networks (CNNs); Visual Transformer; Waterbody detection (WD); Multi-scale feature; Long-range dependency; SURFACE-WATER; BODY EXTRACTION; LONG-TERM; CLASSIFICATION; NET;
DOI
10.1016/j.isprsjprs.2023.11.006
Chinese Library Classification (CLC) number
P9 [Physical Geography]
Discipline classification code
0705; 070501
Abstract
As one of the most significant components of the ecosystem, waterbodies need to be closely monitored at different spatial and temporal scales. Nevertheless, variations in waterbody shape, size, and reflectivity, the complicated and varied types of land cover, and the diversity of environmental scenes present considerable challenges to accurate waterbody detection (WD). In this paper, we propose a novel network that couples a Transformer with a convolutional neural network (CNN), termed WaterFormer, to automatically, efficiently, and accurately delineate waterbodies from optical high-resolution remotely sensed (HR-RS) images. The network mainly comprises a dual-stream CNN, a cross-level Vision Transformer, a light-weight attention module, and a sub-pixel up-sampling module. First, the dual-stream network abstracts waterbody features from multiple views and at different levels. Then, to exploit the long-range dependencies between low-level spatial information and high-order semantic features, the cross-level Vision Transformer is embedded into the dual-stream network to improve WD accuracy. Afterwards, the light-weight attention module provides semantically strong feature abstractions by enhancing discriminative neurons, and the sub-pixel up-sampling module further generates high-resolution, high-quality class-specific representations. Quantitative and qualitative evaluations demonstrated that WaterFormer provides a promising means of detecting waterbody areas in satellite images under complex scene conditions. Moreover, comparative analyses with state-of-the-art (SOTA) alternatives, e.g., MSFENet, MSAFNet, and BiSeNet, also verified the generalization and superiority of WaterFormer in WD tasks. In the assessment, WaterFormer achieved an average accuracy of 97.24%, average precision of 94.59%, average recall of 91.95%, average F1-score of 93.24%, and average Kappa index of 0.9133.
Additionally, we present an open-access HR satellite imagery waterbody dataset, a mesoscale dataset with high-quality, high-precision waterbody annotations, to facilitate future research in this field. The dataset has been released at https://github.com/NJdeuK/WD_Dataset.
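The five scores reported in the abstract (accuracy, precision, recall, F1-score, Kappa) are standard measures for binary water/non-water masks. The following is a minimal sketch of how they are commonly computed from a confusion matrix; it is not the authors' evaluation code. The function names `confusion_counts` and `binary_metrics` are hypothetical, and how the paper averages scores across test images is not stated in this record.

```python
def confusion_counts(pred, truth):
    """Count TP, FP, FN, TN for flat sequences of 0/1 labels (1 = water).

    Hypothetical helper for illustration; assumes both inputs have equal length.
    """
    tp = sum(1 for p, t in zip(pred, truth) if p == 1 and t == 1)
    fp = sum(1 for p, t in zip(pred, truth) if p == 1 and t == 0)
    fn = sum(1 for p, t in zip(pred, truth) if p == 0 and t == 1)
    tn = sum(1 for p, t in zip(pred, truth) if p == 0 and t == 0)
    return tp, fp, fn, tn


def binary_metrics(tp, fp, fn, tn):
    """Return accuracy, precision, recall, F1, and Cohen's kappa.

    Assumes non-degenerate counts (no zero denominators); a production
    version would need to guard those cases.
    """
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    # Cohen's kappa: observed agreement corrected for chance agreement.
    p_observed = accuracy
    p_expected = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / total ** 2
    kappa = (p_observed - p_expected) / (1 - p_expected)
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "kappa": kappa,
    }


if __name__ == "__main__":
    # Toy masks, flattened to 1-D; not data from the paper.
    predicted = [1, 1, 0, 0, 1, 0, 1, 0]
    reference = [1, 0, 0, 0, 1, 1, 1, 0]
    print(binary_metrics(*confusion_counts(predicted, reference)))
```

Kappa is useful alongside accuracy here because water often covers a small share of a scene, so a high accuracy alone can be reached by mostly predicting background.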
Pages: 222-241
Page count: 20