WaterFormer: A coupled transformer and CNN network for waterbody detection in optical remotely-sensed imagery

被引:5
|
作者
Kang, Jian [1 ]
Guan, Haiyan [1 ]
Ma, Lingfei [2 ]
Wang, Lanying [3 ]
Xu, Zhengsen [1 ]
Li, Jonathan [3 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Remote Sensing & Geomat Engn, Nanjing 210044, Peoples R China
[2] Cent Univ Finance & Econ, Sch Stat & Math, Beijing 102206, Peoples R China
[3] Univ Waterloo, Dept Geog & Environm Management, Waterloo, ON N2L 3G1, Canada
基金
中国国家自然科学基金;
关键词
Optical remotely-sensed imagery; Convolutional neural networks (CNNs); Visual Transformer; Waterbody detection (WD); Multi-scale feature; Long-range dependency; SURFACE-WATER; BODY EXTRACTION; LONG-TERM; CLASSIFICATION; NET;
D O I
10.1016/j.isprsjprs.2023.11.006
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
As one of the most significant components of the ecosystem, waterbody needs to be highly monitored at different spatial and temporal scales. Nevertheless, waterbody variations in shape, size, and reflectivity, complicated and varied types of land covers, and environmental scene diversity, present colossal challenges in achieving accurate waterbody detection (WD). In this paper, we propose a novel network coupled with the Transformer and convolutional neural network (CNN), termed WaterFormer, to automatically, efficiently, and accurately delineate waterbodies from optical high-resolution remotely sensed (HR-RS) images. This network mainly includes a dualstream CNN, a cross-level Vision Transformer, a light-weight attention module, and a sub-pixel up-sampling module. First, the dual-stream network abstracts waterbody features at multi-views and different levels. Then, to exploit the long-range dependencies between low-level spatial information and high-order semantic features, the cross-level Vision Transformer is embedded into the dual-stream, aiming at improving WD accuracy. Afterwards, the light-weight attention module is adopted to provide semantically strong feature abstractions by enhancing discrimination neurons, and the sub-pixel up-sampling module is employed to further generate high-resolution and high-quality class-specific representations. Quantitative and qualitative evaluations demonstrated that the WaterFormer provided a promising means for detecting waterbody areas in satellite images under complex scene conditions. Moreover, comparative analyses with the state-of-the-art (SOTA) alternatives, e.g., MSFENet, MSAFNet, and BiSeNet, also verified the generalization and superiority of the WaterFormer in WD tasks. The assessment results exhibited that the WaterFormer gained an average accuracy of 97.24%, average precision of 94.59%, average recall of 91.95%, average F1-score of 93.24%, and average Kappa index of 0.9133, respectively. Additionally, we presented an open-access HR satellite imagery waterbody dataset, a mesoscale dataset with high-quality and high-precision waterbody annotation to facilitate future research in this field. The dataset has been released at https://github.com/NJdeuK/WD_Dataset.
引用
收藏
页码:222 / 241
页数:20
相关论文
共 50 条
  • [1] Training strategies for neural network soft classification of remotely-sensed imagery
    Bernard, AC
    Wilkinson, GG
    Kanellopoulos, I
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 1997, 18 (08) : 1851 - 1856
  • [2] APPLICATION OF MULTISPECTRAL REMOTELY-SENSED IMAGERY IN AGRICULTURE
    Kokhan, S.
    [J]. 100 YEARS ISPRS ADVANCING REMOTE SENSING SCIENCE, PT 2, 2010, 38 : 337 - 341
  • [3] REGION ADJACENCY ANALYSIS OF REMOTELY-SENSED IMAGERY
    NICHOL, DG
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 1990, 11 (11) : 2089 - 2101
  • [4] PIXEL SAMPLING OF REMOTELY-SENSED DIGITAL IMAGERY - REPLY
    WACKERNAGEL, H
    [J]. COMPUTERS & GEOSCIENCES, 1994, 20 (7-8) : 1265 - 1265
  • [5] Shoreline feature extraction from remotely-sensed imagery
    Loos, EA
    Niemann, KO
    [J]. IGARSS 2002: IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM AND 24TH CANADIAN SYMPOSIUM ON REMOTE SENSING, VOLS I-VI, PROCEEDINGS: REMOTE SENSING: INTEGRATING OUR VIEW OF THE PLANET, 2002, : 3417 - 3419
  • [6] MODEL-BASED REMOTELY-SENSED IMAGERY INTERPRETATION
    WU, JK
    CHENG, DS
    WANG, WT
    CAI, DL
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 1988, 9 (08) : 1347 - 1356
  • [7] Using maps to automate the classification of remotely-sensed imagery
    Carlotto, MJ
    [J]. ALGORITHMS FOR MULTISPECTRAL AND HYPERSPECTRAL IMAGERY II, 1996, 2758 : 40 - 50
  • [8] PIXEL SAMPLING OF REMOTELY-SENSED DIGITAL IMAGERY - COMMENT
    VANDERWEL, FJM
    JANSSEN, LLF
    [J]. COMPUTERS & GEOSCIENCES, 1994, 20 (7-8) : 1263 - 1264
  • [10] Impact of lossy compression on the classification of remotely-sensed imagery data
    Saghri, JA
    Tescher, AG
    Mohammad, B
    [J]. APPLICATIONS OF DIGITAL IMAGE PROCESSING XXIII, 2000, 4115 : 121 - 129