Cross-city matters: A multimodal remote sensing benchmark dataset for cross-city semantic segmentation using high-resolution domain adaptation networks

Cited by: 306
Authors
Hong, Danfeng [1]
Zhang, Bing [1, 2]
Li, Hao [3]
Li, Yuxuan [1, 4]
Yao, Jing [1]
Li, Chenyu [5]
Werner, Martin [3]
Chanussot, Jocelyn [6]
Zipf, Alexander [7]
Zhu, Xiao Xiang [8]
Affiliations
[1] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
[2] Univ Chinese Acad Sci, Coll Resources & Environm, Beijing 100049, Peoples R China
[3] Tech Univ Munich, Big Geospatial Data Management, D-85521 Munich, Germany
[4] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100049, Peoples R China
[5] Southeast Univ, Sch Math, Nanjing 210096, Peoples R China
[6] Univ Grenoble Alpes, GIPSA Lab, CNRS, Grenoble INP, F-38000 Grenoble, France
[7] Heidelberg Univ, Inst Geog, D-69120 Heidelberg, Germany
[8] Tech Univ Munich, Data Sci Earth Observat, D-80333 Munich, Germany
Funding
National Natural Science Foundation of China
Keywords
Cross-city; Deep learning; Dice loss; Domain adaptation; High-resolution network; Land cover; Multimodal benchmark datasets; Remote sensing; Segmentation; COVER;
DOI
10.1016/j.rse.2023.113856
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Artificial intelligence (AI) approaches nowadays have gained remarkable success in single-modality-dominated remote sensing (RS) applications, especially with an emphasis on individual urban environments (e.g., single cities or regions). Yet these AI models tend to meet the performance bottleneck in the case studies across cities or regions, due to the lack of diverse RS information and cutting-edge solutions with high generalization ability. To this end, we build a new set of multimodal remote sensing benchmark datasets (including hyperspectral, multispectral, SAR) for the study purpose of the cross-city semantic segmentation task (called C2Seg dataset), which consists of two cross-city scenes, i.e., Berlin-Augsburg (in Germany) and Beijing-Wuhan (in China). Beyond the single city, we propose a high-resolution domain adaptation network, HighDAN for short, to promote the AI model's generalization ability from the multi-city environments. HighDAN is capable of retaining the spatially topological structure of the studied urban scene well in a parallel high-to-low resolution fusion fashion but also closing the gap derived from enormous differences of RS image representations between different cities by means of adversarial learning. In addition, the Dice loss is considered in HighDAN to alleviate the class imbalance issue caused by factors across cities. Extensive experiments conducted on the C2Seg dataset show the superiority of our HighDAN in terms of segmentation performance and generalization ability, compared to state-of-the-art competitors. The C2Seg dataset and the semantic segmentation toolbox (involving the proposed HighDAN) will be available publicly at https://github.com/danfenghong/RSE_Cross-city.
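The abstract names the Dice loss as the remedy for class imbalance. The following is a minimal sketch of a generic soft multi-class Dice loss, not the HighDAN implementation; the function name `soft_dice_loss`, the flat list-of-pixels input layout, and the smoothing constant `eps` are assumptions made for illustration only.

```python
def soft_dice_loss(probs, targets, eps=1e-6):
    """Generic soft Dice loss averaged over classes (illustrative sketch).

    probs   : list of per-pixel class-probability lists, e.g. [[0.9, 0.1], ...]
    targets : list of integer class labels, one per pixel
    eps     : small smoothing constant (assumed value) to avoid division by zero
    """
    num_classes = len(probs[0])
    dice_sum = 0.0
    for c in range(num_classes):
        # Predicted probability mass that falls on pixels truly labelled c.
        intersection = sum(p[c] for p, t in zip(probs, targets) if t == c)
        # Total predicted mass for class c and total true pixel count for class c.
        pred_mass = sum(p[c] for p in probs)
        true_mass = sum(1 for t in targets if t == c)
        dice_sum += (2.0 * intersection + eps) / (pred_mass + true_mass + eps)
    # Every class contributes equally, regardless of how many pixels it has.
    return 1.0 - dice_sum / num_classes


# Imbalanced toy scene: 9 pixels of class 0, 1 pixel of class 1.
targets = [0] * 9 + [1]
# A degenerate predictor that assigns every pixel to class 0.
all_majority = [[1.0, 0.0] for _ in targets]
loss_majority = soft_dice_loss(all_majority, targets)
```

In this toy scene the degenerate predictor labels 9 of 10 pixels correctly, yet its Dice loss stays above 0.5 because the missed minority class counts as much as the majority class in the average. That per-class averaging is why Dice-style losses are commonly used when class frequencies are skewed.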
Pages: 17
Related papers
47 in total
  • [21] Self-training guided disentangled adaptation for cross-domain remote sensing image semantic segmentation
    Zhao, Qi
    Lyu, Shuchang
    Zhao, Hongbo
    Liu, Binghao
    Chen, Lijiang
    Cheng, Guangliang
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 127
  • [22] Semantic Segmentation of High-Resolution Remote Sensing Images Using Multiscale Skip Connection Network
    Ma, Bifang
    Chang, Chih-Yung
    IEEE SENSORS JOURNAL, 2022, 22 (04) : 3745 - 3755
  • [23] Semantic segmentation of high-resolution remote sensing images using fully convolutional network with adaptive threshold
    Wu, Zhihuan
    Gao, Yongming
    Li, Lei
    Xue, Junshi
    Li, Yuntao
    CONNECTION SCIENCE, 2019, 31 (02) : 169 - 184
  • [24] IMPROVING SEMANTIC SEGMENTATION OF HIGH-RESOLUTION REMOTE SENSING IMAGES USING WASSERSTEIN GENERATIVE ADVERSARIAL NETWORK
    Hosseinpour, H. R.
    Samadzadegan, F.
    Javan, F. Dadrass
    Motayyeb, S.
    ISPRS GEOSPATIAL CONFERENCE 2022, JOINT 6TH SENSORS AND MODELS IN PHOTOGRAMMETRY AND REMOTE SENSING, SMPR/ 4TH GEOSPATIAL INFORMATION RESEARCH, GIRESEARCH CONFERENCES, VOL. 48-4, 2023, : 45 - 51
  • [25] Pixel Representation Augmented through Cross-Attention for High-Resolution Remote Sensing Imagery Segmentation
    Luo, Yiyun
    Wang, Jinnian
    Yang, Xiankun
    Yu, Zhenyu
    Tan, Zixuan
    REMOTE SENSING, 2022, 14 (21)
  • [26] Unsupervised Domain Adaptation for Building Extraction of High-Resolution Remote Sensing Imagery Based on Decoupling Style and Semantic Features
    Chen, Jie
    Zhu, Jingru
    He, Peien
    Guo, Ya
    Hong, Liang
    Yang, Yin
    Deng, Min
    Sun, Geng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 17
  • [27] Using high-resolution remote sensing data for habitat suitability models of Bromeliaceae in the city of Merida, Venezuela
    Judith, Caroline
    Schneider, Julio V.
    Schmidt, Marco
    Ortega, Rengifo
    Gaviria, Juan
    Zizka, Georg
    LANDSCAPE AND URBAN PLANNING, 2013, 120 : 107 - 118
  • [28] An Iterative Classification and Semantic Segmentation Network for Old Landslide Detection Using High-Resolution Remote Sensing Images
    Lu, Zili
    Peng, Yuexing
    Li, Wei
    Yu, Junchuan
    Ge, Daqing
    Han, Lingyi
    Xiang, Wei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [29] LodgeNet: Improved rice lodging recognition using semantic segmentation of UAV high-resolution remote sensing images
    Su, Zhongbin
    Wang, Yue
    Xu, Qi
    Gao, Rui
    Kong, Qingming
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 196
  • [30] Accurate semantic segmentation of very high-resolution remote sensing images considering feature state sequences: From benchmark datasets to urban applications
    Wang, Zijie
    Yi, Jizheng
    Chen, Aibin
    Chen, Lijiang
    Lin, Hui
    Xu, Kai
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2025, 220 : 824 - 840