Cross-city matters: A multimodal remote sensing benchmark dataset for cross-city semantic segmentation using high-resolution domain adaptation networks

被引:306
|
作者
Hong, Danfeng [1 ]
Zhang, Bing [1 ,2 ]
Li, Hao [3 ]
Li, Yuxuan [1 ,4 ]
Yao, Jing [1 ]
Li, Chenyu [5 ]
Werner, Martin [3 ]
Chanussot, Jocelyn [6 ]
Zipf, Alexander [7 ]
Zhu, Xiao Xiang [8 ]
机构
[1] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
[2] Univ Chinese Acad Sci, Coll Resources & Environm, Beijing 100049, Peoples R China
[3] Tech Univ Munich, Big Geospatial Data Management, D-85521 Munich, Germany
[4] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100049, Peoples R China
[5] Southeast Univ, Sch Math, Nanjing 210096, Peoples R China
[6] Univ Grenoble Alpes, GIPSA Lab, CNRS, Grenoble INP, F-38000 Grenoble, France
[7] Heidelberg Univ, Inst Geog, D-69120 Heidelberg, Germany
[8] Tech Univ Munich, Data Sci Earth Observat, D-80333 Munich, Germany
基金
中国国家自然科学基金;
关键词
Cross-city; Deep learning; Dice loss; Domain adaptation; High-resolution network; Land cover; Multimodal benchmark datasets; Remote sensing; Segmentation; COVER;
D O I
10.1016/j.rse.2023.113856
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Artificial intelligence (AI) approaches nowadays have gained remarkable success in single-modality-dominated remote sensing (RS) applications, especially with an emphasis on individual urban environments (e.g., single cities or regions). Yet these AI models tend to meet the performance bottleneck in the case studies across cities or regions, due to the lack of diverse RS information and cutting-edge solutions with high generalization ability. To this end, we build a new set of multimodal remote sensing benchmark datasets (including hyperspectral, mul-tispectral, SAR) for the study purpose of the cross-city semantic segmentation task (called C2Seg dataset), which consists of two cross-city scenes, i.e., Berlin-Augsburg (in Germany) and Beijing-Wuhan (in China). Beyond the single city, we propose a high-resolution domain adaptation network, HighDAN for short, to promote the AI model's generalization ability from the multi-city environments. HighDAN is capable of retaining the spatially topological structure of the studied urban scene well in a parallel high-to-low resolution fusion fashion but also closing the gap derived from enormous differences of RS image representations between different cities by means of adversarial learning. In addition, the Dice loss is considered in HighDAN to alleviate the class imbalance issue caused by factors across cities. Extensive experiments conducted on the C2Seg dataset show the superiority of our HighDAN in terms of segmentation performance and generalization ability, compared to state-of-the-art com-petitors. The C2Seg dataset and the semantic segmentation toolbox (involving the proposed HighDAN) will be available publicly at https://github.com/danfenghong/RSE_Cross-city.
引用
收藏
页数:17
相关论文
共 47 条
  • [1] Cross-City Semantic Segmentation (C2Seg) in Multimodal Remote Sensing: Outcome of the 2023 IEEE WHISPERS C2Seg Challenge
    Liu, Yuheng
    Wang, Ye
    Zhang, Yifan
    Mei, Shaohui
    Zou, Jiaqi
    Li, Zhuohong
    Lu, Fangxiao
    He, Wei
    Zhang, Hongyan
    Zhao, Huilin
    Chen, Chuan
    Xia, Cong
    Li, Hao
    Vivone, Gemine
    Haensch, Ronny
    Taskin, Gulsen
    Yao, Jing
    Qin, A. K.
    Zhang, Bing
    Chanussot, Jocelyn
    Hong, Danfeng
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 8851 - 8862
  • [2] Cross-city Landuse classification of remote sensing images via deep transfer learning
    Zhao, Xiangyu
    Hu, Jingliang
    Mou, Lichao
    Xiong, Zhitong
    Zhu, Xiao Xiang
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2023, 122
  • [3] A cross-city exploratory analysis of the robustness of bus transit networks using open-source data
    Jia, Tao
    Liu, Wenxuan
    Liu, Xintao
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2021, 580
  • [4] A Bio-Inspired Visual Perception Transformer for Cross-Domain Semantic Segmentation of High-Resolution Remote Sensing Images
    Wang, Xinyao
    Wang, Haitao
    Jing, Yuqian
    Yang, Xianming
    Chu, Jianbo
    REMOTE SENSING, 2024, 16 (09)
  • [5] A Multilevel-Guided Curriculum Domain Adaptation Approach to Semantic Segmentation for High-Resolution Remote Sensing Images
    Xi, Zhihao
    He, Xiangyu
    Meng, Yu
    Yue, Anzhi
    Chen, Jingbo
    Deng, Yupeng
    Chen, Jiansheng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [6] Cross-Scale Feature Propagation Network for Semantic Segmentation of High-Resolution Remote Sensing Images
    Zeng, Qiaolin
    Zhou, Jingxiang
    Niu, Xuerui
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [7] Unsupervised domain adaptation alignment method for cross-domain semantic segmentation of remote sensing images
    Shen Z.
    Ni H.
    Guan H.
    Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2023, 52 (12): : 1 - 2
  • [8] Deep Adversarial Domain Adaptation Method for Cross-Domain Classification in High-Resolution Remote Sensing Images
    Teng Wenxiu
    Wang Ni
    Chen Taisheng
    Wang Benlin
    Chen Menglin
    Shi Huihui
    LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (11)
  • [9] Unsupervised Domain Adaptation Semantic Segmentation of High-Resolution Remote Sensing Imagery With Invariant Domain-Level Prototype Memory
    Zhu, Jingru
    Guo, Ya
    Sun, Geng
    Yang, Libo
    Deng, Min
    Chen, Jie
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [10] Unsupervised Domain Adaptation Semantic Segmentation of High-Resolution Remote Sensing Imagery With Invariant Domain-Level Prototype Memory
    Zhu, Jingru
    Guo, Ya
    Sun, Geng
    Yang, Libo
    Deng, Min
    Chen, Jie
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61