Multi-task Learning of Semantic Segmentation and Height Estimation for Multi-modal Remote Sensing Images

被引:2
|
作者
Mengyu WANG [1 ,2 ,3 ,4 ]
Zhiyuan YAN [1 ,4 ]
Yingchao FENG [1 ,4 ]
Wenhui DIAO [1 ,4 ]
Xian SUN [1 ,2 ,3 ,4 ]
机构
[1] Aerospace Information Research Institute,Chinese Academy of Sciences
[2] University of Chinese Academy of Sciences
[3] School of Electronic,Electrical and Communication Engineering,University of Chinese Academy of Sciences
[4] Key Laboratory of Network Information System Technology(NIST),Aerospace Information Research Institute,Chinese Academy of Sciences
基金
国家重点研发计划;
关键词
D O I
暂无
中图分类号
P237 [测绘遥感技术]; TP751 [图像处理方法];
学科分类号
081002 ; 1404 ;
摘要
Deep learning based methods have been successfully applied to semantic segmentation of optical remote sensing images. However, as more and more remote sensing data is available, it is a new challenge to comprehensively utilize multi-modal remote sensing data to break through the performance bottleneck of single-modal interpretation. In addition, semantic segmentation and height estimation in remote sensing data are two tasks with strong correlation, but existing methods usually study individual tasks separately, which leads to high computational resource overhead. To this end, we propose a Multi-Task learning framework for Multi-Modal remote sensing images(MM_MT). Specifically, we design a Cross-Modal Feature Fusion(CMFF) method, which aggregates complementary information of different modalities to improve the accuracy of semantic segmentation and height estimation. Besides, a dual-stream multi-task learning method is introduced for Joint Semantic Segmentation and Height Estimation(JSSHE), extracting common features in a shared network to save time and resources, and then learning task-specific features in two task branches. Experimental results on the public multi-modal remote sensing image dataset Potsdam show that compared to training two tasks independently, multi-task learning saves 20% of training time and achieves competitive performance with mIoU of 83.02% for semantic segmentation and accuracy of 95.26% for height estimation.
引用
收藏
页码:27 / 39
页数:13
相关论文
共 50 条
  • [1] MULTI-MODAL MULTI-TASK LEARNING FOR SEMANTIC SEGMENTATION OF LAND COVER UNDER CLOUDY CONDITIONS
    Xu, Fang
    Shi, Yilei
    Yang, Wen
    Zhu, Xiaoxiang
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 6274 - 6277
  • [2] Multi-Task Learning of Relative Height Estimation and Semantic Segmentation from Single Airborne RGB Images
    Lu, Min
    Liu, Jiayin
    Wang, Feng
    Xiang, Yuming
    REMOTE SENSING, 2022, 14 (14)
  • [3] Ticino: A multi-modal remote sensing dataset for semantic segmentation
    Barbato, Mirko Paolo
    Piccoli, Flavio
    Napoletano, Paolo
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
  • [4] MULTI-TASK LEARNING FOR SEMANTIC CHANGE DETECTION ON VHR REMOTE SENSING IMAGES
    Zhou, Yuan
    Zhu, Jiahang
    Huo, Leigang
    Huo, Chunlei
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 3247 - 3250
  • [5] Segmentation of Remote Sensing Images Based on U-Net Multi-Task Learning
    Ni Ruiwen
    Mu Ye
    Li Ji
    Zhang Tong
    Luo Tianye
    Feng Ruilong
    Gong He
    Hu Tianli
    Sun Yu
    Guo Ying
    Li Shijun
    Tyasi, Thobela Louis
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (02): : 3263 - 3274
  • [6] Segmentation of Remote Sensing Images Based on U-Net Multi-Task Learning
    Ruiwen, Ni
    Ye, Mu
    Ji, Li
    Tong, Zhang
    Tianye, Luo
    Ruilong, Feng
    He, Gong
    Tianli, Hu
    Yu, Sun
    Ying, Guo
    Shijun, Li
    Tyasi, Thobela Louis
    Computers, Materials and Continua, 2022, 73 (02): : 3263 - 3274
  • [7] MQANet: Multi-Task Quadruple Attention Network of Multi-Object Semantic Segmentation from Remote Sensing Images
    Li, Yuxia
    Si, Yu
    Tong, Zhonggui
    He, Lei
    Zhang, Jinglin
    Luo, Shiyu
    Gong, Yushu
    REMOTE SENSING, 2022, 14 (24)
  • [8] MFMamba: A Mamba-Based Multi-Modal Fusion Network for Semantic Segmentation of Remote Sensing Images
    Wang, Yan
    Cao, Li
    Deng, He
    SENSORS, 2024, 24 (22)
  • [9] Multi-Modal Multi-Task (3MT) Road Segmentation
    Milli, Erkan
    Erkent, Ozgur
    Ylmaz, Asm Egemen
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (09) : 5408 - 5415
  • [10] MultiNet: Multi-Modal Multi-Task Learning for Autonomous Driving
    Chowdhuri, Sauhaarda
    Pankaj, Tushar
    Zipser, Karl
    2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1496 - 1504