Multi-task Learning of Semantic Segmentation and Height Estimation for Multi-modal Remote Sensing Images

Cited by: 2

Authors:

Mengyu WANG ^{[1,2,3,4]}

Zhiyuan YAN ^{[1,4]}

Yingchao FENG ^{[1,4]}

Wenhui DIAO ^{[1,4]}

Xian SUN ^{[1,2,3,4]}

Affiliations:

[1] Aerospace Information Research Institute, Chinese Academy of Sciences

[2] University of Chinese Academy of Sciences

[3] School of Electronic, Electrical and Communication Engineering, University of Chinese Academy of Sciences

[4] Key Laboratory of Network Information System Technology (NIST), Aerospace Information Research Institute, Chinese Academy of Sciences

Source:

Journal of Geodesy and Geoinformation Science | 2023 / Vol. 6 / No. 04

Funding:

National Key Research and Development Program of China;

Keywords:

DOI:

Not available

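Chinese Library Classification (CLC) number:

P237 [Surveying, mapping and remote sensing technology]; TP751 [Image processing methods];

Discipline classification code: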

081002 ; 1404 ;

Abstract:

Deep learning-based methods have been successfully applied to semantic segmentation of optical remote sensing images. However, as more and more remote sensing data becomes available, a new challenge is to make comprehensive use of multi-modal remote sensing data to break through the performance bottleneck of single-modal interpretation. In addition, semantic segmentation and height estimation in remote sensing data are two strongly correlated tasks, but existing methods usually study them separately, which leads to high computational resource overhead. To this end, we propose a Multi-Task learning framework for Multi-Modal remote sensing images (MM_MT). Specifically, we design a Cross-Modal Feature Fusion (CMFF) method, which aggregates complementary information from different modalities to improve the accuracy of semantic segmentation and height estimation. In addition, a dual-stream multi-task learning method is introduced for Joint Semantic Segmentation and Height Estimation (JSSHE), which extracts common features in a shared network to save time and resources and then learns task-specific features in two task branches. Experimental results on the public multi-modal remote sensing image dataset Potsdam show that, compared with training the two tasks independently, multi-task learning saves 20% of training time and achieves competitive performance, with an mIoU of 83.02% for semantic segmentation and an accuracy of 95.26% for height estimation.
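The abstract names a joint objective over two task branches and reports mIoU, but gives no formulas. The following is a minimal sketch, not the authors' implementation: it assumes a weighted sum of a cross-entropy segmentation loss and an L1 height loss (both choices, and the weights `w_seg` and `w_height`, are hypothetical), and it computes mIoU with the standard per-class intersection-over-union definition on flattened label lists.

```python
import math

def cross_entropy(probs, labels):
    """Mean negative log-likelihood; probs[i] is a per-class distribution for pixel i."""
    return -sum(math.log(p[y]) for p, y in zip(probs, labels)) / len(labels)

def l1_loss(pred, target):
    """Mean absolute error between predicted and reference heights."""
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(target)

def joint_loss(seg_probs, seg_labels, height_pred, height_true,
               w_seg=1.0, w_height=1.0):
    """Assumed joint objective: weighted sum of the two task losses."""
    return (w_seg * cross_entropy(seg_probs, seg_labels)
            + w_height * l1_loss(height_pred, height_true))

def mean_iou(pred, labels, num_classes):
    """Mean IoU over the classes that occur in the prediction or the labels."""
    ious = []
    for c in range(num_classes):
        inter = sum(1 for p, y in zip(pred, labels) if p == c and y == c)
        union = sum(1 for p, y in zip(pred, labels) if p == c or y == c)
        if union > 0:  # skip classes absent from both prediction and labels
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0

# Toy example: two pixels for the loss, four pixels for the metric.
loss = joint_loss([[0.5, 0.5], [0.25, 0.75]], [0, 1], [1.0, 3.0], [2.0, 3.0])
miou = mean_iou([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2)
```

In a shared-network design such as the one the abstract describes, both loss terms would be computed from features produced by a single forward pass through the shared layers, which is where the saving in training time over two independent models would come from.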

Pages: 27-39

Number of pages: 13

50 items in total

[1] MULTI-MODAL MULTI-TASK LEARNING FOR SEMANTIC SEGMENTATION OF LAND COVER UNDER CLOUDY CONDITIONS
Xu, Fang
Shi, Yilei
Yang, Wen
Zhu, Xiaoxiang
IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023 : 6274 - 6277
[2] Multi-Task Learning of Relative Height Estimation and Semantic Segmentation from Single Airborne RGB Images
Lu, Min
Liu, Jiayin
Wang, Feng
Xiang, Yuming
REMOTE SENSING, 2022, 14 (14)
[3] Ticino: A multi-modal remote sensing dataset for semantic segmentation
Barbato, Mirko Paolo
Piccoli, Flavio
Napoletano, Paolo
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
[4] MULTI-TASK LEARNING FOR SEMANTIC CHANGE DETECTION ON VHR REMOTE SENSING IMAGES
Zhou, Yuan
Zhu, Jiahang
Huo, Leigang
Huo, Chunlei
2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022 : 3247 - 3250
[5] Segmentation of Remote Sensing Images Based on U-Net Multi-Task Learning
Ni Ruiwen
Mu Ye
Li Ji
Zhang Tong
Luo Tianye
Feng Ruilong
Gong He
Hu Tianli
Sun Yu
Guo Ying
Li Shijun
Tyasi, Thobela Louis
CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (02) : 3263 - 3274
[6] Segmentation of Remote Sensing Images Based on U-Net Multi-Task Learning
Ruiwen, Ni
Ye, Mu
Ji, Li
Tong, Zhang
Tianye, Luo
Ruilong, Feng
He, Gong
Tianli, Hu
Yu, Sun
Ying, Guo
Shijun, Li
Tyasi, Thobela Louis
Computers, Materials and Continua, 2022, 73 (02) : 3263 - 3274
[7] MQANet: Multi-Task Quadruple Attention Network of Multi-Object Semantic Segmentation from Remote Sensing Images
Li, Yuxia
Si, Yu
Tong, Zhonggui
He, Lei
Zhang, Jinglin
Luo, Shiyu
Gong, Yushu
REMOTE SENSING, 2022, 14 (24)
[8] MFMamba: A Mamba-Based Multi-Modal Fusion Network for Semantic Segmentation of Remote Sensing Images
Wang, Yan
Cao, Li
Deng, He
SENSORS, 2024, 24 (22)
[9] Multi-Modal Multi-Task (3MT) Road Segmentation
Milli, Erkan
Erkent, Ozgur
Yılmaz, Asım Egemen
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (09) : 5408 - 5415
[10] MultiNet: Multi-Modal Multi-Task Learning for Autonomous Driving
Chowdhuri, Sauhaarda
Pankaj, Tushar
Zipser, Karl
2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019 : 1496 - 1504