EGFNet: Edge-Aware Guidance Fusion Network for RGB-Thermal Urban Scene Parsing

Cited by: 19

Authors:

Dong, Shaohua ^{[1]}

Zhou, Wujie ^{[1]}

Xu, Caie ^{[1,2]}

Yan, Weiqing ^{[2]}

Affiliations:

[1] Zhejiang Univ Sci & Technol, Sch Informat & Elect Engn, Hangzhou 310023, Peoples R China

[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 308232, Singapore

Source:

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2024 / Vol. 25 / No. 01

Funding:

National Natural Science Foundation of China;

Keywords:

Deep supervision; edge map; high-level information; multimodal fusion; RGB-thermal urban scene parsing; MULTIMODAL FUSION; INFORMATION; REFINEMENT;

DOI:

10.1109/TITS.2023.3306368

Chinese Library Classification (CLC) number:

TU [Building Science];

Discipline classification code:

0813 ;

Abstract:

Urban scene parsing is the core of the intelligent transportation system, and RGB-thermal urban scene parsing has recently attracted increasing research interest in the field of computer vision. However, most existing approaches fail to perform good boundary extraction for prediction maps and cannot fully use high-level features. In addition, these methods simply fuse the features from RGB and thermal modalities but are unable to obtain comprehensive fused features. To address these problems, an edge-aware guidance fusion network (EGFNet) was developed in this study for RGB-thermal urban scene parsing. First, a prior edge map generated from the RGB and thermal images was introduced to capture detailed information in the prediction map, and the prior edge cues were then embedded into the feature maps. To fuse the RGB and thermal information effectively, a multimodal fusion module was designed that guarantees adequate cross-modal fusion. Considering the importance of high-level semantic information, global and semantic information modules were proposed to extract rich semantic information from the high-level features. For decoding, simple elementwise addition was utilized for cascaded feature fusion. Finally, to improve the parsing accuracy, multitask deep supervision was applied to the semantic and boundary maps. Extensive experiments were performed on benchmark datasets to demonstrate the effectiveness of the proposed EGFNet and its superior performance compared with state-of-the-art methods.
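
The abstract mentions multitask deep supervision over semantic and boundary maps without giving the loss. The following is a minimal sketch of one common way such a loss could be formed, not the authors' formulation: it assumes per-pixel cross-entropy for the semantic map, binary cross-entropy for the boundary map, and a plain sum over supervised decoder stages. The function names, the `edge_weight` parameter, and the list-based data layout are all hypothetical.

```python
import math

def cross_entropy(probs, label):
    """Cross-entropy for one pixel; probs is a list of class probabilities."""
    return -math.log(max(probs[label], 1e-12))

def binary_cross_entropy(p, target):
    """Binary cross-entropy for one pixel of a boundary (edge) map."""
    p = min(max(p, 1e-12), 1.0 - 1e-12)
    return -(target * math.log(p) + (1 - target) * math.log(1.0 - p))

def multitask_deep_supervision_loss(stage_outputs, sem_labels, edge_labels,
                                    edge_weight=1.0):
    """Sum semantic and boundary losses over every supervised stage.

    stage_outputs: list of (sem_probs, edge_probs), one pair per stage
      (hypothetical layout). sem_probs is a list over pixels of class
      probability lists; edge_probs is a list over pixels of floats in [0, 1].
    edge_weight: assumed balancing factor, not taken from the paper.
    """
    total = 0.0
    for sem_probs, edge_probs in stage_outputs:
        # Mean semantic loss over pixels for this stage.
        sem = sum(cross_entropy(p, y)
                  for p, y in zip(sem_probs, sem_labels)) / len(sem_labels)
        # Mean boundary loss over pixels for this stage.
        edge = sum(binary_cross_entropy(p, t)
                   for p, t in zip(edge_probs, edge_labels)) / len(edge_labels)
        total += sem + edge_weight * edge
    return total

# Two pixels, two classes, two supervised stages with uninformative predictions.
stage = ([[0.5, 0.5], [0.5, 0.5]], [0.5, 0.5])
loss = multitask_deep_supervision_loss([stage, stage], [0, 1], [1, 0])
```

In a real network the per-stage predictions would be upsampled to the label resolution before the loss is computed, and the stage and task weights would be tuned; the sketch omits both.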


Pages: 657-669

Page count: 13
