EGFNet: Edge-Aware Guidance Fusion Network for RGB-Thermal Urban Scene Parsing

被引:19
|
作者
Dong, Shaohua [1 ]
Zhou, Wujie [1 ]
Xu, Caie [1 ,2 ]
Yan, Weiqing [2 ]
机构
[1] Zhejiang Univ Sci & Technol, Sch Informat & Elect Engn, Hangzhou 310023, Peoples R China
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 308232, Singapore
基金
中国国家自然科学基金;
关键词
Deep supervision; edge map; high-level information; multimodal fusion; RGB-thermal urban scene parsing; MULTIMODAL FUSION; INFORMATION; REFINEMENT;
D O I
10.1109/TITS.2023.3306368
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Urban scene parsing is the core of the intelligent transportation system, and RGB-thermal urban scene parsing has recently attracted increasing research interest in the field of computer vision. However, most existing approaches fail to perform good boundary extraction for prediction maps and cannot fully use high-level features. In addition, these methods simply fuse the features from RGB and thermal modalities but are unable to obtain comprehensive fused features. To address these problems, an edge-aware guidance fusion network (EGFNet) was developed in this study for RGB-thermal urban scene parsing. First, a prior edge map generated using the RGB and thermal images were introduced to capture detailed information in the prediction map and then embed the prior edge cues into the feature maps. To fuse the RGB and thermal information effectively, a multimodal fusion module was designed that guarantees adequate cross-modal fusion. Considering the importance of high-level semantic information, global and semantic information modules were proposed to extract rich semantic information from the high-level features. For decoding, simple elementwise addition was utilized for cascaded feature fusion. Finally, to improve the parsing accuracy, multitask deep supervision was applied to the semantic and boundary maps. Extensive experiments were performed on benchmark datasets to demonstrate the effectiveness of the proposed EGFNet and its superior performance compared with the state-of-the-art methods.
引用
收藏
页码:657 / 669
页数:13
相关论文
共 48 条
  • [31] UTFNet: Uncertainty-Guided Trustworthy Fusion Network for RGB-Thermal Semantic Segmentation
    Wang, Qingwang
    Yin, Cheng
    Song, Haochen
    Shen, Tao
    Gu, Yanfeng
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [32] GCNet: Grid-like context-aware network for RGB-thermal semantic segmentation
    Liu, Jinfu
    Zhou, Wujie
    Cui, Yueli
    Yu, Lu
    Luo, Ting
    NEUROCOMPUTING, 2022, 506 : 60 - 67
  • [33] CAFseg: A Semantic segmentation network with cross aggregation fusion strategy for RGB-thermal semantic segmentation
    Yi, Shi
    Wu, Lang
    Liu, Xi
    Li, Junjie
    Jiang, Gang
    INFRARED PHYSICS & TECHNOLOGY, 2024, 136
  • [34] FASFLNet: feature adaptive selection and fusion lightweight network for RGB-D indoor scene parsing
    Qian, Xiaohong
    Lin, Xingyang
    Yu, Lu
    Zhou, Wujie
    OPTICS EXPRESS, 2023, 31 (05) : 8029 - 8041
  • [35] CCFNet: Cross-Complementary fusion network for RGB-D scene parsing of clothing images
    Xu, Gao
    Zhou, Wujie
    Qian, Xiaohong
    Ye, Lv
    Lei, Jingsheng
    Yu, Lu
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 90
  • [36] Edge-aware Depth Completion for Point-cloud 3D Scene Visualization on an RGB-D Camera
    Huang, Yung-Lin
    Hsu, Tang-Wei
    Chien, Shao-Yi
    2014 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING CONFERENCE, 2014, : 422 - 425
  • [37] PGDENet: Progressive Guided Fusion and Depth Enhancement Network for RGB-D Indoor Scene Parsing
    Zhou, Wujie
    Yang, Enquan
    Lei, Jingsheng
    Wan, Jian
    Yu, Lu
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 3483 - 3494
  • [38] Cross-Collaborative Fusion-Encoder Network for Robust RGB-Thermal Salient Object Detection
    Liao, Guibiao
    Gao, Wei
    Li, Ge
    Wang, Junle
    Kwong, Sam
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) : 7646 - 7661
  • [39] CGFNet: cross-guided fusion network for RGB-thermal semantic segmentation CGI PaperID: 105
    Fu, Yanping
    Chen, Qiaoqiao
    Zhao, Haifeng
    VISUAL COMPUTER, 2022, 38 (9-10): : 3243 - 3252
  • [40] Dual-Space Graph-Based Interaction Network for RGB-Thermal Semantic Segmentation in Electric Power Scene
    Xu, Chang
    Li, Qingwu
    Jiang, Xiongbiao
    Yu, Dabing
    Zhou, Yaqin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (04) : 1577 - 1592