EGFNet: Edge-Aware Guidance Fusion Network for RGB-Thermal Urban Scene Parsing

被引:19
|
作者
Dong, Shaohua [1 ]
Zhou, Wujie [1 ]
Xu, Caie [1 ,2 ]
Yan, Weiqing [2 ]
机构
[1] Zhejiang Univ Sci & Technol, Sch Informat & Elect Engn, Hangzhou 310023, Peoples R China
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 308232, Singapore
基金
中国国家自然科学基金;
关键词
Deep supervision; edge map; high-level information; multimodal fusion; RGB-thermal urban scene parsing; MULTIMODAL FUSION; INFORMATION; REFINEMENT;
D O I
10.1109/TITS.2023.3306368
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Urban scene parsing is the core of the intelligent transportation system, and RGB-thermal urban scene parsing has recently attracted increasing research interest in the field of computer vision. However, most existing approaches fail to perform good boundary extraction for prediction maps and cannot fully use high-level features. In addition, these methods simply fuse the features from RGB and thermal modalities but are unable to obtain comprehensive fused features. To address these problems, an edge-aware guidance fusion network (EGFNet) was developed in this study for RGB-thermal urban scene parsing. First, a prior edge map generated using the RGB and thermal images were introduced to capture detailed information in the prediction map and then embed the prior edge cues into the feature maps. To fuse the RGB and thermal information effectively, a multimodal fusion module was designed that guarantees adequate cross-modal fusion. Considering the importance of high-level semantic information, global and semantic information modules were proposed to extract rich semantic information from the high-level features. For decoding, simple elementwise addition was utilized for cascaded feature fusion. Finally, to improve the parsing accuracy, multitask deep supervision was applied to the semantic and boundary maps. Extensive experiments were performed on benchmark datasets to demonstrate the effectiveness of the proposed EGFNet and its superior performance compared with the state-of-the-art methods.
引用
收藏
页码:657 / 669
页数:13
相关论文
共 48 条
  • [1] Edge-Aware Guidance Fusion Network for RGB-Thermal Scene Parsing
    Zhou, Wujie
    Dong, Shaohua
    Xu, Caie
    Qian, Yaguan
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 3571 - 3579
  • [2] MFFENet: Multiscale Feature Fusion and Enhancement Network For RGB-Thermal Urban Road Scene Parsing
    Zhou, Wujie
    Lin, Xinyang
    Lei, Jingsheng
    Yu, Lu
    Hwang, Jenq-Neng
    IEEE Transactions on Multimedia, 2022, 24 : 2526 - 2538
  • [3] MFFENet: Multiscale Feature Fusion and Enhancement Network For RGB-Thermal Urban Road Scene Parsing
    Zhou, Wujie
    Lin, Xinyang
    Lei, Jingsheng
    Yu, Lu
    Hwang, Jenq-Neng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 2526 - 2538
  • [4] HEFANet: hierarchical efficient fusion and aggregation segmentation network for enhanced rgb-thermal urban scene parsing
    Shen, Zhengwen
    Pan, Zaiyu
    Weng, Yuchen
    Li, Yulian
    Wang, Jiangyu
    Wang, Jun
    APPLIED INTELLIGENCE, 2024, 54 (22) : 11248 - 11266
  • [5] ECFNet: Efficient cross-layer fusion network for real time RGB-Thermal urban scene parsing
    Shen, Zhengwen
    Wang, Jiangyu
    Weng, Yuchen
    Pan, Zaiyu
    Li, Yulian
    Wang, Jun
    DIGITAL SIGNAL PROCESSING, 2024, 151
  • [6] Embedded Control Gate Fusion and Attention Residual Learning for RGB-Thermal Urban Scene Parsing
    Zhou, Wujie
    Lv, Ying
    Lei, Jingsheng
    Yu, Lu
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (05) : 4794 - 4803
  • [7] Multispectral Fusion Transformer Network for RGB-Thermal Urban Scene Semantic Segmentation
    Zhou, Heng
    Tian, Chunna
    Zhang, Zhenxi
    Huo, Qizheng
    Xie, Yongqiang
    Li, Zhongbo
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [8] DCFNet: Dense Complementary Fusion for RGB-Thermal Urban Scene Perception
    Zhang, Yu-Wen Michael
    Zhang, Gang
    Hu, Xiaolin
    ADVANCES IN NEURAL NETWORKS-ISNN 2024, 2024, 14827 : 317 - 327
  • [9] BFTNet: Boundary-Induced Four-Phase Transformer Network for RGB-Thermal Urban Road Scene Parsing
    Zhou, Wujie
    Gong, Tingting
    Fang, Meixin
    Yu, Lu
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
  • [10] RTFNet: RGB-Thermal Fusion Network for Semantic Segmentation of Urban Scenes
    Sun, Yuxiang
    Zuo, Weixun
    Liu, Ming
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (03): : 2576 - 2583