Deep Learning Multimodal Fusion for Road Network Extraction: Context and Contour Improvement

Cited by: 4
Authors
Filho, Antonio [1]
Shimabukuro, Milton [2]
Poz, Aluir Dal [1]
Affiliations
[1] Sao Paulo State Univ Unesp, Sch Technol & Sci, Dept Cartog, BR-19060900 Presidente Prudente, SP, Brazil
[2] Sao Paulo State Univ, Dept Math & Comp Sci, BR-01049010 Presidente Prudente, SP, Brazil
Funding
São Paulo Research Foundation, Brazil
Keywords
Deep learning; depth models; multimodal fusion; road network extraction; Unet; SEMANTIC SEGMENTATION; RGB;
DOI
10.1109/LGRS.2023.3291656
Chinese Library Classification
P3 [Geophysics]; P59 [Geochemistry]
Discipline classification codes
0708; 070902
Abstract
Road extraction remains a challenging research topic. Deep convolutional neural networks are currently the state of the art in road network segmentation and are known for their remarkable ability to exploit multilevel context. Even so, these architectures still suffer from occlusions and obstructions that cause discontinuities and omissions in the extracted road networks. These effects are generally mitigated with strategies that capture the context of the scene, rather than by exploiting complementary knowledge from diverse sources. We propose an early fusion network that combines RGB and surface model images, which provide complementary geometric data, to improve road surface extraction. Our results demonstrate that Unet_early reaches 71.01% intersection over union (IoU) and 81.95% F1, and that the fusion strategy increases the IoU and F1 scores by 2.3% and 1.5%, respectively. It also outperformed the best model without fusion (DeepLabv3+). The Brazilian dataset and architecture implementation are available at https://github.com/tunofilho/ieee_road_multimodal.
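The two technical ideas in the abstract, early fusion and the IoU/F1 scores, can be illustrated with a minimal NumPy sketch. It is not the authors' implementation (that is in the repository linked above). It assumes that early fusion means stacking the surface model as a fourth input channel next to RGB before the network sees the data, and that the scores are computed on binary road masks. The function names `early_fuse` and `iou_and_f1`, and the min-max scaling of the surface model, are hypothetical choices made only for this illustration.

```python
import numpy as np


def early_fuse(rgb, surface):
    """Stack an RGB image (H, W, 3) and a surface model (H, W) into (H, W, 4).

    Assumption: input-level (early) fusion by channel concatenation.
    The scaling below is illustrative, not taken from the paper.
    """
    if rgb.shape[:2] != surface.shape[:2]:
        raise ValueError("RGB and surface model must share height and width")
    rgb = rgb.astype(np.float32) / 255.0  # assumes 8-bit RGB
    surface = surface.astype(np.float32)
    span = surface.max() - surface.min()
    if span > 0:
        surface = (surface - surface.min()) / span  # min-max scale to [0, 1]
    else:
        surface = np.zeros_like(surface)  # flat surface carries no contrast
    return np.concatenate([rgb, surface[..., None]], axis=-1)


def iou_and_f1(pred, target):
    """Return (IoU, F1) for two binary masks of the same shape."""
    pred = np.asarray(pred).astype(bool)
    target = np.asarray(target).astype(bool)
    tp = np.logical_and(pred, target).sum()    # road predicted, road in truth
    fp = np.logical_and(pred, ~target).sum()   # road predicted, background in truth
    fn = np.logical_and(~pred, target).sum()   # road missed
    union = tp + fp + fn
    # Two empty masks agree perfectly, so both scores are defined as 1.0.
    iou = tp / union if union else 1.0
    f1 = 2 * tp / (2 * tp + fp + fn) if union else 1.0
    return float(iou), float(f1)
```

A segmentation network fed with the output of `early_fuse` would need its first convolution to accept four input channels instead of three; the rest of the architecture can stay unchanged, which is what makes this form of fusion inexpensive to try.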
Pages: 5