Deep Learning Multimodal Fusion for Road Network Extraction: Context and Contour Improvement

被引：4

作者：

Filho Antonio ^{[1
]}

Shimabukuro, Milton ^{[2
]}

Poz, Aluir Dal ^{[1
]}

机构：

[1] Sao Paulo State Univ Unesp, Sch Technol & Sci, Dept Cartog, BR-19060900 Presidente Prudente, SP, Brazil

[2] Sao Paulo State Univ, Dept Math & Comp Sci, BR-01049010 Presidente Prudente, SP, Brazil

来源：

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS | 2023年 / 20卷

基金：

巴西圣保罗研究基金会;

关键词：

Deep learning; depth models; multimodal fusion; road network extraction; Unet; SEMANTIC SEGMENTATION; RGB;

D O I：

10.1109/LGRS.2023.3291656

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

Road extraction is still a challenging topic for researchers. Currently, deep convolution neural networks are state-of-the-art in road network segmentation and are known for their remarkable ability to explore multilevel contexts. Despite this, the architectures still suffer from occlusion and obstructions that cause discontinuities and omissions in extracted road networks. Generally, these effects are minimized with strategies to obtain the context of the scene and not explore the complementarity of knowledge from a diversity of sources. We propose an early fusion network with RGB and surface model images that provide complementary geometric data to improve road surface extraction. Our results demonstrate that Unet_early reaches 71.01% intersection over union (IoU) and 81.95% F1, and the fusion strategy increases the IoU and F1 scores at 2.3% and 1.5%, respectively. Besides, it overpassed the best model without fusion (DeepLabv3+). The Brazilian dataset and architecture implementation are available at https://github.com/tunofilho/ieee_road_multimodal.

引用

页数：5

共 50 条

[41] Integrating Deep Learning with Logic Fusion for Information Extraction
Wang, Wenya
Pan, Sinno Jialin
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 9225 - 9232
[42] A Multimodal Intermediate Fusion Network with Manifold Learning for Stress Detection
Bodaghi, Morteza
Hosseini, Majid
Gottumukkala, Raju
2024 IEEE 3RD INTERNATIONAL CONFERENCE ON COMPUTING AND MACHINE INTELLIGENCE, ICMI 2024, 2024,
[43] Deep Binocular Fixation Prediction Using a Hierarchical Multimodal Fusion Network
Zhou, Wujie
Liu, Wenyu
Lei, Jingsheng
Luo, Ting
Yu, Lu
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2023, 15 (02) : 476 - 486
[44] Multimodal Deep Fusion Network for Visibility Assessment With a Small Training Dataset
Wang, Han
Shen, Kecheng
Yu, Peilun
Shi, Quan
Ko, Hanseok
IEEE ACCESS, 2020, 8 : 217057 - 217067
[45] Road network extraction: a neural-dynamic framework based on deep learning and a finite state machine
Wang, Jun
Song, Jingwei
Chen, Mingquan
Yang, Zhi
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2015, 36 (12) : 3144 - 3169
[46] Semantics and Contour Based Interactive Learning Network for Building Footprint Extraction
Zhu, Xiaoqian
Zhang, Xiangrong
Zhang, Tianyang
Tang, Xu
Chen, Puhua
Zhou, Huiyu
Jiao, Licheng
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[47] TMFN: a text-based multimodal fusion network with multi-scale feature extraction and unsupervised contrastive learning for multimodal sentiment analysis
Fu, Junsong
Fu, Youjia
Xue, Huixia
Xu, Zihao
COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (02)
[48] Deep learning multimodal bilinear fusion network based on vibrational spectroscopy for diagnosis of benign and malignant thyroid tumors
Zhou, Xuguang
Chen, Xiangnan
Song, Haitao
Lv, Xiaoyi
Gu, Jin
Chen, Chen
Chen, Cheng
MICROCHEMICAL JOURNAL, 2025, 209
[49] Automatic Extraction and Classification of Road Markings Based on Deep Learning
Huang G.
Liu X.
Zhongguo Jiguang/Chinese Journal of Lasers, 2019, 46 (08):
[50] Automatic Extraction and Classification of Road Markings Based on Deep Learning
Huang Gang
Liu Xianlin
CHINESE JOURNAL OF LASERS-ZHONGGUO JIGUANG, 2019, 46 (08):

← 1 2 3 4 5 →