Deep multimodal fusion for semantic image segmentation: A survey

被引:100
|
作者
Zhang, Yifei [1 ]
Sidibe, Desire [2 ]
Morel, Olivier [1 ]
Meriaudeau, Fabrice [1 ]
机构
[1] Univ Bourgogne Franche Comte, ImViA, VIBOT ERL CNRS 6000, F-71200 Le Creusot, France
[2] Univ Paris Saclay, Univ Evry, IBISC, F-91020 Evry, France
关键词
Image fusion; Multi-modal; Deep learning; Semantic segmentation; NEURAL-NETWORKS; RGB-D; POLARIZATION; RECOGNITION; VISION;
D O I
10.1016/j.imavis.2020.104042
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in deep learning have shown excellent performance in various scene understanding tasks. However, in some complex environments or under challenging conditions, it is necessary to employ multiple modalities that provide complementary information on the same scene. A variety of studies have demonstrated that deep multimodal fusion for semantic image segmentation achieves significant performance improvement. These fusion approaches take the benefits of multiple information sources and generate an optimal joint prediction automatically. This paper describes the essential background concepts of deep multimodal fusion and the relevant applications in computer vision. In particular, we provide a systematic survey of multimodal fusion methodologies, multimodal segmentation datasets, and quantitative evaluations on the benchmark datasets. Existing fusion methods are summarized according to a common taxonomy: early fusion, late fusion, and hybrid fusion. Based on their performance, we analyze the strengths and weaknesses of different fusion strategies. Current challenges and design choices are discussed, aiming to provide the reader with a comprehensive and heuristic view of deep multimodal image segmentation. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Multimodal Deep Learning in Semantic Image Segmentation: A Review
    Raman, Vishal
    Kumari, Madhu
    [J]. PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTERNET OF THINGS (CCIOT 2018), 2018, : 7 - 11
  • [2] Deep Multimodal Fusion Network for Semantic Segmentation Using Remote Sensing Image and LiDAR Data
    Sun, Yangjie
    Fu, Zhongliang
    Sun, Chuanxia
    Hu, Yinglei
    Zhang, Shengyuan
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [3] A Survey of Image Semantic Segmentation Based on Deep Network
    Luo, Hui-Lan
    Zhang, Yun
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2019, 47 (10): : 2211 - 2220
  • [4] A survey on deep learning techniques for image and video semantic segmentation
    Garcia-Garcia, Alberto
    Orts-Escolano, Sergio
    Oprea, Sergiu
    Villena-Martinez, Victor
    Martinez-Gonzalez, Pablo
    Garcia-Rodriguez, Jose
    [J]. APPLIED SOFT COMPUTING, 2018, 70 : 41 - 65
  • [5] A Survey on Image Semantic Segmentation Using Deep Learning Techniques
    Cheng, Jieren
    Li, Hua
    Li, Dengbo
    Hua, Shuai
    Sheng, Victor S.
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (01): : 1941 - 1957
  • [6] A Review of Optical and SAR Image Deep Feature Fusion in Semantic Segmentation
    Liu, Chenfang
    Sun, Yuli
    Xu, Yanjie
    Sun, Zhongzhen
    Zhang, Xianghui
    Lei, Lin
    Kuang, Gangyao
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 12910 - 12930
  • [7] Tissue segmentation for traumatic brain injury based on multimodal MRI image fusion-semantic segmentation
    Xu, Yao
    Chen, Zhongmin
    Wang, Xiaohui
    Jiang, Shanghai
    Wang, Fuping
    Lu, Hong
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 99
  • [8] Exploration of Deep Learning-based Multimodal Fusion for Semantic Road Scene Segmentation
    Zhang, Yifei
    Morel, Olivier
    Blanchon, Marc
    Seulin, Ralph
    Rastgoo, Mojdeh
    Sidibe, Desire
    [J]. PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, : 336 - 343
  • [9] Image semantic segmentation with hierarchical feature fusion based on deep neural network
    Yang, Dawei
    Du, Yan
    Yao, Hongli
    Bao, Liyan
    [J]. CONNECTION SCIENCE, 2022, 34 (01) : 1772 - 1784
  • [10] Survey on Semantic Image Segmentation Techniques
    Sevak, Jay S.
    Kapadia, Aerika D.
    Chavda, Jaiminkumar B.
    Shah, Arpita
    Rahevar, Mrugendrasinh
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT SUSTAINABLE SYSTEMS (ICISS 2017), 2017, : 306 - 313