Multi-Modal Object Tracking and Image Fusion With Unsupervised Deep Learning

Cited by: 11
Authors
LaHaye, Nicholas [1, 2]
Ott, Jordan [1]
Garay, Michael J. [3]
El-Askary, Hesham Mohamed [4, 5, 6]
Linstead, Erik [5, 7]
Affiliations
[1] Chapman Univ, Computat & Data Sci Dept, Orange, CA 92866 USA
[2] CALTECH, Jet Prop Lab, Proc Algorithms & Calibrat Engn Grp, 4800 Oak Grove Dr, Pasadena, CA 91109 USA
[3] CALTECH, Jet Prop Lab, 4800 Oak Grove Dr, Pasadena, CA 91109 USA
[4] Chapman Univ, Ctr Excellence Earth Syst Modeling & Observat, Orange, CA 92866 USA
[5] Chapman Univ, Schmid Coll Sci & Technol, Orange, CA 92866 USA
[6] Alexandria Univ, Fac Sci, Dept Environm Sci, Alexandria 21522, Egypt
[7] Chapman Univ, Machine Learning & Assist Technol Lab, Orange, CA 92866 USA
Keywords
Big data applications; clustering; computer vision; deep belief networks (DBNs); deep learning; classification; MISR
DOI
10.1109/JSTARS.2019.2920234
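Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract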
The number of different modalities for remote sensors continues to grow, bringing with it an increase in the volume and complexity of the data being collected. Although these datasets individually provide valuable information, in aggregate they provide additional opportunities to discover meaningful patterns on a large scale. However, the ability to combine and analyze disparate datasets is challenged by the potentially vast parameter space that results from aggregation. Each dataset in itself requires instrument-specific and dataset-specific knowledge. If the intention is to use multiple, diverse datasets, one needs an understanding of how to translate and combine these parameters in an efficient and effective manner. While there are established techniques for combining datasets from specific domains or platforms, there is no generic, automated method that can address the problem in general. Here, we discuss the application of deep learning to track objects across different image-like data modalities, given data in a similar spatio-temporal range, and automatically co-register these images. Using deep belief networks combined with unsupervised learning methods, we are able to recognize and separate different objects within image-like data in a structured manner, thus making progress toward the ultimate goal of a generic tracking and fusion pipeline requiring minimal human intervention.
Pages: 3056-3066
Number of pages: 11