Geometric Multimodal Deep Learning With Multiscaled Graph Wavelet Convolutional Network

被引:7
|
作者
Behmanesh, Maysam [1 ]
Adibi, Peyman [1 ]
Ehsani, Sayyed Mohammad Saeed [1 ]
Chanussot, Jocelyn [2 ]
机构
[1] Univ Isfahan, Fac Comp Engn, Artificial Intelligence Dept, Esfahan 8174673441, Iran
[2] Univ Grenoble Alpes, GIPSA Lab, CNRS, Grenoble INP, Grenoble, France
关键词
Wavelet transforms; Convolution; Wavelet domain; Manifolds; Learning systems; Laplace equations; Deep learning; Geometric deep learning; graph convolution neural networks; graph wavelet transform; multimodal learning; spectral approaches; HYPERSPECTRAL IMAGE CLASSIFICATION; CLASSIFIERS; ATTENTION;
D O I
10.1109/TNNLS.2022.3213589
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal data provide complementary information of a natural phenomenon by integrating data from various domains with very different statistical properties. Capturing the intramodality and cross-modality information of multimodal data is the essential capability of multimodal learning methods. The geometry-aware data analysis approaches provide these capabilities by implicitly representing data in various modalities based on their geometric underlying structures. Also, in many applications, data are explicitly defined on an intrinsic geometric structure. Generalizing deep learning methods to the non-Euclidean domains is an emerging research field, which has recently been investigated in many studies. Most of those popular methods are developed for unimodal data. In this article, a multimodal graph wavelet convolutional network (M-GWCN) is proposed as an end-to-end network. M-GWCN simultaneously finds intramodality representation by applying the multiscale graph wavelet transform to provide helpful localization properties in the graph domain of each modality and cross-modality representation by learning permutations that encode correlations among various modalities. M-GWCN is not limited to either the homogeneous modalities with the same number of data or any prior knowledge indicating correspondences between modalities. Several semisupervised node classification experiments have been conducted on three popular unimodal explicit graph-based datasets and five multimodal implicit ones. The experimental results indicate the superiority and effectiveness of the proposed methods compared with both spectral graph domain convolutional neural networks and state-of-the-art multimodal methods.
引用
收藏
页码:6991 / 7005
页数:15
相关论文
共 50 条
  • [21] Spatiotemporal Deep-Learning Model With Graph Convolutional Network for Well Logs Prediction
    Feng, Shuo
    Li, Xuegui
    Zeng, Fang
    Hu, Zhongrui
    Sun, Yuhang
    Wang, Zepeng
    Duan, Hanxu
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [22] Automatic Virtual Network Embedding: A Deep Reinforcement Learning Approach With Graph Convolutional Networks
    Yan, Zhongxia
    Ge, Jingguo
    Wu, Yulei
    Li, Liangxiong
    Li, Tong
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2020, 38 (06) : 1040 - 1057
  • [23] Kernel-Weighted Graph Convolutional Network: A Deep Learning Approach for Traffic Forecasting
    Zhang, Qi
    Jin, Qizhao
    Chang, Jianlong
    Xiang, Shiming
    Pan, Chunhong
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1018 - 1023
  • [24] Production Scheduling based on Deep Reinforcement Learning using Graph Convolutional Neural Network
    Seito, Takanari
    Munakata, Satoshi
    [J]. ICAART: PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2020, : 766 - 772
  • [25] Deep geometric convolutional network for automatic modulation classification
    Rundong Li
    Chengtian Song
    Yuxuan Song
    Xiaojun Hao
    Shuyuan Yang
    Xiyu Song
    [J]. Signal, Image and Video Processing, 2020, 14 : 1199 - 1205
  • [26] Sequential Graph Convolutional Network for Active Learning
    Caramalau, Razvan
    Bhattarai, Binod
    Kim, Tae-Kyun
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 9578 - 9587
  • [27] Deep geometric convolutional network for automatic modulation classification
    Li, Rundong
    Song, Chengtian
    Song, Yuxuan
    Hao, Xiaojun
    Yang, Shuyuan
    Song, Xiyu
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2020, 14 (06) : 1199 - 1205
  • [28] Traffic Graph Convolutional Recurrent Neural Network: A Deep Learning Framework for Network-Scale Traffic Learning and Forecasting
    Cui, Zhiyong
    Henrickson, Kristian
    Ke, Ruimin
    Wang, Yinhai
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (11) : 4883 - 4894
  • [29] Multimodal Biometrics Recognition Using a Deep Convolutional Neural Network with Transfer Learning in Surveillance Videos
    Aung, Hsu Mon Lei
    Pluempitiwiriyawej, Charnchai
    Hamamoto, Kazuhiko
    Wangsiripitak, Somkiat
    [J]. COMPUTATION, 2022, 10 (07)
  • [30] Multimodal MRI-based classification of migraine: using deep learning convolutional neural network
    Hao Yang
    Junran Zhang
    Qihong Liu
    Yi Wang
    [J]. BioMedical Engineering OnLine, 17