Geometric Multimodal Deep Learning With Multiscaled Graph Wavelet Convolutional Network

Cited by: 0

Authors:

Behmanesh, Maysam [1]

Adibi, Peyman [1]

Ehsani, Sayyed Mohammad Saeed [1]

Chanussot, Jocelyn [2]

Affiliations:

[1] Univ Isfahan, Fac Comp Engn, Artificial Intelligence Dept, Esfahan 8174673441, Iran

[2] Univ Grenoble Alpes, GIPSA Lab, CNRS, Grenoble INP, Grenoble, France

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024, Vol. 35, No. 05

Keywords:

Wavelet transforms; Convolution; Wavelet domain; Manifolds; Learning systems; Laplace equations; Deep learning; Geometric deep learning; graph convolution neural networks; graph wavelet transform; multimodal learning; spectral approaches; HYPERSPECTRAL IMAGE CLASSIFICATION; CLASSIFIERS; ATTENTION;

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Multimodal data provide complementary information of a natural phenomenon by integrating data from various domains with very different statistical properties. Capturing the intramodality and cross-modality information of multimodal data is the essential capability of multimodal learning methods. The geometry-aware data analysis approaches provide these capabilities by implicitly representing data in various modalities based on their geometric underlying structures. Also, in many applications, data are explicitly defined on an intrinsic geometric structure. Generalizing deep learning methods to the non-Euclidean domains is an emerging research field, which has recently been investigated in many studies. Most of those popular methods are developed for unimodal data. In this article, a multimodal graph wavelet convolutional network (M-GWCN) is proposed as an end-to-end network. M-GWCN simultaneously finds intramodality representation by applying the multiscale graph wavelet transform to provide helpful localization properties in the graph domain of each modality and cross-modality representation by learning permutations that encode correlations among various modalities. M-GWCN is not limited to either the homogeneous modalities with the same number of data or any prior knowledge indicating correspondences between modalities. Several semisupervised node classification experiments have been conducted on three popular unimodal explicit graph-based datasets and five multimodal implicit ones. The experimental results indicate the superiority and effectiveness of the proposed methods compared with both spectral graph domain convolutional neural networks and state-of-the-art multimodal methods.


Pages: 6991 - 7005

Page count: 15
