Integrated Heterogeneous Graph Attention Network for Incomplete Multi-modal Clustering

被引:1
|
作者
Wang, Yu [1 ,2 ,3 ]
Yao, Xinjie [1 ,2 ,3 ]
Zhu, Pengfei [1 ,2 ,3 ]
Li, Weihao [4 ]
Cao, Meng [1 ]
Hu, Qinghua [1 ,2 ,3 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
[2] Minist Educ Peoples Republ China, Engn Res Ctr City Intelligence & Digital Governanc, Tianjin 300072, Peoples R China
[3] Haihe Lab ITAI, Tianjin, Peoples R China
[4] Boston Univ, Dept Comp Sci, Boston, MA USA
基金
中国国家自然科学基金;
关键词
Incomplete multi-modal clustering; Integrated heterogeneous graph; Graph attention;
D O I
10.1007/s11263-024-02066-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Incomplete multi-modal clustering (IMmC) is challenging due to the unexpected missing of some modalities in data. A key to this problem is to explore complementarity information among different samples with incomplete information of unpaired data. Despite preliminary progress, existing methods suffer from (1) relying heavily on paired data, and (2) difficulty in mining complementarity on data with high missing rates. To address the problems, we propose a novel method, Integrated Heterogeneous Graph ATtention (IHGAT) network, for IMmC. To fully exploit the complementarity among different samples and modalities, we first construct a set of integrated heterogeneous graphs based on the similarity graph learned from unified latent representations and the modality-specific availability graphs formed by the existing relations of different samples. Thereafter, the attention mechanism is applied to the constructed integrated heterogeneous graph to aggregate the embedded content of heterogeneous neighbors for each node. In this way, the representations of missing modalities can be learned based on the complementarity information of other samples and their other modalities. Finally, the consistency of probability distribution is embedded into the network for clustering. Consequently, the proposed method can form a complete latent space where incomplete information can be supplemented by other related samples via the learned intrinsic structure. Extensive experiments on eight public datasets show that the proposed IHGAT outperforms existing methods under various settings and is typically more robust in cases of high missing rates.
引用
收藏
页码:3847 / 3866
页数:20
相关论文
共 50 条
  • [21] A joint hierarchical cross-attention graph convolutional network for multi-modal facial expression recognition
    Xu, Chujie
    Du, Yong
    Wang, Jingzi
    Zheng, Wenjie
    Li, Tiejun
    Yuan, Zhansheng
    [J]. COMPUTATIONAL INTELLIGENCE, 2024, 40 (01)
  • [22] HetEmotionNet: Two-Stream Heterogeneous Graph Recurrent Neural Network for Multi-modal Emotion Recognition
    Jia, Ziyu
    Lin, Youfang
    Wang, Jing
    Feng, Zhiyang
    Xie, Xiangheng
    Chen, Caijie
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1047 - 1056
  • [23] Heterogeneous Spatiotemporal Graph Convolution Network for Multi-Modal Wind-PV Power Collaborative Prediction
    Li, Zhuo
    Ye, Lin
    Song, Xuri
    Luo, Yadi
    Pei, Ming
    Wang, Kaifeng
    Yu, Yijun
    Tang, Yong
    [J]. IEEE TRANSACTIONS ON POWER SYSTEMS, 2024, 39 (04) : 5591 - 5608
  • [24] Dense Attention Memory Network for Multi-modal emotion recognition
    Ma, Gailing
    Guo, Xiao
    [J]. 2022 5TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND NATURAL LANGUAGE PROCESSING, MLNLP 2022, 2022, : 48 - 53
  • [25] Fusing heterogeneous information for multi-modal attributed network embedding
    Yang Jieyi
    Zhu Feng
    Dong Yihong
    Qian Jiangbo
    [J]. Applied Intelligence, 2023, 53 : 22328 - 22347
  • [26] Fusing heterogeneous information for multi-modal attributed network embedding
    Yang, Jieyi
    Zhu, Feng
    Dong, Yihong
    Qian, Jiangbo
    [J]. APPLIED INTELLIGENCE, 2023, 53 (19) : 22328 - 22347
  • [27] MAGDRA: A Multi-modal Attention Graph Network with Dynamic Routing-By-Agreement for multi-label emotion recognition
    Li, Xingye
    Liu, Jin
    Xie, Yurong
    Gong, Peizhu
    Zhang, Xiliang
    He, Huihua
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 283
  • [28] Multi-Modal Graph Interaction for Multi-Graph Convolution Network in Urban Spatiotemporal Forecasting
    Zhang, Lingyu
    Geng, Xu
    Qin, Zhiwei
    Wang, Hongjun
    Wang, Xiao
    Zhang, Ying
    Liang, Jian
    Wu, Guobin
    Song, Xuan
    Wang, Yunhai
    [J]. SUSTAINABILITY, 2022, 14 (19)
  • [29] MBIAN: Multi-level bilateral interactive attention network for multi-modal
    Sun, Kai
    Zhang, Jiangshe
    Wang, Jialin
    Xu, Shuang
    Zhang, Chunxia
    Hu, Junying
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 231
  • [30] Hierarchical graph attention networks for multi-modal rumor detection on social media
    Xu, Fan
    Zeng, Lei
    Huang, Qi
    Yan, Keyu
    Wang, Mingwen
    Sheng, Victor S.
    [J]. NEUROCOMPUTING, 2024, 569