Integrated Heterogeneous Graph Attention Network for Incomplete Multi-modal Clustering

被引:1
|
作者
Wang, Yu [1 ,2 ,3 ]
Yao, Xinjie [1 ,2 ,3 ]
Zhu, Pengfei [1 ,2 ,3 ]
Li, Weihao [4 ]
Cao, Meng [1 ]
Hu, Qinghua [1 ,2 ,3 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
[2] Minist Educ Peoples Republ China, Engn Res Ctr City Intelligence & Digital Governanc, Tianjin 300072, Peoples R China
[3] Haihe Lab ITAI, Tianjin, Peoples R China
[4] Boston Univ, Dept Comp Sci, Boston, MA USA
基金
中国国家自然科学基金;
关键词
Incomplete multi-modal clustering; Integrated heterogeneous graph; Graph attention;
D O I
10.1007/s11263-024-02066-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Incomplete multi-modal clustering (IMmC) is challenging due to the unexpected missing of some modalities in data. A key to this problem is to explore complementarity information among different samples with incomplete information of unpaired data. Despite preliminary progress, existing methods suffer from (1) relying heavily on paired data, and (2) difficulty in mining complementarity on data with high missing rates. To address the problems, we propose a novel method, Integrated Heterogeneous Graph ATtention (IHGAT) network, for IMmC. To fully exploit the complementarity among different samples and modalities, we first construct a set of integrated heterogeneous graphs based on the similarity graph learned from unified latent representations and the modality-specific availability graphs formed by the existing relations of different samples. Thereafter, the attention mechanism is applied to the constructed integrated heterogeneous graph to aggregate the embedded content of heterogeneous neighbors for each node. In this way, the representations of missing modalities can be learned based on the complementarity information of other samples and their other modalities. Finally, the consistency of probability distribution is embedded into the network for clustering. Consequently, the proposed method can form a complete latent space where incomplete information can be supplemented by other related samples via the learned intrinsic structure. Extensive experiments on eight public datasets show that the proposed IHGAT outperforms existing methods under various settings and is typically more robust in cases of high missing rates.
引用
收藏
页码:3847 / 3866
页数:20
相关论文
共 50 条
  • [1] Adversarial Graph Attention Network for Multi-modal Cross-modal Retrieval
    Wu, Hongchang
    Guan, Ziyu
    Zhi, Tao
    zhao, Wei
    Xu, Cai
    Han, Hong
    Yang, Yarning
    [J]. 2019 10TH IEEE INTERNATIONAL CONFERENCE ON BIG KNOWLEDGE (ICBK 2019), 2019, : 265 - 272
  • [2] Graph Convolutional Incomplete Multi-modal Hashing
    Shen, Xiaobo
    Chen, Yinfan
    Pan, Shirui
    Liu, Weiwei
    Zheng, Yuhui
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 7029 - 7037
  • [3] Heterogeneous-Grained Multi-Modal Graph Network for Outfit Recommendation
    Xu, Rucong
    Wang, Jianfeng
    Li, Yun
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (02): : 1788 - 1799
  • [4] Corporate Relative Valuation Using Heterogeneous Multi-Modal Graph Neural Network
    Yang, Yang
    Yang, Jia-Qi
    Bao, Ran
    Zhan, De-Chuan
    Zhu, Hengshu
    Gao, Xiao-Ru
    Xiong, Hui
    Yang, Jian
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (01) : 211 - 224
  • [5] Hyper-node Relational Graph Attention Network for Multi-modal Knowledge Graph Completion
    Liang, Shuang
    Zhu, Anjie
    Zhang, Jiasheng
    Shao, Jie
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (02)
  • [6] MIA-Net: Multi-Modal Interactive Attention Network for Multi-Modal Affective Analysis
    Li, Shuzhen
    Zhang, Tong
    Chen, Bianna
    Chen, C. L. Philip
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (04) : 2796 - 2809
  • [7] Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering
    Xia, Wei
    Wang, Tianxiu
    Gao, Quanxue
    Yang, Ming
    Gao, Xinbo
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1170 - 1183
  • [8] Heterogeneous Graph Learning for Multi-Modal Medical Data Analysis
    Kim, Sein
    Lee, Namkyeong
    Lee, Junseok
    Hyun, Dongmin
    Park, Chanyoung
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 4, 2023, : 5141 - 5150
  • [9] Semi-Supervised Multi-Modal Clustering and Classification with Incomplete Modalities
    Yang, Yang
    Zhan, De-Chuan
    Wu, Yi-Feng
    Liu, Zhi-Bin
    Xiong, Hui
    Jiang, Yuan
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (02) : 682 - 695
  • [10] Flexible Dual Multi-Modal Hashing for Incomplete Multi-Modal Retrieval
    Wei, Yuhong
    An, Junfeng
    [J]. INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2024,