Cross-Modal Clustering With Deep Correlated Information Bottleneck Method

被引:2
|
作者
Yan, Xiaoqiang [1 ]
Mao, Yiqiao [1 ]
Ye, Yangdong [1 ]
Yu, Hui [2 ]
机构
[1] Zhengzhou Univ, Sch Comp & Artificial Intelligence, Zhengzhou 450052, Peoples R China
[2] Univ Portsmouth, Sch Creat Technol, Portsmouth PO1 2DJ, Hants, England
基金
中国国家自然科学基金;
关键词
deep clustering; information bottleneck (IB); mutual information; MULTIVIEW;
D O I
10.1109/TNNLS.2023.3269789
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cross-modal clustering (CMC) intends to improve the clustering accuracy (ACC) by exploiting the correlations across modalities. Although recent research has made impressive advances, it remains a challenge to sufficiently capture the correlations across modalities due to the high-dimensional nonlinear characteristics of individual modalities and the conflicts in heterogeneous modalities. In addition, the meaningless modality-private information in each modality might become dominant in the process of correlation mining, which also interferes with the clustering performance. To tackle these challenges, we devise a novel deep correlated information bottleneck (DCIB) method, which aims at exploring the correlation information between multiple modalities while eliminating the modality-private information in each modality in an end-to-end manner. Specifically, DCIB treats the CMC task as a two-stage data compression procedure, in which the modality-private information in each modality is eliminated under the guidance of the shared representation of multiple modalities. Meanwhile, the correlations between multiple modalities are preserved from the aspects of feature distributions and clustering assignments simultaneously. Finally, the objective of DCIB is formulated as an objective function based on a mutual information measurement, in which a variational optimization approach is proposed to ensure its convergence. Experimental results on four cross-modal datasets validate the superiority of the DCIB. Code is released at https://github.com/Xiaoqiang-Yan/DCIB.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [21] Contrastive cross-modal clustering with twin network
    Mao, Yiqiao
    Yan, Xiaoqiang
    Hu, Shizhe
    Ye, Yangdong
    [J]. PATTERN RECOGNITION, 2024, 155
  • [22] Cross-modal Scalable Hyperbolic Hierarchical Clustering
    Long, Teng
    van Noord, Nanne
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 16609 - 16618
  • [23] INFORMATION COMPLEXITY AND CROSS-MODAL FUNCTIONS
    FREIDES, D
    [J]. BRITISH JOURNAL OF PSYCHOLOGY, 1975, 66 (AUG) : 283 - 287
  • [24] Cross-Modal Retrieval Using Deep De-correlated Subspace Ranking Hashing
    Joslyn, Kevin
    Li, Kai
    Hua, Kien A.
    [J]. ICMR '18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2018, : 55 - 63
  • [25] Clustering-driven Deep Adversarial Hashing for scalable unsupervised cross-modal retrieval
    Shen, Xiao
    Zhang, Haofeng
    Li, Lunbo
    Zhang, Zheng
    Chen, Debao
    Liu, Li
    [J]. NEUROCOMPUTING, 2021, 459 : 152 - 164
  • [26] Enhancing Stock Price Prediction with Deep Cross-Modal Information Fusion Network
    Mandal, Rabi Chandra
    Kler, Rajnish
    Tiwari, Anil
    Keshta, Ismail
    Abonazel, Mohamed R.
    Tageldin, Elsayed M.
    Umaralievich, Mekhmonov Sultonali
    [J]. FLUCTUATION AND NOISE LETTERS, 2024, 23 (02):
  • [27] DEEP PAIRWISE RANKING WITH MULTI-LABEL INFORMATION FOR CROSS-MODAL RETRIEVAL
    Jian, Yangwo
    Xiao, Jing
    Cao, Yang
    Khan, Asad
    Zhu, Jia
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1810 - 1815
  • [28] Learning Disentangled Representation for Cross-Modal Retrieval with Deep Mutual Information Estimation
    Guo, Weikuo
    Huang, Huaibo
    Kong, Xiangwei
    He, Ran
    [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1712 - 1720
  • [29] Cross-Modal Retrieval Using Deep Learning
    Malik, Shaily
    Bhardwaj, Nikhil
    Bhardwaj, Rahul
    Kumar, Saurabh
    [J]. PROCEEDINGS OF THIRD DOCTORAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE, DOSCI 2022, 2023, 479 : 725 - 734
  • [30] Deep Structured Cross-Modal Anomaly Detection
    Li, Yuening
    Liu, Ninghao
    Li, Jundong
    Du, Mengnan
    Hu, Xia
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,