Cross-Modal Clustering With Deep Correlated Information Bottleneck Method

被引:2
|
作者
Yan, Xiaoqiang [1 ]
Mao, Yiqiao [1 ]
Ye, Yangdong [1 ]
Yu, Hui [2 ]
机构
[1] Zhengzhou Univ, Sch Comp & Artificial Intelligence, Zhengzhou 450052, Peoples R China
[2] Univ Portsmouth, Sch Creat Technol, Portsmouth PO1 2DJ, Hants, England
基金
中国国家自然科学基金;
关键词
deep clustering; information bottleneck (IB); mutual information; MULTIVIEW;
D O I
10.1109/TNNLS.2023.3269789
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cross-modal clustering (CMC) intends to improve the clustering accuracy (ACC) by exploiting the correlations across modalities. Although recent research has made impressive advances, it remains a challenge to sufficiently capture the correlations across modalities due to the high-dimensional nonlinear characteristics of individual modalities and the conflicts in heterogeneous modalities. In addition, the meaningless modality-private information in each modality might become dominant in the process of correlation mining, which also interferes with the clustering performance. To tackle these challenges, we devise a novel deep correlated information bottleneck (DCIB) method, which aims at exploring the correlation information between multiple modalities while eliminating the modality-private information in each modality in an end-to-end manner. Specifically, DCIB treats the CMC task as a two-stage data compression procedure, in which the modality-private information in each modality is eliminated under the guidance of the shared representation of multiple modalities. Meanwhile, the correlations between multiple modalities are preserved from the aspects of feature distributions and clustering assignments simultaneously. Finally, the objective of DCIB is formulated as an objective function based on a mutual information measurement, in which a variational optimization approach is proposed to ensure its convergence. Experimental results on four cross-modal datasets validate the superiority of the DCIB. Code is released at https://github.com/Xiaoqiang-Yan/DCIB.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [1] Deep Mutual Information Maximin for Cross-Modal Clustering
    Mao, Yiqiao
    Yan, Xiaoqiang
    Guo, Qiang
    Ye, Yangdong
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 8893 - 8901
  • [2] Cross-modal Retrieval Based on Deep Correlated Network
    Cai, Guoyong
    Feng, Yaogong
    Lin, Qiang
    [J]. PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 1226 - 1231
  • [3] Deep cross-modal subspace clustering with Contrastive Neighbour Embedding
    Zhang, Zihao
    Wang, Qianqian
    Pei, Chengquan
    Gao, Quanxue
    [J]. NEUROCOMPUTING, 2024, 576
  • [4] Deep Consistent-Inherent Learning for Cross-Modal Subspace Clustering
    Yuzhuo Feng
    Demin Zhou
    [J]. Guidance,Navigation and Control., 2024, (03) - 146
  • [5] Deep Cross-Modal Hashing
    Jiang, Qing-Yuan
    Li, Wu-Jun
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3270 - 3278
  • [6] Cross-Modal Subspace Clustering via Deep Canonical Correlation Analysis
    Gao, Quanxue
    Lian, Huanhuan
    Wang, Qianqian
    Sun, Gan
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 3938 - 3945
  • [7] Deep Consistent-Inherent Learning for Cross-Modal Subspace Clustering
    Feng, Yuzhuo
    Zhou, Demin
    [J]. GUIDANCE NAVIGATION AND CONTROL, 2024,
  • [8] Information Fusion via Deep Cross-Modal Factor Analysis
    Gao, Lei
    Guan, Ling
    [J]. 2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2019,
  • [9] Cross-modal image retrieval with deep mutual information maximization
    Gu, Chunbin
    Bu, Jiajun
    Zhou, Xixi
    Yao, Chengwei
    Ma, Dongfang
    Yu, Zhi
    Yan, Xifeng
    [J]. NEUROCOMPUTING, 2022, 496 : 166 - 177
  • [10] Cross-Modal Commentator: Automatic Machine Commenting Based on Cross-Modal Information
    Yang, Pengcheng
    Zhang, Zhihan
    Luo, Fuli
    Li, Lei
    Huang, Chengyang
    Sun, Xu
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 2680 - 2686