Strategic Multi-Omics Data Integration via Multi-Level Feature Contrasting and Matching

被引:2
|
作者
Zhang, Jinli [1 ]
Ren, Hongwei [1 ]
Jiang, Zongli [1 ]
Chen, Zheng [2 ]
Yang, Ziwei [3 ]
Matsubara, Yasuko [2 ]
Sakurai, Yasushi [2 ]
机构
[1] Beijing Univ Technol, Dept Comp Sci, Beijing 100022, Peoples R China
[2] Osaka Univ, Inst Sci & Ind Res, Suita, Osaka 5650871, Japan
[3] Kyoto Univ, Bioinformat Ctr, Kyoto 6158540, Japan
基金
日本学术振兴会; 中国国家自然科学基金; 日本科学技术振兴机构;
关键词
Multi-omics; clustering; contrastive learning; self-attention;
D O I
10.1109/TNB.2024.3456797
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The analysis and comprehension of multi-omics data has emerged as a prominent topic in the field of bioinformatics and data science. However, the sparsity characteristics and high dimensionality of omics data pose difficulties in terms of extracting meaningful information. Moreover, the heterogeneity inherent in multiple omics sources makes the effective integration of multi-omics data challenging To tackle these challenges, we propose MFCC-SAtt, a multi-level feature contrast clustering model based on self-attention to extract informative features from multi-omics data. MFCC-SAtt treats each omics type as a distinct modality and employs autoencoders with self-attention for each modality to integrate and compress their respective features into a shared feature space. By utilizing a multi-level feature extraction framework along with incorporating a semantic information extractor, we mitigate optimization conflicts arising from different learning objectives. Additionally, MFCC-SAtt guides deep clustering based on multi-level features which further enhances the quality of output labels. By conducting extensive experiments on multi-omics data, we have validated the exceptional performance of MFCC-SAtt. For instance, in a pan-cancer clustering task, MFCC-SAtt achieved an accuracy of over 80.38%.
引用
收藏
页码:579 / 590
页数:12
相关论文
共 50 条
  • [41] A guide to multi-omics data collection and integration for translational medicine
    Athieniti, Efi
    Spyrou, George M.
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2023, 21 : 134 - 149
  • [42] Multi-omics data integration shines a light on the renal medulla
    Hodgin, Jeffrey B.
    Smith, Cathy
    Kretzler, Matthias
    KIDNEY INTERNATIONAL, 2024, 105 (02) : 242 - 244
  • [43] Data integration and predictive modeling methods for multi-omics datasets
    Kim, Minseung
    Tagkopoulos, Ilias
    MOLECULAR OMICS, 2018, 14 (01) : 8 - 25
  • [44] Integration of multi-omics data for survival prediction of lung adenocarcinoma
    Guo, Dingjie
    Wang, Yixian
    Chen, Jing
    Liu, Xin
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2024, 250
  • [45] A roadmap for multi-omics data integration using deep learning
    Kang, Mingon
    Ko, Euiseong
    Mersha, Tesfaye B.
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (01)
  • [46] timeOmics: an R package for longitudinal multi-omics data integration
    Bodein, Antoine
    Scott-Boyer, Marie-Pier
    Perin, Olivier
    Cao, Kim-Anh Le
    Droit, Arnaud
    BIOINFORMATICS, 2022, 38 (02) : 577 - 579
  • [47] A Variational Information Bottleneck Approach to Multi-Omics Data Integration
    Lee, Changhee
    van der Schaar, Mihaela
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [48] Cooperative integration of spatially resolved multi-omics data with COSMOS
    Zhou, Yuansheng
    Xiao, Xue
    Dong, Lei
    Tang, Chen
    Xiao, Guanghua
    Xu, Lin
    NATURE COMMUNICATIONS, 2025, 16 (01)
  • [49] A guide to multi-omics data collection and integration for translational medicine
    Athieniti, Efi
    Spyrou, George M.
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2023, 21 : 134 - 149
  • [50] MOMIC: A Multi-Omics Pipeline for Data Analysis, Integration and Interpretation
    Madrid-Marquez, Laura
    Rubio-Escudero, Cristina
    Pontes, Beatriz
    Gonzalez-Perez, Antonio
    Riquelme, Jose C.
    Saez, Maria E.
    APPLIED SCIENCES-BASEL, 2022, 12 (08):