Cancer subtyping with heterogeneous multi-omics data via hierarchical multi -kernel learning

被引:5
|
作者
Wei, Yifang [1 ]
Li, Lingmei [1 ]
Zhao, Xin [1 ]
Yang, Haitao [2 ]
Sa, Jian [1 ]
Cao, Hongyan [1 ]
Cui, Yuehua [3 ]
机构
[1] Shanxi Med Univ, Taiyuan, Peoples R China
[2] Hebei Med Univ, Sch Publ Hlth, Shijiazhuang, Peoples R China
[3] Michigan State Univ, Dept Stat & Probabil, E Lansing, MI 48824 USA
基金
中国国家自然科学基金;
关键词
cancer subtyping; data heterogeneity; kernel fusion; multi-omics data integration; hierarchical multi -kernel learning; RENAL-CELL CARCINOMA; LATENT VARIABLE MODEL; BREAST-CANCER; GENOMIC DATA; MICRORNA;
D O I
10.1093/bib/bbac488
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Differentiating cancer subtypes is crucial to guide personalized treatment and improve the prognosis for patients. Integrating multiomics data can offer a comprehensive landscape of cancer biological process and provide promising ways for cancer diagnosis and treatment. Taking the heterogeneity of different omics data types into account, we propose a hierarchical multi -kernel learning (hMKL) approach, a novel cancer molecular subtyping method to identify cancer subtypes by adopting a two -stage kernel learning strategy. In stage 1, we obtain a composite kernel borrowing the cancer integration via multi -kernel learning (CIMLR) idea by optimizing the kernel parameters for individual omics data type. In stage 2, we obtain a final fused kernel through a weighted linear combination of individual kernels learned from stage 1 using an unsupervised multiple kernel learning method. Based on the final fusion kernel, k -means clustering is applied to identify cancer subtypes. Simulation studies show that hMKL outperforms the one -stage CIMLR method when there is data heterogeneity. hMKL can estimate the number of clusters correctly, which is the key challenge in subtyping. Application to two real data sets shows that hMKL identified meaningful subtypes and key cancer-associated biomarkers. The proposed method provides a novel toolkit for heterogeneous multi-omics data integration and cancer subtypes identification.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Multi-omics data integration for hepatocellular carcinoma subtyping with multi-kernel learning
    Wang, Jiaying
    Miao, Yuting
    Li, Lingmei
    Wu, Yongqing
    Ren, Yan
    Cui, Yuehua
    Cao, Hongyan
    [J]. FRONTIERS IN GENETICS, 2022, 13
  • [2] MOCSS: Multi-omics data clustering and cancer subtyping via shared and specific representation learning
    Chen, Yuxin
    Wen, Yuqi
    Xie, Chenyang
    Chen, Xinjian
    He, Song
    Bo, Xiaochen
    Zhang, Zhongnan
    [J]. ISCIENCE, 2023, 26 (08)
  • [3] Multi-channel Partial Graph Integration Learning of Partial Multi-omics Data for Cancer Subtyping
    Cao, Qing-Qing
    Zhao, Jian-ping
    Zheng, Chun-Hou
    [J]. CURRENT BIOINFORMATICS, 2023, 18 (08) : 680 - 691
  • [4] Multi-omics clustering for cancer subtyping based on latent subspace learning
    Ye, Xiucai
    Shang, Yifan
    Shi, Tianyi
    Zhang, Weihang
    Sakurai, Tetsuya
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 164
  • [5] Molecular Subtyping of Serous Ovarian Cancer Based on Multi-omics Data
    Zhe Zhang
    Ke Huang
    Chenglei Gu
    Luyang Zhao
    Nan Wang
    Xiaolei Wang
    Dongsheng Zhao
    Chenggang Zhang
    Yiming Lu
    Yuanguang Meng
    [J]. Scientific Reports, 6
  • [6] Deep structure integrative representation of multi-omics data for cancer subtyping
    Yang, Bo
    Yang, Yan
    Su, Xueping
    [J]. BIOINFORMATICS, 2022,
  • [7] Evaluation and comparison of multi-omics data integration methods for cancer subtyping
    Duan, Ran
    Gao, Lin
    Gao, Yong
    Hu, Yuxuan
    Xu, Han
    Huang, Mingfeng
    Song, Kuo
    Wang, Hongda
    Dong, Yongqiang
    Jiang, Chaoqun
    Zhang, Chenxing
    Jia, Songwei
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2021, 17 (08)
  • [8] Deep structure integrative representation of multi-omics data for cancer subtyping
    Yang, Bo
    Yang, Yan
    Su, Xueping
    [J]. BIOINFORMATICS, 2022, 38 (13) : 3337 - 3342
  • [9] MoCLIM: Towards Accurate Cancer Subtyping via Multi-Omics Contrastive Learning with Omics-Inference Modeling
    Yang, Ziwei
    Chen, Zheng
    Matsubara, Yasuko
    Sakurai, Yasushi
    [J]. PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 2895 - 2905
  • [10] Molecular Subtyping of Serous Ovarian Cancer Based on Multi-omics Data
    Zhang, Zhe
    Huang, Ke
    Gu, Chenglei
    Zhao, Luyang
    Wang, Nan
    Wang, Xiaolei
    Zhao, Dongsheng
    Zhang, Chenggang
    Lu, Yiming
    Meng, Yuanguang
    [J]. SCIENTIFIC REPORTS, 2016, 6