Multiple kernel learning for integrative consensus clustering of omic datasets

被引:11
|
作者
Cabassi, Alessandra [1 ]
Kirk, Paul D. W. [1 ,2 ]
机构
[1] Univ Cambridge, MRC Biostat Unit, Cambridge CB2 0SR, England
[2] Univ Cambridge, Cambridge Inst Therapeut Immunol & Infect Dis, Cambridge CB2 0AW, England
基金
英国医学研究理事会;
关键词
GENOMIC ANALYSES; CLASS DISCOVERY; CANCER; VISUALIZATION; FRAMEWORK; REVEALS; SUBTYPE; BREAST;
D O I
10.1093/bioinformatics/btaa593
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Diverse applications-particularly in tumour subtyping-have demonstrated the importance of integrative clustering techniques for combining information from multiple data sources. Cluster Of Clusters Analysis (COCA) is one such approach that has been widely applied in the context of tumour subtyping. However, the properties of COCA have never been systematically explored, and its robustness to the inclusion of noisy datasets is unclear. Results: We rigorously benchmark COCA, and present Kernel Learning Integrative Clustering (KLIC) as an alternative strategy. KLIC frames the challenge of combining clustering structures as a multiple kernel learning problem, in which different datasets each provide a weighted contribution to the final clustering. This allows the contribution of noisy datasets to be down-weighted relative to more informative datasets. We compare the performances of KLIC and COCA in a variety of situations through simulation studies. We also present the output of KLIC and COCA in real data applications to cancer subtyping and transcriptional module discovery.
引用
收藏
页码:4789 / 4796
页数:8
相关论文
共 50 条
  • [1] Consensus Affinity Graph Learning for Multiple Kernel Clustering
    Ren, Zhenwen
    Yang, Simon X.
    Sun, Quansen
    Wang, Tao
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (06) : 3273 - 3284
  • [2] InterSIM: Simulation tool for multiple integrative 'omic datasets'
    Chalise, Prabhakar
    Raghavan, Rama
    Fridley, Brooke L.
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2016, 128 : 69 - 74
  • [3] Multiple Kernel Clustering with Direct Consensus Graph Learning
    Wang, Yanlong
    Ren, Zhenwen
    [J]. ADVANCES IN INTELLIGENT SYSTEMS AND COMPUTING (ECC 2021), 2022, 268 : 117 - 127
  • [4] Clustering on Multiple Incomplete Datasets via Collective Kernel Learning
    Shao, Weixiang
    Shi, Xiaoxiao
    Yu, Philip S.
    [J]. 2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2013, : 1181 - 1186
  • [5] Consensus local graph for multiple kernel clustering
    Liu, Zheng
    Huang, Shiluo
    Jin, Wei
    Mu, Ying
    [J]. NEUROCOMPUTING, 2024, 602
  • [6] Multiple kernel subspace clustering with local structural graph and low-rank consensus kernel learning
    Ren, Zhenwen
    Li, Haoran
    Yang, Chao
    Sun, Quansen
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 188
  • [7] A multiple kernel density clustering algorithm for incomplete datasets in bioinformatics
    Liao, Longlong
    Li, Kenli
    Li, Keqin
    Yang, Canqun
    Tian, Qi
    [J]. BMC SYSTEMS BIOLOGY, 2018, 12
  • [8] Veto-Consensus Multiple Kernel Learning
    Zhou, Yuxun
    Hu, Ninghang
    Spanos, Costas J.
    [J]. THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 2407 - 2414
  • [9] Multiple Kernel Learning Clustering with an Application to Malware
    Anderson, Blake
    Storlie, Curtis
    Lane, Terran
    [J]. 12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2012), 2012, : 804 - 809
  • [10] Multiple Kernel Learning Based on Cooperative Clustering
    Du, Haiyang
    Yin, Chuanhuan
    Mu, Shaomin
    [J]. INTELLIGENT COMPUTING METHODOLOGIES, 2014, 8589 : 107 - 117