A new correlation clustering method for cancer mutation analysis

被引:11
|
作者
Hou, Jack P. [1 ,2 ]
Emad, Amin [3 ,4 ]
Puleo, Gregory J. [3 ,4 ]
Ma, Jian [1 ,5 ,6 ]
Milenkovic, Olgica [3 ,4 ]
机构
[1] Univ Illinois, Dept Bioengn, Urbana, IL 61801 USA
[2] Univ Illinois, Med Scholars Program, Urbana, IL 61801 USA
[3] Univ Illinois, Dept Elect & Comp Engn, Urbana, IL 61801 USA
[4] Univ Illinois, Coordinated Sci Lab, Urbana, IL 61801 USA
[5] Univ Illinois, Carl R Woese Inst Genom Biol, Urbana, IL 61801 USA
[6] Carnegie Mellon Univ, Sch Comp Sci, Computat Biol Dept, Pittsburgh, PA 15213 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
WHITNEY U-TEST; DRIVER PATHWAYS; MUTUAL EXCLUSIVITY; SOMATIC MUTATIONS; IDENTIFICATION; IMPACT; GENES; COMBINATIONS; EXPRESSION; DISCOVERY;
D O I
10.1093/bioinformatics/btw546
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Cancer genomes exhibit a large number of different alterations that affect many genes in a diverse manner. An improved understanding of the generative mechanisms behind the mutation rules and their influence on gene community behavior is of great importance for the study of cancer. Results: To expand our capability to analyze combinatorial patterns of cancer alterations, we developed a rigorous methodology for cancer mutation pattern discovery based on a new, constrained form of correlation clustering. Our new algorithm, named C-3 (Cancer Correlation Clustering), leverages mutual exclusivity of mutations, patient coverage and driver network concentration principles. To test C-3, we performed a detailed analysis on TCGA breast cancer and glioblastoma data and showed that our algorithm outperforms the state-of-the-art CoMEt method in terms of discovering mutually exclusive gene modules and identifying biologically relevant driver genes. The proposed agnostic clustering method represents a unique tool for efficient and reliable identification of mutation patterns and driver pathways in large-scale cancer genomics studies, and it may also be used for other clustering problems on biological graphs. Availability and Implementation: The source code for the C-3 method can be found at https://github.com/jackhou2/C-3 Contacts: jianma@cs.cmu.eduormilenkov@illinois.edu Supplementary information: Supplementary data are available at Bioinformatics online.
引用
收藏
页码:3717 / 3728
页数:12
相关论文
共 50 条
  • [41] A meta-search method with clustering and term correlation
    Zhao, DJ
    Lee, DL
    Luo, Q
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2004, 2973 : 543 - 553
  • [42] HER2 somatic mutation analysis in breast cancer: correlation with clinicopathological features
    Ding, Qingqing
    Chen, Hui
    Lim, Bora
    Damodaran, Senthil
    Chen, Weiwei
    Tripathy, Debu
    Piha-Paul, Sarina
    Luthra, Rajyalakshmi
    Meric-Bernstam, Funda
    Sahin, Aysegul A.
    [J]. HUMAN PATHOLOGY, 2019, 92 : 32 - 38
  • [43] HER2 Somatic Mutation Analysis in Breast Cancer: Correlation with Clinicopathologic Features
    Ding, Qingqing
    Chen, Hui
    Lim, Bora
    Damodaran, Senthil
    Chen, Weiwei
    Tripathy, Debu
    Piha-Paul, Sarina A.
    Luthra, Rajyalakshmi
    Broaddus, Russell R.
    Meric-Bernstam, Funda
    Sahin, Aysegul A.
    [J]. LABORATORY INVESTIGATION, 2017, 97 : 39A - 39A
  • [44] HER2 Somatic Mutation Analysis in Breast Cancer: Correlation with Clinicopathologic Features
    Ding, Qingqing
    Chen, Hui
    Lim, Bora
    Damodaran, Senthil
    Chen, Weiwei
    Tripathy, Debu
    Piha-Paul, Sarnia A.
    Luthra, Rajyalakshmi
    Broaddus, Russell R.
    Meric-Bernstam, Funda
    Sahin, Aysegul A.
    [J]. MODERN PATHOLOGY, 2017, 30 : 39A - 39A
  • [45] A new correlation analysis method used in behavioural science research
    Chou, Jyh-Rong
    [J]. INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS & STATISTICS, 2013, 39 (09): : 105 - 129
  • [46] A New Dimensionality Reduction Method with Correlation Analysis and Universum Learning
    Chen X.
    Wang L.
    [J]. Pattern Recognition and Image Analysis, 2018, 28 (2) : 174 - 184
  • [47] A new correlation analysis method used in behavioural science research
    [J]. Chou, J.R. (jrchou@isu.edu.tw), 1600, CESER Publications, Post Box No. 113, Roorkee, 247667, India (39):
  • [48] Factor evaluation model based on entropy method and spearman correlation analysis and ISODATA clustering algorithm
    Wang, Ziming
    Sun, Chen
    Liu, Zewei
    Liu, Haijing
    [J]. ENVIRONMENTAL SCIENCE AND INFORMATION APPLICATION TECHNOLOGY, 2015, : 297 - 302
  • [49] Network Loss Allocation Method Based on Improved K-means Clustering And Correlation Analysis
    Chen Xinxin
    Liu Wenying
    Wang Fangyu
    Liu Fuchao
    Wang Geng
    [J]. 2020 5TH ASIA CONFERENCE ON POWER AND ELECTRICAL ENGINEERING (ACPEE 2020), 2020, : 37 - 41
  • [50] Prediction method of wind farm power generation capacity based on feature clustering and correlation analysis
    Wang, Yajun
    Wang, Jidong
    Cao, Man
    Li, Weixun
    Yuan, Long
    Wang, Ning
    [J]. ELECTRIC POWER SYSTEMS RESEARCH, 2022, 212