A new correlation clustering method for cancer mutation analysis

被引:11
|
作者
Hou, Jack P. [1 ,2 ]
Emad, Amin [3 ,4 ]
Puleo, Gregory J. [3 ,4 ]
Ma, Jian [1 ,5 ,6 ]
Milenkovic, Olgica [3 ,4 ]
机构
[1] Univ Illinois, Dept Bioengn, Urbana, IL 61801 USA
[2] Univ Illinois, Med Scholars Program, Urbana, IL 61801 USA
[3] Univ Illinois, Dept Elect & Comp Engn, Urbana, IL 61801 USA
[4] Univ Illinois, Coordinated Sci Lab, Urbana, IL 61801 USA
[5] Univ Illinois, Carl R Woese Inst Genom Biol, Urbana, IL 61801 USA
[6] Carnegie Mellon Univ, Sch Comp Sci, Computat Biol Dept, Pittsburgh, PA 15213 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
WHITNEY U-TEST; DRIVER PATHWAYS; MUTUAL EXCLUSIVITY; SOMATIC MUTATIONS; IDENTIFICATION; IMPACT; GENES; COMBINATIONS; EXPRESSION; DISCOVERY;
D O I
10.1093/bioinformatics/btw546
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Cancer genomes exhibit a large number of different alterations that affect many genes in a diverse manner. An improved understanding of the generative mechanisms behind the mutation rules and their influence on gene community behavior is of great importance for the study of cancer. Results: To expand our capability to analyze combinatorial patterns of cancer alterations, we developed a rigorous methodology for cancer mutation pattern discovery based on a new, constrained form of correlation clustering. Our new algorithm, named C-3 (Cancer Correlation Clustering), leverages mutual exclusivity of mutations, patient coverage and driver network concentration principles. To test C-3, we performed a detailed analysis on TCGA breast cancer and glioblastoma data and showed that our algorithm outperforms the state-of-the-art CoMEt method in terms of discovering mutually exclusive gene modules and identifying biologically relevant driver genes. The proposed agnostic clustering method represents a unique tool for efficient and reliable identification of mutation patterns and driver pathways in large-scale cancer genomics studies, and it may also be used for other clustering problems on biological graphs. Availability and Implementation: The source code for the C-3 method can be found at https://github.com/jackhou2/C-3 Contacts: jianma@cs.cmu.eduormilenkov@illinois.edu Supplementary information: Supplementary data are available at Bioinformatics online.
引用
收藏
页码:3717 / 3728
页数:12
相关论文
共 50 条
  • [1] A new machine learning method for cancer mutation analysis
    Habibi, Mahnaz
    Taheri, Golnaz
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (10)
  • [2] A new clustering method for microarray data analysis
    Zhang, LX
    Zhu, S
    [J]. CSB2002: IEEE COMPUTER SOCIETY BIOINFORMATICS CONFERENCE, 2002, : 268 - 275
  • [3] Adaptive Missing Texture Reconstruction Method Based on Kernel Canonical Correlation Analysis with a New Clustering Scheme
    Ogawa, Takahiro
    Haseyama, Miki
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2009, E92A (08) : 1950 - 1960
  • [4] DMCM: a Data-adaptive Mutation Clustering Method to identify cancer-related mutation clusters
    Lu, Xinguo
    Qian, Xin
    Li, Xing
    Miao, Qiumai
    Peng, Shaoliang
    [J]. BIOINFORMATICS, 2019, 35 (03) : 389 - 397
  • [5] Hierarchical clustering of the correlation patterns: New method of domain identification in proteins
    Yesylevskyy, SO
    Kharkyanen, VN
    Demchenko, AP
    [J]. BIOPHYSICAL CHEMISTRY, 2006, 119 (01) : 84 - 93
  • [6] An adaptive alert correlation method based on pattern mining and clustering analysis
    Tian, Zhihong
    Zhang, Yongzheng
    Zhang, Weizhe
    Li, Yang
    Ye, Jianwei
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2009, 46 (08): : 1304 - 1315
  • [7] A New Clustering Method for the Distribution Analysis of Hearing Neurons
    Cheng, Kuo-Sheng
    Pan, Shih-Ming
    [J]. WORLD CONGRESS ON MEDICAL PHYSICS AND BIOMEDICAL ENGINEERING 2006, VOL 14, PTS 1-6, 2007, 14 : 2296 - +
  • [8] Clustering analysis of microarray gene expression data with new clustering ensemble method
    Luo, Fei
    Liu, Juan
    [J]. PROGRESS IN INTELLIGENCE COMPUTATION AND APPLICATIONS, PROCEEDINGS, 2007, : 500 - 504
  • [9] New correlation analysis method for nonstationary signal
    Li, Zuojin
    Shi, Weiren
    Wang, Kai
    Sun, Xiaowei
    Zhong, Zhi
    [J]. WSEAS Transactions on Information Science and Applications, 2008, 5 (12): : 1677 - 1686
  • [10] A correlation-matrix-based hierarchical clustering method for functional connectivity analysis
    Liu, Xiao
    Zhu, Xiao-Hong
    Qiu, Peihua
    Chen, Wei
    [J]. JOURNAL OF NEUROSCIENCE METHODS, 2012, 211 (01) : 94 - 102