Medoidshift clustering applied to genomic bulk tumor data

被引:3
|
作者
Roman, Theodore [1 ,2 ]
Xie, Lu [1 ,2 ]
Schwartz, Russell [1 ,3 ]
机构
[1] Carnegie Mellon Univ, Sch Comp Sci, Computat Biol Dept, 5000 Forbes Ave, Pittsburgh, PA 15213 USA
[2] Joint Carnegie Mellon Univ Pittsburgh, PhD Program Computat Biol, 5000 Forbes Ave, Pittsburgh, PA 15213 USA
[3] Carnegie Mellon Univ, Mellon Coll Sci, Dept Biol Sci, 4400 Fifth Ave, Pittsburgh, PA 15213 USA
来源
BMC GENOMICS | 2016年 / 17卷
关键词
Computational biology; Clustering; Tumor; Heterogeneity; INTRATUMOR HETEROGENEITY; SEQUENCING REVEALS; GENETIC-ANALYSIS; WHOLE-GENOME; IN-SITU; CELL; EVOLUTION; EXPRESSION; MUTATIONS; CARCINOMA;
D O I
10.1186/s12864-015-2302-x
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Despite the enormous medical impact of cancers and intensive study of their biology, detailed characterization of tumor growth and development remains elusive. This difficulty occurs in large part because of enormous heterogeneity in the molecular mechanisms of cancer progression, both tumor-to-tumor and cell-to-cell in single tumors. Advances in genomic technologies, especially at the single-cell level, are improving the situation, but these approaches are held back by limitations of the biotechnologies for gathering genomic data from heterogeneous cell populations and the computational methods for making sense of those data. One popular way to gain the advantages of whole-genome methods without the cost of single-cell genomics has been the use of computational deconvolution (unmixing) methods to reconstruct clonal heterogeneity from bulk genomic data. These methods, too, are limited by the difficulty of inferring genomic profiles of rare or subtly varying clonal subpopulations from bulk data, a problem that can be computationally reduced to that of reconstructing the geometry of point clouds of tumor samples in a genome space. Here, we present a new method to improve that reconstruction by better identifying subspaces corresponding to tumors produced from mixtures of distinct combinations of clonal subpopulations. We develop a nonparametric clustering method based on medoidshift clustering for identifying subgroups of tumors expected to correspond to distinct trajectories of evolutionary progression. We show on synthetic and real tumor copy-number data that this new method substantially improves our ability to resolve discrete tumor subgroups, a key step in the process of accurately deconvolving tumor genomic data and inferring clonal heterogeneity from bulk data.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Feature selection for genomic data sets through feature clustering
    Zheng, Fengbin
    Shen, Xiajiong
    Fu, Zhengye
    Zheng, Shanshan
    Li, Guangrong
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2010, 4 (02) : 228 - 240
  • [32] Comparative Analysis of Genomic Signal Processing for Microarray Data Clustering
    Istepanian, Robert S. H.
    Sungoor, Ala
    Nebel, Jean-Christophe
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2011, 10 (04) : 225 - 238
  • [33] SUBTYPING GLIOBLASTOMA FROM JOINT CLUSTERING OF IMAGING AND GENOMIC DATA
    Guo, Jun
    Fathi-Kazerooni, Anahita
    Toorens, Erik
    Akbari, Hamed
    Yu, Fanyang
    Matsumoto, Yuji
    Sako, Chiharu
    Mamourian, Elizabeth
    Shinohara, Russell
    Koumenis, Constantinos
    Verginadis, Ioannis
    Bagley, Stephen
    Morrissette, Jennifer
    Binder, Zev
    Brem, Steven
    Mohan, Suyash
    Lustig, Robert
    O'Rourke, Donald
    Ganguly, Tapan
    Bakas, Spyridon
    Aboian, Mariam
    Nasrallah, Maclean
    Davatzikos, Christos
    NEURO-ONCOLOGY, 2024, 26
  • [34] An artificial immune network model applied to data clustering and classification
    Zhang, Chenggong
    Yi, Zhang
    ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 2, PROCEEDINGS, 2007, 4492 : 526 - +
  • [35] Chemodynamical Clustering Applied to APOGEE Data: Rediscovering Globular Clusters
    Chen, Boquan
    D'Onghia, Elena
    Pardy, Stephen A.
    Pasquali, Anna
    Motta, Clio Bertelli
    Hanlon, Bret
    Grebel, Eva K.
    ASTROPHYSICAL JOURNAL, 2018, 860 (01):
  • [36] Clustering Algorithm Optimization Applied to Metagenomics Using Big Data
    Vanegas, Julian
    Bonet, Isis
    INFORMATION AND COMMUNICATION TECHNOLOGIES OF ECUADOR (TIC.EC), 2019, 884 : 182 - 192
  • [37] Text data mining applied to clustering with cost effective tools
    Moebes, TA
    INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOL 1-4, PROCEEDINGS, 2005, : 2787 - 2794
  • [38] THE MIXTURE METHOD OF CLUSTERING APPLIED TO 3-WAY DATA
    BASFORD, KE
    MCLACHLAN, GJ
    JOURNAL OF CLASSIFICATION, 1985, 2 (01) : 109 - 125
  • [39] Clustering Techniques Applied on Cross-Sectional Unemployment Data
    Balsa, Carlos
    Nunes, Alcina
    Barros, Elisa
    DYNAMICS, GAMES AND SCIENCE, 2015, 1 : 71 - 88
  • [40] A Hierarchical Approach to Represent Relational Data Applied to Clustering Tasks
    Xavier-Junior, Joao C.
    Canuto, Anne M. P.
    Freitas, Alex A.
    Goncalves, Luiz M. G.
    Silla, Carlos N., Jr.
    2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 3055 - 3062