Medoidshift clustering applied to genomic bulk tumor data

被引:3
|
作者
Roman, Theodore [1 ,2 ]
Xie, Lu [1 ,2 ]
Schwartz, Russell [1 ,3 ]
机构
[1] Carnegie Mellon Univ, Sch Comp Sci, Computat Biol Dept, 5000 Forbes Ave, Pittsburgh, PA 15213 USA
[2] Joint Carnegie Mellon Univ Pittsburgh, PhD Program Computat Biol, 5000 Forbes Ave, Pittsburgh, PA 15213 USA
[3] Carnegie Mellon Univ, Mellon Coll Sci, Dept Biol Sci, 4400 Fifth Ave, Pittsburgh, PA 15213 USA
来源
BMC GENOMICS | 2016年 / 17卷
关键词
Computational biology; Clustering; Tumor; Heterogeneity; INTRATUMOR HETEROGENEITY; SEQUENCING REVEALS; GENETIC-ANALYSIS; WHOLE-GENOME; IN-SITU; CELL; EVOLUTION; EXPRESSION; MUTATIONS; CARCINOMA;
D O I
10.1186/s12864-015-2302-x
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Despite the enormous medical impact of cancers and intensive study of their biology, detailed characterization of tumor growth and development remains elusive. This difficulty occurs in large part because of enormous heterogeneity in the molecular mechanisms of cancer progression, both tumor-to-tumor and cell-to-cell in single tumors. Advances in genomic technologies, especially at the single-cell level, are improving the situation, but these approaches are held back by limitations of the biotechnologies for gathering genomic data from heterogeneous cell populations and the computational methods for making sense of those data. One popular way to gain the advantages of whole-genome methods without the cost of single-cell genomics has been the use of computational deconvolution (unmixing) methods to reconstruct clonal heterogeneity from bulk genomic data. These methods, too, are limited by the difficulty of inferring genomic profiles of rare or subtly varying clonal subpopulations from bulk data, a problem that can be computationally reduced to that of reconstructing the geometry of point clouds of tumor samples in a genome space. Here, we present a new method to improve that reconstruction by better identifying subspaces corresponding to tumors produced from mixtures of distinct combinations of clonal subpopulations. We develop a nonparametric clustering method based on medoidshift clustering for identifying subgroups of tumors expected to correspond to distinct trajectories of evolutionary progression. We show on synthetic and real tumor copy-number data that this new method substantially improves our ability to resolve discrete tumor subgroups, a key step in the process of accurately deconvolving tumor genomic data and inferring clonal heterogeneity from bulk data.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Medoidshift clustering applied to genomic bulk tumor data
    Theodore Roman
    Lu Xie
    Russell Schwartz
    BMC Genomics, 17
  • [2] Improved geometric deconvolution of bulk tumor genomic data
    Roman, Theodore
    Xie, Lu
    Schwartz, Russell
    CANCER RESEARCH, 2016, 76
  • [3] Automating deconvolution of heterogeneous bulk tumor genomic data
    Roman, Theodore
    Xiao, Brenda
    Schwartz, Russell
    CANCER RESEARCH, 2017, 77
  • [4] Clustering: Applied to Data Structuring and
    Iloanusi, Ogechukwu N.
    Osuagwu, Charles C.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2011, 2 (11) : 100 - 105
  • [5] Interactive data analysis and clustering of genomic data
    Ciaramella, A.
    Cocozza, S.
    Iorio, F.
    Miele, G.
    Napolitano, F.
    Pinelli, M.
    Raiconi, G.
    Tagliaferri, R.
    NEURAL NETWORKS, 2008, 21 (2-3) : 368 - 378
  • [6] Genomic Data Clustering on FPGAs for Compression
    Petraglio, Enrico
    Wertenbroek, Rick
    Capitao, Flavio
    Guex, Nicolas
    Iseli, Christian
    Thoma, Yann
    APPLIED RECONFIGURABLE COMPUTING, 2017, 10216 : 229 - 240
  • [7] Improved deconvolution of heterogeneous tumor data to reconstruct clonal evolution from bulk genomic samples
    Roman, Theodore
    Schwartz, Russell
    CANCER RESEARCH, 2015, 75
  • [8] INTEGRATING TUMOR MICROENVIRONMENT WITH GENOMIC ABERRATIONS AND TUMOR CLASS IN ADULT GLIAL/GLIONEURONAL TUMORS BY DECONVOLUTION OF BULK METHYLATION DATA
    Singh, Omkar
    Aldape, Kenneth
    Pratt, Drew
    NEURO-ONCOLOGY, 2021, 23 : 118 - 118
  • [9] Genomic data integration using guided clustering
    Maneck, Matthias
    Schrader, Alexandra
    Kube, Dieter
    Spang, Rainer
    BIOINFORMATICS, 2011, 27 (16) : 2231 - 2238
  • [10] Interaction Based Functional Clustering of Genomic Data
    Krishna, Ritesh
    Li, Chang-Tsun
    Buchanan-Wollaston, Vicky
    2009 9TH IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING, 2009, : 130 - +