A mixture model for signature discovery from sparse mutation data

被引:0
|
作者
Itay Sason
Yuexi Chen
Mark D.M. Leiserson
Roded Sharan
机构
[1] Blavatnik School of Computer Science,Department of Computer Science and Center for Bioinformatics and Computational Biology
[2] Tel Aviv University,undefined
[3] University of Maryland,undefined
来源
关键词
Mutational signatures; Probabilistic modeling; Gene panel sequencing;
D O I
暂无
中图分类号
学科分类号
摘要
Mutational signatures are key to understanding the processes that shape cancer genomes, yet their analysis requires relatively rich whole-genome or whole-exome mutation data. Recently, orders-of-magnitude sparser gene-panel-sequencing data have become increasingly available in the clinic. To deal with such sparse data, we suggest a novel mixture model, Mix. In application to simulated and real gene-panel sequences, Mix is shown to outperform current approaches and yield mutational signatures and patient stratifications that are in higher agreement with the literature. We further demonstrate its utility in several clinical settings, successfully predicting therapy benefit and patient groupings from MSK-IMPACT pan-cancer data. Availability: https://github.com/itaysason/Mix-MMM.
引用
收藏
相关论文
共 50 条
  • [41] Proteomic Signature of Acute Liver Failure: From Discovery and Verification in a Pig Model to Confirmation in Humans
    Wang, Jie
    Sun, Zeyu
    Jiang, Jing
    Wu, Daxian
    Liu, Xiaoli
    Xie, Zhongyang
    Chen, Ermei
    Zhu, Danhua
    Ye, Chao
    Zhang, Xiaoqian
    Chen, Wenqian
    Cao, Hongcui
    Li, Lanjuan
    MOLECULAR & CELLULAR PROTEOMICS, 2017, 16 (07) : 1188 - 1199
  • [42] Knowledge Discovery of Complex Data Using Gaussian Mixture Models
    Zhou, Linfei
    Ye, Wei
    Plant, Claudia
    Boehm, Christian
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2017, 2017, 10440 : 409 - 423
  • [43] On sparse regression, Lp-regularization, and automated model discovery
    McCulloch, Jeremy A.
    St Pierre, Skyler R.
    Linka, Kevin
    Kuhl, Ellen
    INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN ENGINEERING, 2024, 125 (14)
  • [44] GROUP SPARSE BAYESIAN LEARNING FOR DATA-DRIVEN DISCOVERY OF EXPLICIT MODEL FORMS WITH MULTIPLE PARAMETRIC DATASETS
    Sun, Luning
    Du, Pan
    Sun, Hao
    Wang, Jian-Xun
    NUMERICAL ALGEBRA CONTROL AND OPTIMIZATION, 2024, 14 (01): : 190 - 213
  • [45] A Sparse Topic Model for Bursty Topic Discovery in Social Networks
    Shi, Lei
    Du, Junping
    Kou, Feifei
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2020, 17 (05) : 816 - 824
  • [46] Model selection for prognostic time-to-event gene signature discovery with applications in early breast cancer data
    Ahdesmaeki, Miika
    Lancashire, Lee
    Proutski, Vitali
    Wilson, Claire
    Davison, Timothy S.
    Harkin, D. Paul
    Kennedy, Richard D.
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2013, 12 (05) : 619 - 635
  • [47] A MIXTURE MODEL APPROACH FOR ESTIMATING CROP AREAS FROM LANDSAT DATA
    LENNINGTON, RK
    SORENSEN, CT
    REMOTE SENSING OF ENVIRONMENT, 1984, 14 (1-3) : 197 - 206
  • [48] Unsupervised learning from incomplete data using a mixture model approach
    Hunt, L
    Jorgensen, M
    STATISTICAL DATA MINING AND KNOWLEDGE DISCOVERY, 2004, : 173 - 191
  • [49] A Causal Dirichlet Mixture Model for Causal Inference from Observational Data
    Lin, Adi
    Lu, Jie
    Xuan, Junyu
    Zhu, Fujin
    Zhang, Guangquan
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2020, 11 (03)
  • [50] Mutation Discovery in Regions of Segmental Cancer Genome Amplifications with CoNAn-SNV: A Mixture Model for Next Generation Sequencing of Tumors
    Crisan, Anamaria
    Goya, Rodrigo
    Ha, Gavin
    Ding, Jiarui
    Prentice, Leah M.
    Oloumi, Arusha
    Senz, Janine
    Zeng, Thomas
    Tse, Kane
    Delaney, Allen
    Marra, Marco A.
    Huntsman, David G.
    Hirst, Martin
    Aparicio, Sam
    Shah, Sohrab
    PLOS ONE, 2012, 7 (08):