A mixture model for signature discovery from sparse mutation data

被引:0
|
作者
Itay Sason
Yuexi Chen
Mark D.M. Leiserson
Roded Sharan
机构
[1] Blavatnik School of Computer Science,Department of Computer Science and Center for Bioinformatics and Computational Biology
[2] Tel Aviv University,undefined
[3] University of Maryland,undefined
来源
关键词
Mutational signatures; Probabilistic modeling; Gene panel sequencing;
D O I
暂无
中图分类号
学科分类号
摘要
Mutational signatures are key to understanding the processes that shape cancer genomes, yet their analysis requires relatively rich whole-genome or whole-exome mutation data. Recently, orders-of-magnitude sparser gene-panel-sequencing data have become increasingly available in the clinic. To deal with such sparse data, we suggest a novel mixture model, Mix. In application to simulated and real gene-panel sequences, Mix is shown to outperform current approaches and yield mutational signatures and patient stratifications that are in higher agreement with the literature. We further demonstrate its utility in several clinical settings, successfully predicting therapy benefit and patient groupings from MSK-IMPACT pan-cancer data. Availability: https://github.com/itaysason/Mix-MMM.
引用
收藏
相关论文
共 50 条
  • [1] A mixture model for signature discovery from sparse mutation data
    Sason, Itay
    Chen, Yuexi
    Leiserson, Mark D. M.
    Sharan, Roded
    GENOME MEDICINE, 2021, 13 (01)
  • [2] A Biterm Topic Model for Sparse Mutation Data
    Sason, Itay
    Chen, Yuexi
    Leiserson, Mark D. M.
    Sharan, Roded
    CANCERS, 2023, 15 (05)
  • [3] A Bayesian sparse finite mixture model for clustering data from a heterogeneous population
    Saraiva, Erlandson F.
    Suzuki, Adriano K.
    Milan, Luis A.
    BRAZILIAN JOURNAL OF PROBABILITY AND STATISTICS, 2020, 34 (02) : 323 - 344
  • [4] Causal Discovery on Discrete Data with Extensions to Mixture Model
    Liu, Furui
    Chan, Laiwan
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2016, 7 (02)
  • [5] Biomarker Signature Discovery from Mass Spectrometry Data
    Kong, Ao
    Gupta, Chinmaya
    Ferrari, Mauro
    Agostini, Marco
    Bedin, Chiara
    Bouamrani, Ali
    Tasciotti, Ennio
    Azencott, Robert
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2014, 11 (04) : 766 - 772
  • [6] Causal discovery from a mixture of experimental and observational data
    Cooper, GF
    Yoo, C
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 1999, : 116 - 125
  • [7] Clustering sparse binary data with hierarchical Bayesian Bernoulli mixture model
    Ye, Mao
    Zhang, Peng
    Nie, Lizhen
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2018, 123 : 32 - 49
  • [8] Efficient mixture model for clustering of sparse high dimensional binary data
    Marek Śmieja
    Krzysztof Hajto
    Jacek Tabor
    Data Mining and Knowledge Discovery, 2019, 33 : 1583 - 1624
  • [9] Efficient mixture model for clustering of sparse high dimensional binary data
    Smieja, Marek
    Hajto, Krzysztof
    Tabor, Jacek
    DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 33 (06) : 1583 - 1624
  • [10] ConSIG: consistent discovery of molecular signature from OMIC data
    Li, Fengcheng
    Yin, Jiayi
    Lu, Mingkun
    Yang, Qingxia
    Zeng, Zhenyu
    Zhang, Bing
    Li, Zhaorong
    Qiu, Yunqing
    Dai, Haibin
    Chen, Yuzong
    Zhu, Feng
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (04)