DMCM: a Data-adaptive Mutation Clustering Method to identify cancer-related mutation clusters

被引:24
|
作者
Lu, Xinguo [1 ]
Qian, Xin [1 ]
Li, Xing [1 ]
Miao, Qiumai [1 ]
Peng, Shaoliang [1 ,2 ]
机构
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Hunan, Peoples R China
[2] Natl Univ Def Technol, Sch Comp Sci, Changsha 410073, Hunan, Peoples R China
关键词
SOMATIC MUTATIONS; GENOMES;
D O I
10.1093/bioinformatics/bty624
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Functional somatic mutations within coding amino acid sequences confer growth advantage in pathogenic process. Most existing methods for identifying cancer-related mutations focus on the single amino acid or the entire gene level. However, gain-of-function mutations often cluster in specific protein regions instead of existing independently in the amino acid sequences. Some approaches for identifying mutation clusters with mutation density on amino acid chain have been proposed recently. But their performance in identification of mutation clusters remains to be improved. Results: Here we present a Data-adaptive Mutation Clustering Method ( DMCM), in which kernel density estimate (KDE) with a data-adaptive bandwidth is applied to estimate the mutation density, to find variable clusters with different lengths on amino acid sequences. We apply this approach in the mutation data of 571 genes in over twenty cancer types from The Cancer Genome Atlas (TCGA). We compare the DMCM with (MC)-C-2, OncodriveCLUST and Pfam Domain and find that DMCM tends to identify more significant clusters. The cross-validation analysis shows DMCM is robust and cluster cancer type enrichment analysis shows that specific cancer types are enriched for specific mutation clusters.
引用
收藏
页码:389 / 397
页数:9
相关论文
共 50 条
  • [21] Differential analysis between somatic mutation and germline variation profiles reveals cancer-related genes
    Pawel F. Przytycki
    Mona Singh
    [J]. Genome Medicine, 9
  • [22] Identifying relationships between imaging phenotypes and lung cancer-related mutation status: EGFR and KRAS
    Pinheiro, Gil
    Pereira, Tania
    Dias, Catarina
    Freitas, Claudia
    Hespanhol, Venceslau
    Costa, Jose Luis
    Cunha, Antonio
    Oliveira, Helder P.
    [J]. SCIENTIFIC REPORTS, 2020, 10 (01)
  • [23] Differential analysis between somatic mutation and germline variation profiles reveals cancer-related genes
    Przytycki, Pawel F.
    Singh, Mona
    [J]. GENOME MEDICINE, 2017, 9
  • [24] Benchmarking mutation effect prediction algorithms using functionally validated cancer-related missense mutations
    Luciano G Martelotto
    Charlotte KY Ng
    Maria R De Filippo
    Yan Zhang
    Salvatore Piscuoglio
    Raymond S Lim
    Ronglai Shen
    Larry Norton
    Jorge S Reis-Filho
    Britta Weigelt
    [J]. Genome Biology, 15
  • [25] Quality of life and its relation to cancer-related stress in women of families with hereditary cancer without demonstrated mutation
    Geirdal, AO
    Mæhle, L
    Heimdal, K
    Stormorken, A
    Moller, P
    Dahl, AA
    [J]. QUALITY OF LIFE RESEARCH, 2006, 15 (03) : 461 - 470
  • [26] ANALYSIS OF 1200 CANCER-RELATED GENES FOR PANCREATIC CANCER GERMLINE SUSCEPTIBILITY VARIANTS AND PATIENT SURVIVAL BY MUTATION STATUS
    Paknikar, Raghavendra
    Alleyne, Dereck
    Brown, Miguel
    Perez, Edgar
    Buschmann, Mary
    Stoll, Jessica
    Kirby, Kori
    Roggin, Kevin K.
    Kupfer, Sonia S.
    [J]. GASTROENTEROLOGY, 2019, 156 (06) : S319 - S320
  • [27] Quality of Life and its Relation to Cancer-Related Stress in Women of Families with Hereditary Cancer without Demonstrated Mutation
    Amy Østertun Geirdal
    Lovise Mæhle
    Ketil Heimdal
    Astrid Stormorken
    Pål Møller
    Alv A. Dahl
    [J]. Quality of Life Research, 2006, 15 : 461 - 470
  • [28] Intergenerational communication and cancer-related beliefs within family: a resilience factor after BRCA mutation disclosure?
    Grados, C.
    Barrault, M.
    M'Bailara, K.
    [J]. PSYCHOLOGY & HEALTH, 2012, 27 : 220 - 220
  • [29] Correction to: Differential analysis between somatic mutation and germline variation profiles reveals cancer-related genes
    Pawel F. Przytycki
    Mona Singh
    [J]. Genome Medicine, 10
  • [30] Multiple Omics Data Integration to Identify Long Noncoding RNA Responsible for Breast Cancer-Related Mortality
    Sarkar, Tapasree Roy
    Maity, Arnab Kumar
    Niu, Yabo
    Mallick, Bani K.
    [J]. CANCER INFORMATICS, 2019, 18