ExomeHMM: A Hidden Markov Model for Detecting Copy Number Variation Using Whole-Exome Sequencing Data

Cited by: 5
Authors
Guo, Cheng [1]
Yu, Zhenhua [1]
Wang, Minghui [1, 2]
Li, Ao [1, 2]
Affiliations
[1] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei 230027, Anhui, Peoples R China
[2] Univ Sci & Technol China, Res Ctr Biomed Engn, Hefei, Anhui, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Copy number variation; expectation-maximization algorithm; hidden Markov model; next generation sequencing; Viterbi algorithm; whole-exome sequencing; BREAST-CANCER; ACCURATE DETECTION; ANALYSIS TOOLKIT; HETEROZYGOSITY; VARIANTS; IDENTIFICATION; EXPRESSION; CAPTURE; DISEASE
DOI
10.2174/1574893611666160727160757
Chinese Library Classification number
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
Background: Copy number variations (CNVs), including amplifications and deletions, are alterations in DNA copy number relative to a reference genome. CNVs play a crucial role in tumorigenesis and tumor progression: amplification of oncogenes and deletion of tumor suppressor genes may significantly increase the risk of cancer. CNVs are also reported to be closely associated with non-cancer diseases such as Down syndrome, Parkinson disease, and Alzheimer disease.
Objective: Whole-exome sequencing (WES) has been successfully applied to the discovery of gene mutations and to clinical diagnosis. However, estimating copy number from WES data is challenging because of read depth bias, the distribution pattern of exons, and normal cell contamination. Our aim is to develop an efficient method that overcomes these challenges and detects CNVs from WES data.
Method: We present ExomeHMM, a hidden Markov model (HMM) based CNV detection algorithm. ExomeHMM uses relative read depth, a ratio-based signal, to mitigate read depth distortion, and employs an exponentially attenuated transition matrix to handle sparsely and non-uniformly distributed exons. The expectation-maximization algorithm is used to optimize the model parameters. Finally, the standard Viterbi algorithm is used to infer the copy number of each exon.
Results: Using previously identified CNVs in 1000 Genomes Project data as the gold standard, ExomeHMM achieves the highest F-score among the four methods compared in this study. When applied to triple-negative breast cancer data, ExomeHMM is able to find abnormal genes that are significantly associated with breast cancer.
Conclusion: ExomeHMM is a suitable tool for CNV detection in both healthy samples and clinical tumor samples using whole-exome sequencing data.
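The pipeline outlined in the Method paragraph can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not give the exact form of the transition matrix or the emission model, so the code below assumes (a) three hypothetical states with copy numbers 1, 2 and 3, (b) Gaussian emissions on the log2 read depth ratio, and (c) a transition matrix that decays exponentially toward a uniform matrix as the distance between adjacent exons grows. All function names, parameters, and default values (`transition_matrix`, `decay_bp`, `stay`, `sd`) are illustrative assumptions, and the parameters are fixed by hand rather than estimated with expectation-maximization.

```python
import numpy as np

def transition_matrix(distance_bp, base, decay_bp=1e5):
    """Assumed form: blend `base` toward a uniform matrix as exon distance grows."""
    k = base.shape[0]
    w = np.exp(-distance_bp / decay_bp)  # weight of `base`, decays with distance
    return w * base + (1.0 - w) * np.full((k, k), 1.0 / k)

def log_gaussian(x, mean, sd):
    """Log density of a normal distribution (assumed emission model)."""
    return -0.5 * np.log(2.0 * np.pi * sd ** 2) - (x - mean) ** 2 / (2.0 * sd ** 2)

def viterbi_copy_number(log_ratio, positions, copy_numbers=(1, 2, 3),
                        sd=0.2, stay=0.99, decay_bp=1e5):
    """Most likely copy number per exon under the assumptions stated above."""
    log_ratio = np.asarray(log_ratio, dtype=float)
    positions = np.asarray(positions, dtype=float)
    cn = np.asarray(copy_numbers, dtype=float)
    k, n = len(cn), len(log_ratio)

    means = np.log2(cn / 2.0)                       # expected log2 ratio per state
    base = np.full((k, k), (1.0 - stay) / (k - 1))  # short-range transitions
    np.fill_diagonal(base, stay)
    emit = log_gaussian(log_ratio[:, None], means[None, :], sd)  # shape (n, k)

    score = np.empty((n, k))
    back = np.zeros((n, k), dtype=int)
    score[0] = np.log(1.0 / k) + emit[0]            # uniform initial distribution
    for t in range(1, n):
        gap = positions[t] - positions[t - 1]
        log_t = np.log(transition_matrix(gap, base, decay_bp))
        cand = score[t - 1][:, None] + log_t        # cand[i, j]: state i -> state j
        back[t] = np.argmax(cand, axis=0)
        score[t] = cand[back[t], np.arange(k)] + emit[t]

    # Trace the best path backwards from the final exon.
    path = np.empty(n, dtype=int)
    path[-1] = np.argmax(score[-1])
    for t in range(n - 1, 0, -1):
        path[t - 1] = back[t, path[t]]
    return cn[path].astype(int)

# Toy input: ten exons 1 kb apart with a deletion and an amplification.
ratios = [0.0, 0.02, -0.98, -1.01, -0.95, 0.01, -0.03, 0.6, 0.58, 0.61]
calls = viterbi_copy_number(ratios, np.arange(10) * 1000)
print(calls)
```

Blending toward a uniform matrix means that exons separated by a large gap carry little information about each other's state, which is one plausible way to handle sparsely and non-uniformly distributed exons; the published model may use a different attenuation.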
Pages: 147-155
Number of pages: 9