Prioritizing Disease-Linked Variants, Genes, and Pathways with an Interactive Whole-Genome Analysis Pipeline

被引:18
|
作者
Lee, In-Hee [1 ]
Lee, Kyungjoon [2 ]
Hsing, Michael [1 ]
Choe, Yongjoon [1 ]
Park, Jin-Ho [1 ,3 ]
Kim, Shu Hee [4 ]
Bohn, Justin M. [1 ]
Neu, Matthew B. [1 ]
Hwang, Kyu-Baek [5 ]
Green, Robert C. [6 ]
Kohane, Isaac S. [1 ,2 ]
Kong, Sek Won [1 ]
机构
[1] Childrens Hosp, Harvard Div Hlth Sci & Technol, Childrens Hosp Informat Program, Dept Med, Boston, MA 02115 USA
[2] Harvard Univ, Ctr Biomed Informat, Sch Med, Boston, MA 02115 USA
[3] Seoul Natl Univ Hosp, Dept Family Med, Seoul 110744, South Korea
[4] Stanford Univ, Palo Alto, CA 94305 USA
[5] Soongsil Univ, Sch Comp Sci & Engn, Seoul 156743, South Korea
[6] Brigham & Womens Hosp, Dept Med, Div Genet, Boston, MA 02115 USA
关键词
whole-genome sequences; variant annotation; disease gene discovery; analysis pipeline; RARE-VARIANT; PERSONAL GENOMES; UVEAL MELANOMA; MUTATIONS; SEQUENCE; EXOME; DATABASE; TOOL; FRAMEWORK; COMMON;
D O I
10.1002/humu.22520
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Whole-genome sequencing (WGS) studies are uncovering disease-associated variants in both rare and nonrare diseases. Utilizing the next-generation sequencing for WGS requires a series of computational methods for alignment, variant detection, and annotation, and the accuracy and reproducibility of annotation results are essential for clinical implementation. However, annotating WGS with up to date genomic information is still challenging for biomedical researchers. Here, we present one of the fastest and highly scalable annotation, filtering, and analysis pipelinegNOMEto prioritize phenotype-associated variants while minimizing false-positive findings. Intuitive graphical user interface of gNOME facilitates the selection of phenotype-associated variants, and the result summaries are provided at variant, gene, and genome levels. Moreover, the enrichment results of specific variants, genes, and gene sets between two groups or compared with population scale WGS datasets that is already integrated in the pipeline can help the interpretation. We found a small number of discordant results between annotation software tools in part due to different reporting strategies for the variants with complex impacts. Using two published whole-exome datasets of uveal melanoma and bladder cancer, we demonstrated gNOME's accuracy of variant annotation and the enrichment of loss-of-function variants in known cancer pathways. gNOME Web server and source codes are freely available to the academic community ().
引用
收藏
页码:537 / 547
页数:11
相关论文
共 50 条
  • [1] A Whole-Genome Analysis Framework for Effective Identification of Pathogenic Regulatory Variants in Mendelian Disease
    Smedley, Damian
    Schubach, Max
    Jacobsen, Julius O. B.
    Koehler, Sebastian
    Zemojtel, Tomasz
    Spielmann, Malte
    Jaeger, Marten
    Hochheiser, Harry
    Washington, Nicole L.
    McMurry, Julie A.
    Haendel, Melissa A.
    Mungall, Christopher J.
    Lewis, Suzanna E.
    Groza, Tudor
    Valentini, Giorgio
    Robinson, Peter N.
    AMERICAN JOURNAL OF HUMAN GENETICS, 2016, 99 (03) : 595 - 606
  • [2] Whole-genome sequencing analysis of structural variants in oesophageal adenocarcinoma
    Contino, Gianmarco
    Secrier, Maria
    Edward, Paul A. W.
    Fitzgerald, Rebecca
    LANCET, 2017, 389 : 34 - 34
  • [3] VCF.Filter: interactive prioritization of disease-linked genetic variants from sequencing data
    Mueller, Heiko
    Jimenez-Heredia, Raul
    Krolo, Ana
    Hirschmugl, Tatjana
    Dmytrus, Jasmin
    Boztug, Kaan
    Bock, Christoph
    NUCLEIC ACIDS RESEARCH, 2017, 45 (W1) : W567 - W572
  • [4] msPIPE: a pipeline for the analysis and visualization of whole-genome bisulfite sequencing data
    Heesun Kim
    Mikang Sim
    Nayoung Park
    Kisang Kwon
    Junyoung Kim
    Jaebum Kim
    BMC Bioinformatics, 23
  • [5] msPIPE: a pipeline for the analysis and visualization of whole-genome bisulfite sequencing data
    Kim, Heesun
    Sim, Mikang
    Park, Nayoung
    Kwon, Kisang
    Kim, Junyoung
    Kim, Jaebum
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [6] Analysis of genetic variants in protein-coding genes of Aoluguya reindeer based on the whole-genome data
    Yi, Wenfeng
    Hu, Mingyue
    Shi, Lulu
    Li, Ting
    Sun, Hao
    Qin, Lihong
    Yan, Shouqing
    ANIMAL GENETICS, 2024, 55 (02) : 296 - 298
  • [7] Whole-Genome Profile of Greek Patients with Teratozοοspermia: Identification of Candidate Variants and Genes
    Kyrgiafini, Maria-Anna
    Giannoulis, Themistoklis
    Chatziparasidou, Alexia
    Christoforidis, Nikolaos
    Mamuris, Zissis
    GENES, 2022, 13 (09)
  • [8] Prospects for whole-genome linkage disequilibrium mapping of common disease genes
    L Kruglyak
    Nature Genetics, 1999, 22 : 139 - 144
  • [9] Prospects for whole-genome linkage disequilibrium mapping of common disease genes
    Kruglyak, L
    NATURE GENETICS, 1999, 22 (02) : 139 - 144
  • [10] Uncovering the roles of rare variants in common disease through whole-genome sequencing
    Elizabeth T. Cirulli
    David B. Goldstein
    Nature Reviews Genetics, 2010, 11 : 415 - 425