Prioritizing Disease-Linked Variants, Genes, and Pathways with an Interactive Whole-Genome Analysis Pipeline

被引:18
|
作者
Lee, In-Hee [1 ]
Lee, Kyungjoon [2 ]
Hsing, Michael [1 ]
Choe, Yongjoon [1 ]
Park, Jin-Ho [1 ,3 ]
Kim, Shu Hee [4 ]
Bohn, Justin M. [1 ]
Neu, Matthew B. [1 ]
Hwang, Kyu-Baek [5 ]
Green, Robert C. [6 ]
Kohane, Isaac S. [1 ,2 ]
Kong, Sek Won [1 ]
机构
[1] Childrens Hosp, Harvard Div Hlth Sci & Technol, Childrens Hosp Informat Program, Dept Med, Boston, MA 02115 USA
[2] Harvard Univ, Ctr Biomed Informat, Sch Med, Boston, MA 02115 USA
[3] Seoul Natl Univ Hosp, Dept Family Med, Seoul 110744, South Korea
[4] Stanford Univ, Palo Alto, CA 94305 USA
[5] Soongsil Univ, Sch Comp Sci & Engn, Seoul 156743, South Korea
[6] Brigham & Womens Hosp, Dept Med, Div Genet, Boston, MA 02115 USA
关键词
whole-genome sequences; variant annotation; disease gene discovery; analysis pipeline; RARE-VARIANT; PERSONAL GENOMES; UVEAL MELANOMA; MUTATIONS; SEQUENCE; EXOME; DATABASE; TOOL; FRAMEWORK; COMMON;
D O I
10.1002/humu.22520
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Whole-genome sequencing (WGS) studies are uncovering disease-associated variants in both rare and nonrare diseases. Utilizing the next-generation sequencing for WGS requires a series of computational methods for alignment, variant detection, and annotation, and the accuracy and reproducibility of annotation results are essential for clinical implementation. However, annotating WGS with up to date genomic information is still challenging for biomedical researchers. Here, we present one of the fastest and highly scalable annotation, filtering, and analysis pipelinegNOMEto prioritize phenotype-associated variants while minimizing false-positive findings. Intuitive graphical user interface of gNOME facilitates the selection of phenotype-associated variants, and the result summaries are provided at variant, gene, and genome levels. Moreover, the enrichment results of specific variants, genes, and gene sets between two groups or compared with population scale WGS datasets that is already integrated in the pipeline can help the interpretation. We found a small number of discordant results between annotation software tools in part due to different reporting strategies for the variants with complex impacts. Using two published whole-exome datasets of uveal melanoma and bladder cancer, we demonstrated gNOME's accuracy of variant annotation and the enrichment of loss-of-function variants in known cancer pathways. gNOME Web server and source codes are freely available to the academic community ().
引用
收藏
页码:537 / 547
页数:11
相关论文
共 50 条
  • [21] BacSeq: A User-Friendly Automated Pipeline for Whole-Genome Sequence Analysis of Bacterial Genomes
    Chukamnerd, Arnon
    Jeenkeawpiam, Kongpop
    Chusri, Sarunyou
    Pomwised, Rattanaruji
    Singkhamanan, Kamonnut
    Surachat, Komwit
    MICROORGANISMS, 2023, 11 (07)
  • [22] Prediction and analysis of metagenomic operons via MetaRon: a pipeline for prediction of Metagenome and whole-genome opeRons
    Syed Shujaat Ali Zaidi
    Masood Ur Rehman Kayani
    Xuegong Zhang
    Younan Ouyang
    Imran Haider Shamsi
    BMC Genomics, 22
  • [23] MycoVarP: Mycobacterium Variant and Drug Resistance Prediction Pipeline for Whole-Genome Sequence Data Analysis
    Swargam, Sandeep
    Kumari, Indu
    Kumar, Amit
    Pradhan, Dibyabhaba
    Alam, Anwar
    Singh, Harpreet
    Jain, Anuja
    Devi, Kangjam Rekha
    Trivedi, Vishal
    Sarma, Jogesh
    Hanif, Mahmud
    Narain, Kanwar
    Ehtesham, Nasreen Zafar
    Hasnain, Seyed Ehtesham
    Ahmad, Shandar
    FRONTIERS IN BIOINFORMATICS, 2022, 1
  • [24] Prediction and analysis of metagenomic operons via MetaRon: a pipeline for prediction of Metagenome and whole-genome opeRons
    Zaidi, Syed Shujaat Ali
    Kayani, Masood Ur Rehman
    Zhang, Xuegong
    Ouyang, Younan
    Shamsi, Imran Haider
    BMC GENOMICS, 2021, 22 (01)
  • [25] Whole-genome sequencing identifies associations of sequence variants with clinically relevant urinary disease markers
    Benonisdottir, S.
    Oddsson, A.
    Sulem, G.
    Kristjansson, R. P.
    Olafsson, I.
    Onundarson, P. T.
    Kehr, B.
    Arnadottir, G. A.
    Holm, H.
    Masson, G.
    Edvardsson, V. O.
    Palsson, R.
    Jonasdottir, A.
    Jonasdottir, A.
    Mikaelsdottir, E.
    Eyjolfsson, G. I.
    Jensson, B. O.
    Thorsteinsdottir, U.
    Gudbjartsson, D. F.
    Sulem, P.
    Stefansson, K.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2018, 26 : 92 - 93
  • [26] Retention and loss of amino acid biosynthetic pathways based on analysis of whole-genome sequences
    Payne, SH
    Loomis, WF
    EUKARYOTIC CELL, 2006, 5 (02) : 272 - 276
  • [27] Whole-Genome Identification and Comparative Expression Analysis of Anthocyanin Biosynthetic Genes in Brassica napus
    He, Dan
    Zhang, Dawei
    Li, Ting
    Liu, Lili
    Zhou, Dinggang
    Kang, Lei
    Wu, Jinfeng
    Liu, Zhongsong
    Yan, Mingli
    FRONTIERS IN GENETICS, 2021, 12
  • [28] Whole-genome analysis of genes regulated by the Bacillus subtilis competence transcription factor ComK
    Ogura, M
    Yamaguchi, H
    Kobayashi, K
    Ogasawara, N
    Fujita, Y
    Tanaka, T
    JOURNAL OF BACTERIOLOGY, 2002, 184 (09) : 2344 - 2351
  • [29] Whole-genome analysis of two-component signal transduction genes in fungal pathogens
    Catlett, NL
    Yoder, OC
    Turgeon, BG
    EUKARYOTIC CELL, 2003, 2 (06) : 1151 - 1161
  • [30] Variants of uncertain significance of the cancer-predisposing genes in two thousand Japanese whole-genome sequencing data
    Yasuda, Jun
    Iida, Keita
    Kumada, Kazuki
    Ogishima, Soichi
    Shibuya, Yusuke
    Tokunaga, Hideki
    Yaegashi, Nobuo
    CANCER RESEARCH, 2018, 78 (13)