ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data

Cited by: 10143
Authors
Wang, Kai [1]
Li, Mingyao [2]
Hakonarson, Hakon [1, 3]
Affiliations
[1] Children's Hospital of Philadelphia, Center for Applied Genomics, Philadelphia, PA 19104, USA
[2] University of Pennsylvania, Department of Biostatistics and Epidemiology, Philadelphia, PA 19104, USA
[3] University of Pennsylvania, Department of Pediatrics, Philadelphia, PA 19104, USA
Keywords
SNPs; association; genomes
DOI
10.1093/nar/gkq603
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
High-throughput sequencing platforms are generating massive amounts of genetic variation data for diverse genomes, but it remains a challenge to pinpoint a small subset of functionally important variants. To fill these unmet needs, we developed the ANNOVAR tool to annotate single nucleotide variants (SNVs) and insertions/deletions, such as examining their functional consequence on genes, inferring cytogenetic bands, reporting functional importance scores, finding variants in conserved regions, or identifying variants reported in the 1000 Genomes Project and dbSNP. ANNOVAR can utilize annotation databases from the UCSC Genome Browser or any annotation data set conforming to Generic Feature Format version 3 (GFF3). We also illustrate a 'variants reduction' protocol on 4.7 million SNVs and indels from a human genome, including two causal mutations for Miller syndrome, a rare recessive disease. Through a stepwise procedure, we excluded variants that are unlikely to be causal, and identified 20 candidate genes including the causal gene. Using a desktop computer, ANNOVAR requires ~4 min to perform gene-based annotation and ~15 min to perform variants reduction on 4.7 million variants, making it practical to handle hundreds of human genomes in a day. ANNOVAR is freely available at http://www.openbioinformatics.org/annovar/.
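The stepwise 'variants reduction' idea in the abstract can be illustrated with a small sketch. This is not ANNOVAR's code or command-line interface; the abstract does not list the individual steps, so the filters chosen here (keep protein-altering variants, drop variants already seen in the 1000 Genomes Project or dbSNP, then apply a simple recessive-model rule per gene) are assumptions, as are the `Variant` record, the function names, and the placeholder gene names.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Tuple


@dataclass(frozen=True)
class Variant:
    """Hypothetical, already-annotated variant record (not an ANNOVAR format)."""
    gene: str
    effect: str        # e.g. "nonsynonymous", "synonymous", "stopgain"
    in_1000g: bool     # reported in the 1000 Genomes Project
    in_dbsnp: bool     # reported in dbSNP
    homozygous: bool


# Assumed set of effect labels treated as protein-altering.
PROTEIN_ALTERING = {"nonsynonymous", "stopgain", "frameshift", "splicing"}


def is_protein_altering(v: Variant) -> bool:
    return v.effect in PROTEIN_ALTERING


def is_not_previously_reported(v: Variant) -> bool:
    return not (v.in_1000g or v.in_dbsnp)


Step = Tuple[str, Callable[[Variant], bool]]


def reduce_variants(variants: Iterable[Variant], steps: List[Step]) -> List[Variant]:
    """Apply each filter in order, reporting how many variants survive."""
    remaining = list(variants)
    for name, keep in steps:
        remaining = [v for v in remaining if keep(v)]
        print(f"{name}: {len(remaining)} variants remain")
    return remaining


def recessive_candidate_genes(variants: Iterable[Variant]) -> List[str]:
    """Genes with one homozygous variant or at least two variants in total."""
    alleles = {}
    for v in variants:
        alleles[v.gene] = alleles.get(v.gene, 0) + (2 if v.homozygous else 1)
    return sorted(gene for gene, n in alleles.items() if n >= 2)


# Toy input with placeholder gene names.
toy_variants = [
    Variant("GENE_A", "nonsynonymous", False, False, False),
    Variant("GENE_A", "stopgain", False, False, False),
    Variant("GENE_B", "nonsynonymous", True, False, False),
    Variant("GENE_C", "synonymous", False, False, True),
    Variant("GENE_D", "nonsynonymous", False, False, True),
    Variant("GENE_E", "frameshift", False, False, False),
]

steps = [
    ("protein-altering", is_protein_altering),
    ("not in 1000 Genomes or dbSNP", is_not_previously_reported),
]
survivors = reduce_variants(toy_variants, steps)
candidates = recessive_candidate_genes(survivors)
print(candidates)
```

Keeping each filter as a separate named step makes it easy to see how many variants each stage removes, which mirrors the stepwise exclusion the abstract describes.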
Pages: 7