Family-specific analysis of variant pathogenicity prediction tools

被引:8
|
作者
Zaucha, Jan [1 ]
Heinzinger, Michael [2 ]
Tarnovskaya, Svetlana [3 ]
Rost, Burkhard [2 ]
Frishman, Dmitrij [1 ]
机构
[1] Tech Univ Munich, Dept Bioinformat, D-85354 Freising Weihenstephan, Germany
[2] Tech Univ Munich, Dept Informat Bioinformat & Computat Biol I12, D-85748 Garching, Germany
[3] Almazov Natl Med Res Ctr, St Petersburg 197341, Russia
基金
俄罗斯科学基金会;
关键词
DATABASE; MUTATIONS; IMPACT;
D O I
10.1093/nargab/lqaa014
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Using the presently available datasets of annotated missense variants, we ran a protein family-specific benchmarking of tools for predicting the pathogenicity of single amino acid variants. We find that despite the high overall accuracy of all tested methods, each tool has its Achilles heel, i.e. protein families in which its predictions prove unreliable (expected accuracy does not exceed 51% in any method). As a proof of principle, we show that choosing the optimal tool and pathogenicity threshold at a protein family-individual level allows obtaining reliable predictions in all Pfam domains (accuracy no less than 68%). A functional analysis of the sets of protein domains annotated exclusively by neutral or pathogenic mutations indicates that specific protein functions can be associated with a high or low sensitivity to mutations, respectively. The highly sensitive sets of protein domains are involved in the regulation of transcription and DNA sequence-specific transcription factor binding, while the domains that do not result in disease when mutated are responsible for mediating immune and stress responses. These results suggest that future predictors of pathogenicity and especially variant prioritization tools may benefit from considering functional annotation.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Protein sequence alignment with family-specific amino acid similarity matrices
    Kuznetsov I.B.
    [J]. BMC Research Notes, 4 (1)
  • [42] Detection of Chlamydiaceae DNA in veterinary specimens using a family-specific PCR
    Condon, K.
    Oakey, J.
    [J]. LETTERS IN APPLIED MICROBIOLOGY, 2007, 45 (02) : 121 - 127
  • [43] Missense variant pathogenicity predictors generalize well across a range of function-specific prediction challenges
    Pejaver, Vikas
    Mooney, Sean D.
    Radivojac, Predrag
    [J]. HUMAN MUTATION, 2017, 38 (09) : 1092 - 1108
  • [44] VIPPID: a gene-specific single nucleotide variant pathogenicity prediction tool for primary immunodeficiency diseases
    Fang, Mingyan
    Su, Zheng
    Abolhassani, Hassan
    Itan, Yuval
    Jin, Xin
    Hammarstrom, Lennart
    [J]. BRIEFINGS IN BIOINFORMATICS, 2022, 23 (05)
  • [45] Genetic variant pathogenicity prediction trained using disease-specific clinical sequencing data sets
    Evans, Perry
    Wu, Chao
    Lindy, Amanda
    McKnight, Dianalee A.
    Lebo, Matthew
    Sarmady, Mahdi
    Abou Tayoun, Ahmad N.
    [J]. GENOME RESEARCH, 2019, 29 (07) : 1144 - 1151
  • [46] Semiparametric estimation of major gene and family-specific random effects for age of onset
    Li, HZ
    Thompson, E
    [J]. BIOMETRICS, 1997, 53 (01) : 282 - 293
  • [47] Family-specific aggregation of lipid GWAS variants confers the susceptibility to familial hypercholesterolemia in a large Austrian family
    Nikkola, Elina
    Ko, Arthur
    Alvarez, Marcus
    Cantor, Rita M.
    Garske, Kristina
    Kim, Elliot
    Gee, Stephanie
    Rodriguez, Alejandra
    Muxel, Reinhard
    Matikainen, Niina
    Soderlund, Sanni
    Motazacker, Mahdi M.
    Boren, Jan
    Lamina, Claudia
    Kronenberg, Florian
    Schneider, Wolfgang J.
    Palotie, Aarno
    Laakso, Markku
    Taskinen, Marja-Riitta
    Pajukanta, Paivi
    [J]. ATHEROSCLEROSIS, 2017, 264 : 58 - 66
  • [48] Leveraging family-specific signatures for AMP discovery and high-throughput annotation
    Faiza Hanif Waghu
    Ram Shankar Barai
    Susan Idicula-Thomas
    [J]. Scientific Reports, 6
  • [49] Evidence for instrument and family-specific variation of subclinical psychosis dimensions in the general population
    Hanssen, M
    Krabbendam, L
    Vollema, M
    Delespaul, P
    Van Os, J
    [J]. JOURNAL OF ABNORMAL PSYCHOLOGY, 2006, 115 (01) : 5 - 14
  • [50] Applicability of multigene family-specific antibodies toward studies of the subtilases in Arabidopsis thaliana
    Li, Ming
    Wan, William N.
    Petrova, Olga
    Huang, Feilei
    Zhou, Zhenzhen
    Boyd, Patrick
    Wilson, Karl A.
    Tan-Wilson, Anna
    [J]. ANALYTICAL BIOCHEMISTRY, 2009, 384 (01) : 114 - 122