MAGPIE: accurate pathogenic prediction for multiple variant types using machine learning approach

Cited by: 6
Authors
Liu, Yicheng [1, 2, 3]
Zhang, Tianyun [1, 2]
You, Ningyuan [1, 2]
Wu, Sai [2, 3]
Shen, Ning [1, 2]
Affiliations
[1] Zhejiang Univ, Sch Med, Affiliated Hosp 1, Dept Hepatobiliary & Pancreat Surg, Hangzhou 310006, Peoples R China
[2] Zhejiang Univ, Liangzhu Lab, 1369 West Wenyi Rd, Hangzhou 311121, Peoples R China
[3] Zhejiang Univ, Coll Comp Sci, Yuquan Campus,Rd Zheda 38, Hangzhou 310007, Peoples R China
Keywords
Pathogenic prediction; Multimodal annotation; Machine learning; Genomic variation; HUMAN GENE; MUTATIONS; DATABASE; IMPACT; SIFT
DOI
10.1186/s13073-023-01274-4
Chinese Library Classification number
Q3 [Genetics]
Subject classification codes
071007; 090102
Abstract
Identifying pathogenic variants from the vast majority of nucleotide variation remains a challenge. We present a method named Multimodal Annotation Generated Pathogenic Impact Evaluator (MAGPIE) that predicts the pathogenicity of multi-type variants. MAGPIE uses the ClinVar dataset for training and demonstrates superior performance in both the independent test set and multiple orthogonal validation datasets, accurately predicting variant pathogenicity. Notably, MAGPIE performs best in predicting the pathogenicity of rare variants and highly imbalanced datasets. Overall, results underline the robustness of MAGPIE as a valuable tool for predicting pathogenicity in various types of human genome variations. MAGPIE is available at https://github.com/shenlab-genomics/magpie.
Pages: 19