MAGPIE: accurate pathogenic prediction for multiple variant types using machine learning approach

被引:6
|
作者
Liu, Yicheng [1 ,2 ,3 ]
Zhang, Tianyun [1 ,2 ]
You, Ningyuan [1 ,2 ]
Wu, Sai [2 ,3 ]
Shen, Ning [1 ,2 ]
机构
[1] Zhejiang Univ, Sch Med, Affiliated Hosp 1, Dept Hepatobiliary & Pancreat Surg, Hangzhou 310006, Peoples R China
[2] Zhejiang Univ, Liangzhu Lab, 1369 West Wenyi Rd, Hangzhou 311121, Peoples R China
[3] Zhejiang Univ, Coll Comp Sci, Yuquan Campus,Rd Zheda 38, Hangzhou 310007, Peoples R China
关键词
Pathogenic prediction; Multimodal annotation; Machine learning; Genomic variation; HUMAN GENE; MUTATIONS; DATABASE; IMPACT; SIFT;
D O I
10.1186/s13073-023-01274-4
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Identifying pathogenic variants from the vast majority of nucleotide variation remains a challenge. We present a method named Multimodal Annotation Generated Pathogenic Impact Evaluator (MAGPIE) that predicts the pathogenicity of multi-type variants. MAGPIE uses the ClinVar dataset for training and demonstrates superior performance in both the independent test set and multiple orthogonal validation datasets, accurately predicting variant pathogenicity. Notably, MAGPIE performs best in predicting the pathogenicity of rare variants and highly imbalanced datasets. Overall, results underline the robustness of MAGPIE as a valuable tool for predicting pathogenicity in various types of human genome variations. MAGPIE is available at https://github.com/shenlab-genomics/magpie.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] An Approach for Severity Prediction of Autism Using Machine Learning
    Che, Min
    Wang, Liya
    Huang, Lin
    Jiang, Zhibin
    2019 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM), 2019, : 701 - 705
  • [32] Machine Learning Approach for Rapid, Accurate Point-of-Care Prediction of M-Spike Values in Multiple Myeloma
    Malek, Ehsan
    Wang, Gi-Ming
    Tatsuoka, Curtis
    Cullen, Jennifer
    Madabhushi, Anant
    Driscoll, James J.
    JCO CLINICAL CANCER INFORMATICS, 2023, 7
  • [33] Machine Learning Approach for Rapid, Accurate Point-of-Care Prediction of M-Spike Values in Multiple Myeloma
    Malek, Ehsan
    Wang, Gi-Ming
    Tatsuoka, Curtis
    Cullen, Jennifer
    Madabhushi, Anant
    Driscoll, James J.
    JCO CLINICAL CANCER INFORMATICS, 2023, 7 : e2300078
  • [34] Variant Effect Prediction in the Age of Machine Learning
    Bromberg, Yana
    Prabakaran, R.
    Kabir, Anowarul
    Shehu, Amarda
    COLD SPRING HARBOR PERSPECTIVES IN BIOLOGY, 2024, 16 (07):
  • [35] Fall Risk Prediction in Multiple Sclerosis Using Postural Sway Measures: A Machine Learning Approach
    Ruopeng Sun
    Katherine L. Hsieh
    Jacob J. Sosnoff
    Scientific Reports, 9
  • [36] Fall Risk Prediction in Multiple Sclerosis Using Postural Sway Measures: A Machine Learning Approach
    Sun, Ruopeng
    Hsieh, Katherine L.
    Sosnoff, Jacob J.
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [37] Comprehensive accurate prediction of critical jet fuel properties with multiple machine learning models
    Shao, Yitong
    Yu, Mengxian
    Zhao, Mengchao
    Xue, Kang
    Zhang, Xiangwen
    Zou, Ji-Jun
    Pan, Lun
    CHEMICAL ENGINEERING SCIENCE, 2025, 304
  • [38] A Review on Proteomic Function Prediction in Pathogenic Bacterial Organism Using Machine Learning
    Vijay, Anushri
    Tiwari, Neha
    Sharma, Amita
    PROCEEDINGS OF SEVENTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2022, VOL. 2, 2023, 448 : 891 - 900
  • [39] Rapid and Accurate Prediction of the Axial Magnetic Anisotropy in Cobalt(II) Complexes Using a Machine-Learning Approach
    Silva Junior, Henrique C.
    Menezes, Heloisa N. S.
    Ferreira, Glaucio B.
    Guedes, Guilherme P.
    INORGANIC CHEMISTRY, 2023, 62 (37) : 14838 - 14842
  • [40] Accurate prediction of pressure losses using machine learning for the pipeline transportation of emulsions
    Hafsa, Noor
    Rushd, Sayeed
    Alzoubi, Hadeel
    Al-Faiad, Majdi
    HELIYON, 2024, 10 (01)