Prediction and Screening Model for Products Based on Fusion Regression and XGBoost Classification

Cited by: 9
Authors
Wu, Jiaju [1 ,2 ]
Kong, Linggang [2 ]
Yi, Ming [2 ]
Chen, Qiuxian [2 ]
Cheng, Zheng [2 ]
Zuo, Hongfu [1 ]
Yang, Yonghui [2 ]
Affiliations
[1] Nanjing Univ Aeronaut & Astronaut, Coll Civil Aviat, Nanjing 210016, Peoples R China
[2] China Acad Engn Phys, Inst Comp Applicat, Mianyang 621900, Sichuan, Peoples R China
Keywords
ADMET EVALUATION; DRUG; DESIGN;
DOI
10.1155/2022/4987639
Chinese Library Classification (CLC) number
Q [Biological Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Predicting the performance of candidates and screening them by predicted performance are at the core of product development. For example, performance prediction and screening of equipment components and parts are an important guarantee of equipment reliability, and predicting and screening drug bioactivity and related properties are key to pharmaceutical product development. Pharmaceutical discovery fails mainly because candidate compounds have low bioactivity or deficiencies in efficacy and safety, which are related to the absorption, distribution, metabolism, excretion, and toxicity (ADMET) of the compounds. It is therefore necessary to predict bioactivity and evaluate the ADMET properties of candidate compounds quickly, effectively, and systematically in the early stage of drug discovery. This paper proposes a data-driven screening and prediction model for pharmaceutical products that selects drug candidates with higher bioactivity values and better ADMET properties. First, a quantitative prediction method for the bioactivity value is proposed, using fusion regression of LGBM and a backpropagation neural network (BP-NN). Then, a method for predicting ADMET properties is proposed using XGBoost. Based on the predicted bioactivity value and ADMET properties, the BVAP method is defined to screen drug candidates. The screening model is validated on a dataset of compounds with ERα antagonist activity: the mean square error (MSE) of the fusion regression is 1.1496, and the XGBoost prediction accuracies for the ADMET properties are 94.0% for Caco-2, 95.7% for CYP3A4, 89.4% for hERG, 88.6% for HOB, and 96.2% for MN.
Compared with commonly used methods for ADMET property prediction, such as SVM, RF, KNN, LDA, and NB, the XGBoost model in this paper achieves the highest prediction accuracy and AUC value; it therefore offers better guidance and can help screen pharmaceutical product candidates with good bioactivity, pharmacokinetic properties, and safety.
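The abstract's pipeline (fuse two regressors, score the fusion by MSE, then screen on predicted bioactivity plus ADMET properties) can be illustrated with a minimal sketch. The abstract does not give the fusion rule or the definition of BVAP, so everything here is an assumption for illustration: the equal-weight average, the function names, the thresholds, the toy numbers, and the convention that each ADMET flag is already encoded as 1 = favorable. It is not the paper's method.

```python
def fuse(pred_a, pred_b, weight_a=0.5):
    """Weighted average of two models' predictions.

    The 0.5 weight is an assumption; the abstract does not state
    how the LGBM and BP-NN outputs are combined.
    """
    return [weight_a * a + (1.0 - weight_a) * b for a, b in zip(pred_a, pred_b)]


def mse(y_true, y_pred):
    """Mean square error between true and predicted values."""
    errors = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return sum(errors) / len(errors)


def screen(candidates, min_activity, min_favorable):
    """Hypothetical screening rule: keep candidates whose predicted
    bioactivity and number of favorable ADMET flags both meet a threshold."""
    return [
        c["name"]
        for c in candidates
        if c["activity"] >= min_activity
        and sum(c["admet"].values()) >= min_favorable
    ]


# Toy values standing in for the outputs of two trained regressors.
y_true = [6.0, 7.0, 8.0]
pred_model_a = [6.2, 6.8, 8.4]   # e.g. a gradient-boosting regressor
pred_model_b = [5.8, 7.4, 7.8]   # e.g. a neural-network regressor
fused = fuse(pred_model_a, pred_model_b)
fusion_mse = mse(y_true, fused)

# Toy candidates; ADMET flags are assumed to be encoded as 1 = favorable.
candidates = [
    {"name": "cpd_1", "activity": fused[0],
     "admet": {"Caco-2": 1, "CYP3A4": 0, "hERG": 1, "HOB": 0, "MN": 0}},
    {"name": "cpd_2", "activity": fused[1],
     "admet": {"Caco-2": 1, "CYP3A4": 1, "hERG": 1, "HOB": 0, "MN": 1}},
    {"name": "cpd_3", "activity": fused[2],
     "admet": {"Caco-2": 1, "CYP3A4": 1, "hERG": 0, "HOB": 1, "MN": 0}},
]
selected = screen(candidates, min_activity=7.0, min_favorable=3)
```

In this toy setup the fused prediction is closer to the true values than either input on its own, which is the usual motivation for fusing models; in practice the weight would be chosen on validation data.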
Pages: 14