Mixed learning algorithms and features ensemble in hepatotoxicity prediction

被引:0
|
作者
Chin Yee Liew
Yen Ching Lim
Chun Wei Yap
机构
[1] National University of Singapore,Department of Pharmacy, Pharmaceutical Data Exploration Laboratory
关键词
Ensemble; Consensus; Meta-learner; Mixed variables; Mixed algorithm; Prediction; Drug-induced liver injuries; Hepatotoxicity; Drug discovery; Support vector machine; k-Nearest neighbor; Naive Bayes; QSTR;
D O I
暂无
中图分类号
学科分类号
摘要
Drug-induced liver injury, although infrequent, is an important safety concern that can lead to fatality in patients and failure in drug developments. In this study, we have used an ensemble of mixed learning algorithms and mixed features for the development of a model to predict hepatic effects. This robust method is based on the premise that no single learning algorithm is optimum for all modelling problems. An ensemble model of 617 base classifiers was built from a diverse set of 1,087 compounds. The ensemble model was validated internally with five-fold cross-validation and 25 rounds of y-randomization. In the external validation of 120 compounds, the ensemble model had achieved an accuracy of 75.0%, sensitivity of 81.9% and specificity of 64.6%. The model was also able to identify 22 of 23 withdrawn drugs or drugs with black box warning against hepatotoxicity. Dronedarone which is associated with severe liver injuries, announced in a recent FDA drug safety communication, was predicted as hepatotoxic by the ensemble model. It was found that the ensemble model was capable of classifying positive compounds (with hepatic effects) well, but less so on negatives compounds when they were structurally similar. The ensemble model built in this study is made available for public use.
引用
收藏
页码:855 / 871
页数:16
相关论文
共 50 条
  • [31] Hybrid of Ensemble Machine Learning and Nature-Inspired Algorithms for Divorce Prediction
    Sahle, Kalkidan A.
    Yibre, Abdulkerim M.
    PAN-AFRICAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PT II, PANAFRICON AI 2023, 2024, 2069 : 242 - 264
  • [32] Prediction of the sulfate resistance for recycled aggregate concrete based on ensemble learning algorithms
    Liu, Kaihua
    Dai, Zihang
    Zhang, Rongbin
    Zheng, Jiakai
    Zhu, Jiang
    Yang, Xincong
    CONSTRUCTION AND BUILDING MATERIALS, 2022, 317
  • [33] Daily reservoir inflow prediction using stacking ensemble of machine learning algorithms
    Deb, Deepjyoti
    Arunachalam, Vasan
    Raju, K. Srinivasa
    JOURNAL OF HYDROINFORMATICS, 2024, 26 (05) : 972 - 997
  • [34] A comparative analysis of ensemble learning algorithms with hyperparameter optimization for soil liquefaction prediction
    Demir, Alparslan Serhat
    Kurnaz, Talas Fikret
    Koekcam, Abdullah Hulusi
    Erden, Caner
    Dagdeviren, Ugur
    ENVIRONMENTAL EARTH SCIENCES, 2024, 83 (09)
  • [35] Image Classification Using Ensemble Algorithms with Deep Learning and Hand-Crafted Features
    Tasci, Erdal
    Ugur, Aybars
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [36] Shape complexity analysis of building features with shape indices and ensemble learning classification algorithms
    Duman, Huseyin Safa
    Basaraner, Melih
    GEOMATIK, 2022, 7 (03): : 197 - 208
  • [37] KFPredict: An ensemble learning prediction framework for diabetes based on fusion of key features
    Qi, Huamei
    Song, Xiaomeng
    Liu, Shengzong
    Zhang, Yan
    Wong, Kelvin K. L.
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 231
  • [38] Compound Property Prediction Based on Multiple Different Molecular Features and Ensemble Learning
    Yang, Wenming
    Zou, Jiali
    Yin, Le
    CCKS 2022 - EVALUATION TRACK, 2022, 1711 : 57 - 69
  • [39] Analysis of protein features and machine learning algorithms for prediction of druggable proteins
    Tanlin Sun
    Luhua Lai
    Jianfeng Pei
    Quantitative Biology, 2018, 6 (04) : 334 - 343
  • [40] Analysis of protein features and machine learning algorithms for prediction of druggable proteins
    Sun, Tanlin
    Lai, Luhua
    Pei, Jianfeng
    QUANTITATIVE BIOLOGY, 2018, 6 (04) : 334 - 343