Comprehensive hepatotoxicity prediction: ensemble model integrating machine learning and deep learning

被引:0
|
作者
Khan, Muhammad Zafar Irshad [1 ]
Ren, Jia-Nan [1 ]
Cao, Cheng [1 ,2 ]
Ye, Hong-Yu-Xiang [1 ]
Wang, Hao [1 ]
Guo, Ya-Min [1 ]
Yang, Jin-Rong [1 ,2 ]
Chen, Jian-Zhong [1 ]
机构
[1] Zhejiang Univ, Coll Pharmaceut Sci, Hangzhou, Peoples R China
[2] Zhejiang Univ, Polytech Inst, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
hepatotoxicity; ensemble model; molecular fingerprints; machine learning; deep learning; LIVER-INJURY; DRUG;
D O I
10.3389/fphar.2024.1441587
中图分类号
R9 [药学];
学科分类号
1007 ;
摘要
Background Chemicals may lead to acute liver injuries, posing a serious threat to human health. Achieving the precise safety profile of a compound is challenging due to the complex and expensive testing procedures. In silico approaches will aid in identifying the potential risk of drug candidates in the initial stage of drug development and thus mitigating the developmental cost.Methods In current studies, QSAR models were developed for hepatotoxicity predictions using the ensemble strategy to integrate machine learning (ML) and deep learning (DL) algorithms using various molecular features. A large dataset of 2588 chemicals and drugs was randomly divided into training (80%) and test (20%) sets, followed by the training of individual base models using diverse machine learning or deep learning based on three different kinds of descriptors and fingerprints. Feature selection approaches were employed to proceed with model optimizations based on the model performance. Hybrid ensemble approaches were further utilized to determine the method with the best performance.Results The voting ensemble classifier emerged as the optimal model, achieving an excellent prediction accuracy of 80.26%, AUC of 82.84%, and recall of over 93% followed by bagging and stacking ensemble classifiers method. The model was further verified by an external test set, internal 10-fold cross-validation, and rigorous benchmark training, exhibiting much better reliability than the published models.Conclusion The proposed ensemble model offers a dependable assessment with a good performance for the prediction regarding the risk of chemicals and drugs to induce liver damage.
引用
下载
收藏
页数:15
相关论文
共 50 条
  • [1] Enhancing Question Pairs Identification with Ensemble Learning: Integrating Machine Learning and Deep Learning Models
    Tarek, Salsabil
    Noaman, Hatem M.
    Kayed, Mohammed
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (11) : 981 - 992
  • [2] Hybrid deep learning model for ozone concentration prediction: comprehensive evaluation and comparison with various machine and deep learning algorithms
    Yafouz, Ayman
    Ahmed, Ali Najah
    Zaini, Nur'atiah
    Sherif, Mohsen
    Sefelnasr, Ahmed
    El-Shafie, Ahmed
    ENGINEERING APPLICATIONS OF COMPUTATIONAL FLUID MECHANICS, 2021, 15 (01) : 902 - 933
  • [3] Prediction of anticancer peptides based on an ensemble model of deep learning and machine learning using ordinal positional encoding
    Yuan, Qitong
    Chen, Keyi
    Yu, Yimin
    Le, Nguyen Quoc Khanh
    Chua, Matthew Chin Heng
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (01)
  • [4] Novel Machine Learning Method Integrating Ensemble Learning and Deep Learning for Mapping Debris-Covered Glaciers
    Lu, Yijie
    Zhang, Zhen
    Shangguan, Donghui
    Yang, Junhua
    REMOTE SENSING, 2021, 13 (13)
  • [5] Mixed learning algorithms and features ensemble in hepatotoxicity prediction
    Chin Yee Liew
    Yen Ching Lim
    Chun Wei Yap
    Journal of Computer-Aided Molecular Design, 2011, 25 : 855 - 871
  • [6] Mixed learning algorithms and features ensemble in hepatotoxicity prediction
    Liew, Chin Yee
    Lim, Yen Ching
    Yap, Chun Wei
    JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2011, 25 (09) : 855 - 871
  • [8] A comprehensive evaluation of statistical, machine learning and deep learning models for time series prediction
    Xuan, Ang
    Yin, Mengmeng
    Li, Yupei
    Chen, Xiyu
    Ma, Zhenliang
    2022 7TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND MACHINE LEARNING APPLICATIONS (CDMA 2022), 2022, : 55 - 60
  • [9] A Comprehensive Review on Crop Disease Prediction Based on Machine Learning and Deep Learning Techniques
    Patil, Manoj A.
    Manohar, M.
    THIRD CONGRESS ON INTELLIGENT SYSTEMS, CIS 2022, VOL 1, 2023, 608 : 481 - 503
  • [10] Optimized ensemble machine learning model for software bugs prediction
    Femi Johnson
    Olayiwola Oluwatobi
    Olusegun Folorunso
    Alomaja Victor Ojumu
    Alatishe Quadri
    Innovations in Systems and Software Engineering, 2023, 19 : 91 - 101