Predicting drug-target interactions using Lasso with random forest based on evolutionary information and chemical structure

被引:142
|
作者
Shi, Han [1 ,2 ,3 ]
Liu, Simin [1 ,2 ,3 ]
Chen, Junqi [1 ,2 ,3 ]
Li, Xuan [3 ]
Ma, Qin [4 ]
Yu, Bin [1 ,2 ,5 ]
机构
[1] Qingdao Univ Sci & Technol, Coll Math & Phys, Qingdao 266061, Shandong, Peoples R China
[2] Qingdao Univ Sci & Technol, Artificial Intelligence & Biomed Big Data Res Ctr, Qingdao 266061, Shandong, Peoples R China
[3] Chinese Acad Sci, CAS Ctr Excellence Mol Plant Sci, Inst Plant Physiol & Ecol, Shanghai Inst Biol Sci,Key Lab Synthet Biol, Shanghai 200032, Peoples R China
[4] Ohio State Univ, Coll Med, Dept Biomed Informat, Columbus, OH 43210 USA
[5] Univ Sci & Technol China, Sch Life Sci, Hefei 230027, Anhui, Peoples R China
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
Drug-target interactions; Pseudo-position specific scoring matrix; Molecular fingerprint; Lasso; SMOTE; Random forest; AMINO-ACID-COMPOSITION; LARGE-SCALE PREDICTION; INTERACTION NETWORKS; IDENTIFICATION; DATABASE; KERNELS;
D O I
10.1016/j.ygeno.2018.12.007
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
The identification of drug-target interactions has great significance for pharmaceutical scientific research. Since traditional experimental methods identifying drug-target interactions is costly and time-consuming, the use of machine learning methods to predict potential drug-target interactions has attracted widespread attention. This paper presents a novel drug-target interactions prediction method called LRF-DTIs. Firstly, the pseudo-position specific scoring matrix (PsePSSM) and FP2 molecular fingerprinting were used to extract the features of drug-target. Secondly, using Lasso to reduce the dimension of the extracted feature information and then the Synthetic Minority Oversampling Technique (SMOTE) method was used to deal with unbalanced data. Finally, the processed feature vectors were input into a random forest (RF) classifier to predict drug-target interactions. Through 10 trials of 5-fold cross-validation, the overall prediction accuracies on the enzyme, ion channel (IC), G-proteincoupled receptor (GPCR) and nuclear receptor (NR) datasets reached 98.09%, 97.32%, 95.69%, and 94.88%, respectively, and compared with other prediction methods. In addition, we have tested and verified that our method not only could be applied to predict the new interactions but also could obtain a satisfactory result on the new dataset. All the experimental results indicate that our method can significantly improve the prediction accuracy of drug-target interactions and play a vital role in the new drug research and target protein development. The source code and all datasets are available at htttps://github.com/QUST-AIBBDRC/LRF-DTIs/ for academic use.
引用
收藏
页码:1839 / 1852
页数:14
相关论文
共 50 条
  • [1] RFDT: A Rotation Forest-based Predictor for Predicting Drug-Target Interactions Using Drug Structure and Protein Sequence Information
    Wang, Lei
    You, Zhu-Hong
    Chen, Xing
    Yan, Xin
    Liu, Gang
    Zhang, Wei
    CURRENT PROTEIN & PEPTIDE SCIENCE, 2018, 19 (05) : 445 - 454
  • [2] Predicting drug-target interaction based on sequence and structure information
    Lan, Wei
    Wang, Jianxin
    Li, Min
    Zheng, Ruiqing
    Wu, Fang-Xiang
    Pan, Yi
    IFAC PAPERSONLINE, 2015, 48 (28): : 12 - 16
  • [3] Predicting Drug-Target Interactions Using Drug-Drug Interactions
    Kim, Shinhyuk
    Jin, Daeyong
    Lee, Hyunju
    PLOS ONE, 2013, 8 (11):
  • [4] SDTRLS: Predicting Drug-Target Interactions for Complex Diseases Based on Chemical Substructures
    Yan, Cheng
    Wang, Jianxin
    Lan, Wei
    Wu, Fang-Xiang
    Pan, Yi
    COMPLEXITY, 2017,
  • [5] Predicting Drug-Target Interactions With Multi-Information Fusion
    Peng, Lihong
    Liao, Bo
    Zhu, Wen
    Li, Zejun
    Li, Keqin
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2017, 21 (02) : 561 - 572
  • [6] Predicting Drug-Target Interactions of Nuclear Receptors Based on Molecular Descriptors Information
    Wang, Junxiang
    Yu, Weiming
    Chen, Zhibin
    Li, Hengda
    Jiang, Zhenran
    LETTERS IN DRUG DESIGN & DISCOVERY, 2013, 10 (10) : 989 - 994
  • [7] MSPEDTI: Prediction of Drug-Target Interactions via Molecular Structure with Protein Evolutionary Information
    Wang, Lei
    Wong, Leon
    Chen, Zhan-Heng
    Hu, Jing
    Sun, Xiao-Fei
    Li, Yang
    You, Zhu-Hong
    BIOLOGY-BASEL, 2022, 11 (05):
  • [8] Predicting Drug-Target Interactions Over Heterogeneous Information Network
    Su, Xiaorui
    Hu, Pengwei
    Yi, Haicheng
    You, Zhuhong
    Hu, Lun
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (01) : 562 - 572
  • [9] Incorporating chemical sub-structures and protein evolutionary information for inferring drug-target interactions
    Lei Wang
    Zhu-Hong You
    Li-Ping Li
    Xin Yan
    Wei Zhang
    Scientific Reports, 10
  • [10] Incorporating chemical sub-structures and protein evolutionary information for inferring drug-target interactions
    Wang, Lei
    You, Zhu-Hong
    Li, Li-Ping
    Yan, Xin
    Zhang, Wei
    SCIENTIFIC REPORTS, 2020, 10 (01)