DDI-PULearn: a positive-unlabeled learning method for large-scale prediction of drug-drug interactions

被引:31
|
作者
Zheng, Yi [1 ]
Peng, Hui [1 ]
Zhang, Xiaocai [1 ]
Zhao, Zhixun [1 ]
Gao, Xiaoying [2 ]
Li, Jinyan [1 ]
机构
[1] Univ Technol Sydney, Fac Engn & Informat Technol, Adv Analyt Inst, 15 Broadway, Ultimo, NSW 2007, Australia
[2] Victoria Univ Wellington, Sch Engn & Comp Sci, Cotton Bldg,Kelburn Campus, Wellington 6140, New Zealand
关键词
Drug interaction prediction; Positive-unlabeled learning; Reliable negative samples; ALGORITHM; DATABASE;
D O I
10.1186/s12859-019-3214-6
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Drug-drug interactions (DDIs) are a major concern in patients' medication. It's unfeasible to identify all potential DDIs using experimental methods which are time-consuming and expensive. Computational methods provide an effective strategy, however, facing challenges due to the lack of experimentally verified negative samples. Results: To address this problem, we propose a novel positive-unlabeled learning method named DDI-PULearn for large-scale drug-drug-interaction predictions. DDI-PULearn first generates seeds of reliable negatives via OCSVM (one-class support vector machine) under a high-recall constraint and via the cosine-similarity based KNN (k-nearest neighbors) as well. Then trained with all the labeled positives (i.e., the validated DDIs) and the generated seed negatives, DDI-PULearn employs an iterative SVM to identify a set of entire reliable negatives from the unlabeled samples (i.e., the unobserved DDIs). Following that, DDI-PULearn represents all the labeled positives and the identified negatives as vectors of abundant drug properties by a similarity-based method. Finally, DDI-PULearn transforms these vectors into a lower-dimensional space via PCA (principal component analysis) and utilizes the compressed vectors as input for binary classifications. The performance of DDI-PULearn is evaluated on simulative prediction for 149,878 possible interactions between 548 drugs, comparing with two baseline methods and five state-of-the-art methods. Related experiment results show that the proposed method for the representation of DDIs characterizes them accurately. DDI-PULearn achieves superior performance owing to the identified reliable negatives, outperforming all other methods significantly. In addition, the predicted novel DDIs suggest that DDI-PULearn is capable to identify novel DDIs. Conclusions: The results demonstrate that positive-unlabeled learning paves a new way to tackle the problem caused by the lack of experimentally verified negatives in the computational prediction of DDIs.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] DDI-PULearn: a positive-unlabeled learning method for large-scale prediction of drug-drug interactions
    Yi Zheng
    Hui Peng
    Xiaocai Zhang
    Zhixun Zhao
    Xiaoying Gao
    Jinyan Li
    [J]. BMC Bioinformatics, 20
  • [2] Screening drug-target interactions with positive-unlabeled learning
    Peng, Lihong
    Zhu, Wen
    Liao, Bo
    Duan, Yu
    Chen, Min
    Chen, Yi
    Yang, Jialiang
    [J]. SCIENTIFIC REPORTS, 2017, 7
  • [3] Screening drug-target interactions with positive-unlabeled learning
    Lihong Peng
    Wen Zhu
    Bo Liao
    Yu Duan
    Min Chen
    Yi Chen
    Jialiang Yang
    [J]. Scientific Reports, 7
  • [4] Positive-Unlabeled Learning for inferring drug interactions based on heterogeneous attributes
    Hameed, Pathima Nusrath
    Verspoor, Karin
    Kusljic, Snezana
    Halgamuge, Saman
    [J]. BMC BIOINFORMATICS, 2017, 18
  • [5] Positive-Unlabeled Learning for inferring drug interactions based on heterogeneous attributes
    Pathima Nusrath Hameed
    Karin Verspoor
    Snezana Kusljic
    Saman Halgamuge
    [J]. BMC Bioinformatics, 18
  • [6] Similarity-based modeling in large-scale prediction of drug-drug interactions
    Vilar, Santiago
    Uriarte, Eugenio
    Santana, Lourdes
    Lorberbaum, Tal
    Hripcsak, George
    Friedman, Carol
    Tatonetti, Nicholas P.
    [J]. NATURE PROTOCOLS, 2014, 9 (09) : 2147 - 2163
  • [7] Similarity-based modeling in large-scale prediction of drug-drug interactions
    Santiago Vilar
    Eugenio Uriarte
    Lourdes Santana
    Tal Lorberbaum
    George Hripcsak
    Carol Friedman
    Nicholas P Tatonetti
    [J]. Nature Protocols, 2014, 9 : 2147 - 2163
  • [8] A positive and unlabeled learning framework based on extreme learning machine for drug-drug interactions discovery
    Bi X.
    Ma H.
    Li J.
    Ma Y.
    Chen D.
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (11) : 1 - 12
  • [9] CNN-DDI: A novel deep learning method for predicting drug-drug interactions
    Zhang, Chengcheng
    Zang, Tianyi
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 1708 - 1713
  • [10] An optimized positive-unlabeled learning method for detecting a large scale of malware variants
    Zhang, Jixin
    Khan, Mohammad Faham
    Lin, Xiaodong
    Qin, Zheng
    [J]. 2019 IEEE CONFERENCE ON DEPENDABLE AND SECURE COMPUTING (DSC), 2019, : 182 - 189