A similarity-based method for prediction of drug side effects with heterogeneous information

被引:134
|
作者
Xian, Zhao [1 ]
Lei, Chen [1 ,2 ]
Jing, Lu [3 ]
机构
[1] Shanghai Maritime Univ, Coll Informat Engn, Shanghai 201306, Peoples R China
[2] East China Normal Univ, Shanghai Key Lab PMMP, Shanghai 200241, Peoples R China
[3] Yantai Univ, Collaborat Innovat Ctr Adv Drug Delivery Syst & B, Minist Educ, Sch Pharm,Key Lab Mol Pharmacol & Drug Evaluat, Yantai 264005, Peoples R China
基金
上海市自然科学基金;
关键词
Drug side effect; Drug similarity; ATC code; Target protein; Minimum redundancy maximum relevance; RANDOM FOREST; INTERACTION NETWORKS; CLASSIFICATION; INTEGRATION; RELEVANCE; STITCH; KEGG;
D O I
10.1016/j.mbs.2018.09.010
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Drugs can produce intended therapeutic effects to treat different diseases. However, they may also cause side effects at the same time. For an approved drug, it is best to detect all side effects it can produce. Otherwise, it may bring great risks for pharmaceuticals companies as well as be harmful to human body. It is urgent to design quick and reliable identification methods to detect the side effects for a given drug. In this study, a binary classification model was proposed to predict drug side effects. Different from most previous methods, our model termed the pair of drug and side effect as a sample and convert the original problem to a binary classification problem. Based on the similarity idea, each pair was represented by five features, each of which was derived from a type of drug property. The strong machine learning algorithm, random forest, was adopted as the prediction engine. The ten-fold cross-validation on five datasets with different negative samples indicated that the proposed model yielded a good performance of Matthews correlation coefficient around 0.550 and AUC around 0.8492. In addition, we also analyzed the contribution of each drug property for construction of the model. The results indicated that drug similarity in fingerprint was most related to the prediction of drug side effects and all drug properties gave less or more contributions.
引用
收藏
页码:136 / 144
页数:9
相关论文
共 50 条
  • [41] A Semantic Similarity-Based Identification Method for Implicit Citation Functions and Sentiments Information
    Malkawi, Rami
    Daradkeh, Mohammad
    El-Hassan, Ammar
    Petrov, Pavel
    [J]. INFORMATION, 2022, 13 (11)
  • [42] Predicting Drug-Drug Interactions Through Similarity-Based Link Prediction Over Web Data
    Fokoue, Achille
    Hassanzadeh, Oktie
    Sadoghi, Mohammad
    Zhang, Ping
    [J]. PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16 COMPANION), 2016, : 175 - 178
  • [43] Predicting Drug-Drug Interactions Through Large-Scale Similarity-Based Link Prediction
    Fokoue, Achille
    Sadoghi, Mohammad
    Hassanzadeh, Oktie
    Zhang, Ping
    [J]. SEMANTIC WEB: LATEST ADVANCES AND NEW DOMAINS, 2016, 9678 : 774 - 789
  • [44] A Similarity-based Fuzzy Soft Reasoning Method
    Wang, Lu
    Xue, Binbin
    Qin, Keyun
    [J]. 2017 12TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (IEEE ISKE), 2017,
  • [45] The directional similarity-based clustering method DSCM
    School of Information Engineering, Southern Yangtze University, Wuxi 214036, China
    不详
    不详
    不详
    [J]. Jisuanji Yanjiu yu Fazhan, 2006, 8 (1425-1431):
  • [46] Attacking Similarity-Based Link Prediction in Social Networks
    Zhou, Kai
    Michalak, Tomasz P.
    Waniek, Marcin
    Rahwan, Talal
    Vorobeychik, Yevgeniy
    [J]. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 305 - 313
  • [47] Similarity-based Method for Reduction of Fuzzy Rules
    Garcia-Garcia, Arturo
    Reformat, Marek Z.
    Mendez-Vazquez, Andres
    [J]. 2016 ANNUAL CONFERENCE OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY (NAFIPS), 2016,
  • [48] Software framework for similarity-based prediction of protein interfaces
    Jelinek, Jan
    Skoda, Petr
    Hoksza, David
    [J]. PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 2759 - 2761
  • [49] Localizing Heterogeneous Access Points using Similarity-based Sequence
    Liu, Ran
    Padmal, Madhushanka
    Marakkalage, Sumudu Hasala
    Shaganan, Thiruketheeswaran
    Yuen, Chau
    Tan, U-Xuan
    [J]. 2018 3RD IEEE INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (IEEE ICARM), 2018, : 306 - 311
  • [50] A method for similarity-based grouping of biological data
    Jakoniene, Vaida
    Rundqvist, David
    Lambrix, Patrick
    [J]. DATA INTEGRATION IN THE LIFE SCIENCES, PROCEEDINGS, 2006, 4075 : 136 - 151