Similarity-Based Machine Learning Model for Predicting the Metabolic Pathways of Compounds

Cited by: 51
Authors
Jia, Yanjuan [1 ]
Zhao, Ran [1 ]
Chen, Lei [1 ]
Affiliations
[1] Shanghai Maritime Univ, Coll Informat Engn, Shanghai 201306, Peoples R China
Funding
Natural Science Foundation of Shanghai
Keywords
Compounds; Feature extraction; Biochemistry; Machine learning; Radio frequency; Classification algorithms; Predictive models; Metabolic pathway; chemical-chemical association; random forest; NETWORKS; STITCH;
DOI
10.1109/ACCESS.2020.3009439
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Metabolic pathways refer to the continuous chemical reactions of the metabolic process in vivo. Compounds are the major participants in most metabolic pathways, so it is essential to determine which compounds can constitute a metabolic pathway. This problem can be converted to the identification of the metabolic pathways of compounds. Although traditional experiments can provide solid results, they are inefficient and costly. To date, several machine learning models have been proposed to address this problem. However, almost all of these models identify only the metabolic pathway types of compounds rather than the actual metabolic pathways. This study proposed a novel model for predicting the actual metabolic pathways of given compounds. Pairs of compounds and metabolic pathways were treated as samples, thereby modeling a binary classification problem. Using the concept of "similarity", each sample was represented by seven features, extracted from seven associations of compounds that measure compound linkages from different aspects. The model adopted random forest as the classification algorithm. Two types of ten-fold cross-validation were adopted to evaluate the performance of the model, and the results indicated its utility. A feature analysis was also performed to determine which compound association was highly related to the identification of the metabolic pathways of compounds.
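The abstract does not spell out how the similarity features are computed, so the following is only a minimal sketch of one plausible reading: for a (compound, pathway) sample, each association contributes one feature, taken here as the highest score between the compound and any other known member of the pathway. The function names, the max aggregation, the toy scores, and the use of two associations instead of seven are all assumptions made for illustration, not the authors' method.

```python
from typing import Dict, FrozenSet, List, Set

# Hypothetical representation: an association maps an unordered
# compound pair to a score in [0, 1].
Association = Dict[FrozenSet[str], float]


def similarity(assoc: Association, a: str, b: str) -> float:
    """Look up the score of an unordered compound pair; missing pairs score 0."""
    return assoc.get(frozenset((a, b)), 0.0)


def pair_features(
    compound: str, members: Set[str], associations: List[Association]
) -> List[float]:
    """Build one feature per association for a (compound, pathway) sample.

    Assumed aggregation: the highest score between `compound` and any
    other known member of the pathway.
    """
    others = members - {compound}
    return [
        max((similarity(assoc, compound, m) for m in others), default=0.0)
        for assoc in associations
    ]


# Toy data (made up): two associations stand in for the seven in the abstract.
structure = {frozenset(("C1", "C2")): 0.9, frozenset(("C1", "C3")): 0.2}
text_mining = {frozenset(("C1", "C3")): 0.6}
pathway_members = {"C2", "C3"}

features = pair_features("C1", pathway_members, [structure, text_mining])
print(features)  # [0.9, 0.6]
```

Feature vectors built this way, labeled 1 when the compound belongs to the pathway and 0 otherwise, would form the binary classification data set that a random forest could then be trained and cross-validated on.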
Pages: 130687 - 130696
Page count: 10
Related papers
50 in total
  • [31] Similarity-based and knowledge-based processes in category learning
    Hayes, BK
    Taplin, JE
    EUROPEAN JOURNAL OF COGNITIVE PSYCHOLOGY, 1995, 7 (04) : 383 - 410
  • [32] Similarity-based integrity protection for deep learning systems
    Hou, Ruitao
    Ai, Shan
    Chen, Qi
    Yan, Hongyang
    Huang, Teng
    Chen, Kongyang
    INFORMATION SCIENCES, 2022, 601 : 255 - 267
  • [33] Fast Abductive Learning by Similarity-based Consistency Optimization
    Huang, Yu-Xuan
    Dai, Wang-Zhou
    Cai, Le-Wen
    Muggleton, Stephen
    Jiang, Yuan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [34] Maximum entropy generative models for similarity-based learning
    Gupta, Maya R.
    Cazzanti, Luca
    Koppal, Anjah J.
    2007 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY PROCEEDINGS, VOLS 1-7, 2007, : 2221 - +
  • [35] A similarity-based QSAR model for predicting acute toxicity towards the fathead minnow (Pimephales promelas)
    Cassotti, M.
    Ballabio, D.
    Todeschini, R.
    Consonni, V.
    SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2015, 26 (03) : 217 - 243
  • [36] A Similarity-Based Learning Algorithm Using Distance Transformation
    Hu, Yuh-Jyh
    Yu, Min-Che
    Wang, Hsiang-An
    Ting, Zih-Yun
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (06) : 1452 - 1464
  • [37] AN EXPERIMENT IN THE APPLICATION OF SIMILARITY-BASED LEARNING TO PROGRAMMING BY EXAMPLE
    MITROVIC, A
    WITTEN, IH
    MAULSBY, DL
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 1994, 9 (04) : 341 - 364
  • [38] Interpoint Similarity-Based Uncertainty Measure for Robust Learning
    Wang, Yan
    Li, Han-Xiong
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (12) : 5386 - 5394
  • [39] A Combined Bayesian and Similarity-Based Approach for Predicting E. coli Biofilm Inhibition by Phenolic Natural Compounds
    Stepanov, Dmitri
    Buchmann, David
    Schultze, Nadin
    Wolber, Gerhard
    Schaufler, Katharina
    Guenther, Sebastian
    Belik, Vitaly
    JOURNAL OF NATURAL PRODUCTS, 2022, 85 (10) : 2255 - 2265
  • [40] Similarity-Based Methods and Machine Learning Approaches for Target Prediction in Early Drug Discovery: Performance and Scope
    Mathai, Neann
    Kirchmair, Johannes
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2020, 21 (10)