CogNet: classification of gene expression data based on ranked active-subnetwork-oriented KEGG pathway enrichment analysis

Cited by: 0
Authors
Yousef M. [1, 2]
Ülgen E. [3 ]
Sezerman O.U. [3 ]
Affiliations
[1] Galilee Digital Health Research Center (GDH), Zefat Academic College, Zefat
[2] Department of Information Systems, Zefat Academic College, Zefat
[3] Department of Biostatistics and Medical Informatics, School of Medicine, Acibadem Mehmet Ali Aydinlar University, Istanbul
Source
Yousef, Malik (malik.yousef@zefat.ac.il) | PeerJ Inc., 2021, Issue 07
Keywords
Bioinformatics; Classification; Data mining; Data science; Enrichment analysis; Gene expression; Genomics; KEGG pathway; Machine learning; Rank
DOI
10.7717/PEERJ-CS.336
Abstract
Most traditional gene selection approaches are borrowed from other fields such as statistics and computer science. However, they do not prioritize biologically relevant genes, since their ultimate goal is to determine features that optimize model performance metrics, not to build a biologically meaningful model. Therefore, there is a pressing need for new computational tools that integrate biological knowledge about the data into the process of gene selection and machine learning. Integrative gene selection enables the incorporation of biological domain knowledge from external biological resources. In this study, we propose CogNet, an integrative gene selection tool that exploits biological knowledge to group genes for the computational modeling tasks of ranking and classification. In CogNet, pathfindR serves as the biological grouping tool, allowing the main algorithm to rank active-subnetwork-oriented KEGG pathway enrichment analysis results and build a biologically relevant model. CogNet provides a list of significant KEGG pathways that can classify the data with very high accuracy. The list also provides the differentially expressed genes belonging to these pathways, which are used as features in the classification problem. The list facilitates deeper analysis and better interpretability of the role of KEGG pathways in classifying the data, thus better establishing the biological relevance of these differentially expressed genes. Although the main aim of our study is not to improve on the accuracy of any existing tool, CogNet outperforms a similar approach called maTE while performing comparably to other similar tools, including SVM-RCE. CogNet was tested on 13 gene expression datasets concerning a variety of diseases. © 2021 Yousef et al.
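The general idea of ranking gene groups by how well they classify samples can be illustrated with a small sketch. This is not the CogNet implementation: the abstract does not specify the classifier or the scoring procedure, so the example assumes a nearest-centroid classifier scored by leave-one-out accuracy, and the gene names, pathway names, and expression values are hypothetical toy data. In CogNet the groups would come from pathfindR enrichment results, which are not reproduced here.

```python
from statistics import mean

def centroid(rows, genes):
    """Mean expression of each gene in `genes` across the given samples."""
    return [mean(r[g] for r in rows) for g in genes]

def sq_dist(sample, center, genes):
    """Squared Euclidean distance between a sample and a centroid."""
    return sum((sample[g] - c) ** 2 for g, c in zip(genes, center))

def loo_accuracy(samples, labels, genes):
    """Leave-one-out accuracy of a nearest-centroid classifier
    that only sees the genes of one group (assumed scoring scheme)."""
    hits = 0
    for i, (x, y) in enumerate(zip(samples, labels)):
        train = [(s, l) for j, (s, l) in enumerate(zip(samples, labels)) if j != i]
        best_label, best_d = None, float("inf")
        for label in sorted({l for _, l in train}):
            rows = [s for s, l in train if l == label]
            d = sq_dist(x, centroid(rows, genes), genes)
            if d < best_d:
                best_label, best_d = label, d
        hits += best_label == y
    return hits / len(samples)

def rank_groups(samples, labels, groups):
    """Score every gene group and return (name, accuracy) pairs, best first."""
    scored = [(name, loo_accuracy(samples, labels, genes))
              for name, genes in groups.items()]
    return sorted(scored, key=lambda t: t[1], reverse=True)

# Hypothetical toy data: g1/g2 separate the classes, g3/g4 do not.
samples = [
    {"g1": 5.0, "g2": 4.8, "g3": 1.0, "g4": 2.0},
    {"g1": 5.2, "g2": 5.1, "g3": 2.0, "g4": 1.0},
    {"g1": 4.9, "g2": 5.0, "g3": 1.0, "g4": 1.0},
    {"g1": 1.0, "g2": 1.2, "g3": 2.0, "g4": 2.0},
    {"g1": 0.9, "g2": 1.1, "g3": 1.0, "g4": 2.0},
    {"g1": 1.1, "g2": 0.8, "g3": 2.0, "g4": 1.0},
]
labels = ["case", "case", "case", "control", "control", "control"]
groups = {"pathway_A": ["g1", "g2"], "pathway_B": ["g3", "g4"]}  # hypothetical groups

ranking = rank_groups(samples, labels, groups)
for name, acc in ranking:
    print(f"{name}: {acc:.2f}")
```

In this toy setting the group built from the informative genes ranks first, which mirrors the abstract's point that the top-ranked pathways also name the genes used as classification features.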
Pages: 1-20
Page count: 19
Related papers
37 in total
  • [1] CogNet: classification of gene expression data based on ranked active-subnetwork-oriented KEGG pathway enrichment analysis
    Yousef, Malik
    Ulgen, Ege
    Sezerman, Osman Ugur
    PEERJ COMPUTER SCIENCE, 2021,
  • [2] Gene Ontology and KEGG Pathway Enrichment Analysis of a Drug Target-Based Classification System
    Chen, Lei
    Chu, Chen
    Lu, Jing
    Kong, Xiangyin
    Huang, Tao
    Cai, Yu-Dong
    PLOS ONE, 2015, 10 (05):
  • [3] Array2KEGG: Web-based tool of KEGG pathway analysis for gene expression profile
    Kim, Jun-Sub
    Kim, Seung-Jun
    Park, Hye-Won
    Youn, Jong-Pil
    An, Yu Ri
    Cho, Hyunseok
    Hwang, Seung Yong
    BIOCHIP JOURNAL, 2010, 4 (02) : 134 - 140
  • [4] Array2KEGG: Web-based tool of KEGG Pathway analysis for gene expression profile
    Kim, Jun-Sub
    Kim, Seung Jun
    Park, Hye-Won
    Yu, So Yeon
    Youn, Jong-Pil
    An, Yu Ri
    Cho, Hyunseok
    Hwang, Seung Yong
    MOLECULAR & CELLULAR TOXICOLOGY, 2010, 6 (03) : 43 - 43
  • [5] Array2KEGG: Web-based tool of KEGG pathway analysis for gene expression profile
    Jun-Sub Kim
    Seung-Jun Kim
    Hye-Won Park
    Jong-Pil Youn
    Yu Ri An
    Hyunseok Cho
    Seung Yong Hwang
    BioChip Journal, 2010, 4 : 134 - 140
  • [6] Analysis and prediction of protein stability based on interaction network, gene ontology, and KEGG pathway enrichment scores
    Huang, Feiming
    Fu, Minfei
    Li, JiaRui
    Chen, Lei
    Feng, KaiYan
    Huang, Tao
    Cai, Yu-Dong
    BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS, 2023, 1871 (03):
  • [7] A Method of Pathway Enrichment Analysis Based Gene Expression Variability
    Jia Xiao-Dong
    Chen Xiu-Jie
    Wu Xin
    Xu Jian-Kai
    Tan Fu-Jian
    Liu Xiang-Qiong
    Liu Lei
    Yang Rui-Zhi
    PROGRESS IN BIOCHEMISTRY AND BIOPHYSICS, 2013, 40 (12) : 1256 - 1264
  • [8] STAGEs: A web-based tool that integrates data visualization and pathway enrichment analysis for gene expression studies
    Clara W. T. Koh
    Justin S. G. Ooi
    Eugenia Ziying Ong
    Kuan Rong Chan
    Scientific Reports, 13
  • [9] STAGEs: A web-based tool that integrates data visualization and pathway enrichment analysis for gene expression studies
    Koh, Clara W. T.
    Ooi, Justin S. G.
    Ong, Eugenia Ziying
    Chan, Kuan Rong
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [10] Integrative Analysis of Gene Expression Data Including an Assessment of Pathway Enrichment for Predicting Prostate Cancer
    Hu, Pingzhao
    Greenwood, Celia M.
    Beyene, Joseph
    CANCER INFORMATICS, 2006, 2 : 289 - 300