Integrated multiple microarray studies by robust rank aggregation to identify immune-associated biomarkers in Crohn's disease based on three machine learning methods

被引:8
|
作者
Chen, Zi-An [1 ,2 ]
Ma, Hui-hui [1 ,2 ]
Wang, Yan [1 ,2 ]
Tian, Hui [1 ,2 ]
Mi, Jian-wei [1 ,2 ]
Yao, Dong-Mei [1 ,2 ]
Yang, Chuan-Jie [1 ,2 ]
机构
[1] Hebei Med Univ, Dept Gastroenterol, Hosp 2, Shijiazhuang 050000, Hebei, Peoples R China
[2] Hebei Clin Res Ctr Digest Dis, Hebei Inst Gastroenterol, Hebei Key Lab Gastroenterol, Shijiazhuang 050000, Hebei, Peoples R China
关键词
COLONY-ENHANCING FACTOR; INFLAMMATORY-BOWEL-DISEASE; ULCERATIVE-COLITIS; CELLS; EXPRESSION; ARTHRITIS; INTERLEUKIN-17; RECEPTOR; PATHOGENESIS; SUPPRESSION;
D O I
10.1038/s41598-022-26345-1
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Crohn's disease (CD) is a complex autoimmune disorder presumed to be driven by complex interactions of genetic, immune, microbial and even environmental factors. Intrinsic molecular mechanisms in CD, however, remain poorly understood. The identification of novel biomarkers in CD cases based on larger samples through machine learning approaches may inform the diagnosis and treatment of diseases. A comprehensive analysis was conducted on all CD datasets of Gene Expression Omnibus (GEO); our team then used the robust rank aggregation (RRA) method to identify differentially expressed genes (DEGs) between controls and CD patients. PPI (protein-protein interaction) network and functional enrichment analyses were performed to investigate the potential functions of the DEGs, with molecular complex detection (MCODE) identifying some important functional modules from the PPI network. Three machine learning algorithms, support vector machine-recursive feature elimination (SVM-RFE), random forest (RF), and least absolute shrinkage and selection operator (LASSO), were applied to determine characteristic genes, which were verified by ROC curve analysis and immunohistochemistry (IHC) using clinical samples. Univariable and multivariable logistic regression were used to establish a machine learning score for diagnosis. Single-sample GSEA (ssGSEA) was performed to examine the correlation between immune infiltration and biomarkers. In total, 5 datasets met the inclusion criteria: GSE75214, GSE95095, GSE126124, GSE179285, and GSE186582. Based on RRA integrated analysis, 203 significant DEGs were identified (120 upregulated genes and 83 downregulated genes), and MCODE revealed some important functional modules in the PPI network. Machine learning identified LCN2, REG1A, AQP9, CCL2, GIP, PROK2, DEFA5, CXCL9, and NAMPT; AQP9, PROK2, LCN2, and NAMPT were further verified by ROC curves and IHC in the external cohort. The final machine learning score was defined as [Expression level of AQP9 x (2.644)] + [Expression level of LCN2 x (0.958)] + [Expression level of NAMPT x (1.115)]. ssGSEA showed markedly elevated levels of dendritic cells and innate immune cells, such as macrophages and NK cells, in CD, consistent with the gene enrichment results that the DEGs are mainly involved in the IL-17 signaling pathway and humoral immune response. The selected biomarkers analyzed by the RRA method and machine learning are highly reliable. These findings improve our understanding of the molecular mechanisms of CD pathogenesis.
引用
收藏
页数:15
相关论文
共 5 条
  • [1] Integrated multiple microarray studies by robust rank aggregation to identify immune-associated biomarkers in Crohn's disease based on three machine learning methods
    Zi-An Chen
    Hui-hui Ma
    Yan Wang
    Hui Tian
    Jian-wei Mi
    Dong-Mei Yao
    Chuan-Jie Yang
    Scientific Reports, 13
  • [2] Predicting diagnostic biomarkers associated with immune infiltration in Crohn's disease based on machine learning and bioinformatics
    Wenhui Bao
    Lin Wang
    Xiaoxiao Liu
    Ming Li
    European Journal of Medical Research, 28
  • [3] Predicting diagnostic biomarkers associated with immune infiltration in Crohn's disease based on machine learning and bioinformatics
    Bao, Wenhui
    Wang, Lin
    Liu, Xiaoxiao
    Li, Ming
    EUROPEAN JOURNAL OF MEDICAL RESEARCH, 2023, 28 (01)
  • [4] Identification of immune-associated biomarkers of diabetes nephropathy tubulointerstitial injury based on machine learning: a bioinformatics multi-chip integrated analysis
    Wang, Lin
    Su, Jiaming
    Liu, Zhongjie
    Ding, Shaowei
    Li, Yaotan
    Hou, Baoluo
    Hu, Yuxin
    Dong, Zhaoxi
    Tang, Jingyi
    Liu, Hongfang
    Liu, Weijing
    BIODATA MINING, 2024, 17 (01):
  • [5] Integrated analysis of multiple microarray studies to establish differential diagnostic models of Crohn's disease and ulcerative colitis based on a metalloproteinase-associated module
    Deng, Jiang
    Zhao, Ning
    Lv, Li-ping
    Ma, Ping
    Zhang, Yang-yang
    Xu, Jin-bo
    Zhou, Xi-peng
    Chen, Zi-an
    Zhang, Yan-yu
    FRONTIERS IN IMMUNOLOGY, 2022, 13