Screening of serum exosome markers for colorectal cancer based on Boruta and multi-cluster feature selection algorithms

Cited by: 3
Authors
Zhu, Jian [1 ]
Luo, Junjie [1 ]
Ma, Yao [1 ]
Affiliations
[1] First Peoples Hosp Linping Dist, Gen Surg Dept, 369 Yingbin Rd, Hangzhou 311100, Peoples R China
Keywords
GEO; Colorectal cancer; Exosomes; miRNAs; Boruta; MCFS; CIRCULATING MICRORNAS; EARLY-DIAGNOSIS; VALIDATION; PROGNOSIS; PREDICTION;
DOI
10.1007/s13273-023-00348-z
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: Early and timely diagnosis improves the prognosis of patients with colorectal cancer (CRC). The purpose of this study was to explore biomarkers with diagnostic ability in colorectal cancer.

Objective: Boruta and multi-cluster feature selection (MCFS) algorithms were applied to expression data of serum exosome-derived microRNAs (miRNAs) from CRC patients in the Gene Expression Omnibus (GEO) database to find candidate feature miRNAs that could distinguish CRC samples from normal samples. To identify the feature miRNAs with the highest diagnostic ability, different support vector machine (SVM) classifiers were constructed, and the SVM classifier with the highest F1 score was selected based on the incremental feature selection (IFS) curve. To validate the clinical application value of the classifier, serum samples from 32 CRC patients and 19 healthy individuals were collected as an external validation set, and serum exosomes were extracted for quantitative real-time polymerase chain reaction (qRT-PCR) analysis. The data were then fed into the model to verify its performance.

Results: After feature selection by the Boruta and MCFS algorithms, the top five candidate miRNAs (miR-21, miR-193b, miR-23a, miR-575, and miR-610) with sufficient ability to distinguish sample types were identified. Among the classifiers constructed, the SVM classifier built on the first four feature miRNAs (miR-21, miR-193b, miR-23a, and miR-575) had the best classification performance. qRT-PCR of serum exosome miRNAs in clinical samples showed that the expression of miR-21, miR-193b, and miR-23a in serum exosomes from CRC patients was significantly higher than in normal samples, while that of miR-575 was significantly lower. The receiver operating characteristic (ROC) curve of the diagnostic model based on the four feature miRNAs was then plotted. The area under the curve (AUC) of the diagnostic model was 0.854, which suggested that the model built on miR-21, miR-193b, miR-23a, and miR-575 could effectively distinguish healthy subjects from CRC patients. In addition, the expression of miR-21, miR-193b, miR-23a, and miR-575 was closely related to tumor size, stage, and the presence of distant metastasis in CRC patients.

Conclusion: miR-21, miR-193b, miR-23a, and miR-575 may form a new potential biomarker combination for the diagnosis of CRC. Clinical biopsy combined with these miRNA biomarkers is expected to promote the early diagnosis of CRC, thus improving the prognosis of CRC patients.
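The selection and evaluation steps named in the abstract (incremental feature selection over a ranked miRNA list, F1 score, and ROC AUC) can be illustrated with a minimal Python sketch. This is not the authors' code: the function names, the toy labels and scores, and the per-prefix F1 values are hypothetical placeholders chosen only for illustration, and the SVM training step is abstracted into an `evaluate` callable.

```python
def f1_score(y_true, y_pred):
    """F1 = harmonic mean of precision and recall for the positive class (label 1)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def roc_auc(y_true, scores):
    """AUC as the probability that a random positive outscores a random negative.

    Ties count as half a win (the Mann-Whitney formulation of the ROC area).
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def incremental_feature_selection(ranked_features, evaluate):
    """Score each top-k prefix of the ranked list and keep the best one.

    `evaluate` is any callable mapping a feature list to a score, for example
    a cross-validated F1 of a classifier trained on those features.
    """
    best_prefix, best_score = [], float("-inf")
    for k in range(1, len(ranked_features) + 1):
        prefix = ranked_features[:k]
        score = evaluate(prefix)
        if score > best_score:
            best_prefix, best_score = prefix, score
    return best_prefix, best_score


# Toy labels (1 = CRC, 0 = healthy) and classifier scores: illustrative only.
y_true = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.5, 0.3, 0.2]
y_pred = [1 if s >= 0.5 else 0 for s in scores]

# Ranked candidates as listed in the abstract; the F1 values below are
# ASSUMED placeholders, not results reported by the study.
ranked = ["miR-21", "miR-193b", "miR-23a", "miR-575", "miR-610"]
placeholder_f1 = {1: 0.70, 2: 0.78, 3: 0.81, 4: 0.86, 5: 0.84}
best_prefix, best_f1 = incremental_feature_selection(
    ranked, lambda prefix: placeholder_f1[len(prefix)]
)

print("F1:", f1_score(y_true, y_pred))
print("AUC:", roc_auc(y_true, scores))
print("Best prefix:", best_prefix)
```

In practice the `evaluate` callable would wrap classifier training and cross-validation; keeping it as a parameter leaves the selection loop independent of the classifier and the metric.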
Pages: 343 - 351
Page count: 9
Related papers
50 in total
  • [1] Screening of serum exosome markers for colorectal cancer based on Boruta and multi-cluster feature selection algorithms
    Jian Zhu
    Junjie Luo
    Yao Ma
    Molecular & Cellular Toxicology, 2024, 20 : 343 - 351
  • [2] Multi-Cluster Feature Selection Based on Isometric Mapping
    Yadi Wang
    Zefeng Zhang
    Yinghao Lin
    IEEE/CAA Journal of Automatica Sinica, 2022, 9 (03) : 570 - 572
  • [3] Multi-Cluster Feature Selection Based on Isometric Mapping
    Wang, Yadi
    Zhang, Zefeng
    Lin, Yinghao
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2022, 9 (03) : 570 - 572
  • [4] Efficient multi-cluster feature selection on text data
    Gupta, Ananya
    Begum, Shahin Ara
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2019, 40 (08) : 1583 - 1598
  • [5] A Comparative Study on Feature Selection Techniques for Multi-cluster Text Data
    Gupta, Ananya
    Begum, Shahin Ara
    HARMONY SEARCH AND NATURE INSPIRED OPTIMIZATION ALGORITHMS, 2019, 741 : 203 - 215
  • [6] Feature Selection on Data Stream via Multi-Cluster structure Preservation
    Ma, Rui
    Wang, Yijie
    Cheng, Li
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 1065 - 1074
  • [7] Unsupervised Feature Selection for Multi-cluster Data via Smooth Distributed Score
    Liu, Furui
    Liu, Xiyan
    EMERGING INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, 2012, 304 : 74 - +
  • [8] Screening of lung cancer serum biomarkers based on Boruta-shap and RFC-RFECV algorithms
    Yue, Guangcheng
    JOURNAL OF PROTEOMICS, 2024, 301
  • [9] Classification of Brain MRI using Multi-Cluster Feature Selection and KNN Classifier
    Kalbkhani, Hashem
    Salimi, Arghavan
    Shayesteh, Mahrokh G.
    2015 23RD IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2015, : 93 - 98
  • [10] Gearbox fault diagnosis based on improved multi-scale fluctuation dispersion entropy and multi-cluster feature selection
    Li, Baoyue
    Yu, Yonghua
    Wang, Weicheng
    Zhang, Ning
    Xie, Meiqiang
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2024,