Feature Selection in Single-Cell RNA-seq Data via a Genetic Algorithm

被引:3
|
作者
Chatzilygeroudis, Konstantinos I. [1 ,2 ]
Vrahatis, Aristidis G. [3 ]
Tasoulis, Sotiris K. [3 ]
Vrahatis, Michael N. [2 ]
机构
[1] Univ Patras, CEID, Patras, Greece
[2] Univ Patras, Dept Math, Computat Intelligence Lab, Patras, Greece
[3] Univ Thessaly, Dept Comp Sci & Biomed Informat, Volos, Greece
关键词
Feature selection; Optimization; Single-cell RNA-seq; High-dimensional data; EXPRESSION DATA; CLASSIFICATION; KERNEL;
D O I
10.1007/978-3-030-92121-7_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Big data methods prevail in the biomedical domain leading to effective and scalable data-driven approaches. Biomedical data are known for their ultra-high dimensionality, especially the ones coming from molecular biology experiments. This property is also included in the emerging technique of single-cell RNA-sequencing (scRNA-seq), where we obtain sequence information from individual cells. A reliable way to uncover their complexity is by using Machine Learning approaches, including dimensional reduction and feature selection methods. Although the first choice has had remarkable progress in scRNA-seq data, only the latter can offer deeper interpretability at the gene level since it highlights the dominant gene features in the given data. Towards tackling this challenge, we propose a feature selection framework that utilizes genetic optimization principles and identifies low-dimensional combinations of gene lists in order to enhance classification performance of any off-the-shelf classifier (e.g., LDA or SVM). Our intuition is that by identifying an optimal genes subset, we can enhance the prediction power of scRNA-seq data even if these genes are unrelated to each other. We showcase our proposed framework's effectiveness in two real scRNA-seq experiments with gene dimensions up to 36708. Our framework can identify very low-dimensional subsets of genes (less than 200) while boosting the classifiers' performance. Finally, we provide a biological interpretation of the selected genes, thus providing evidence of our method's utility towards explainable artificial intelligence.
引用
收藏
页码:66 / 79
页数:14
相关论文
共 50 条
  • [31] SCnorm: robust normalization of single-cell RNA-seq data
    Bacher, Rhonda
    Chu, Li-Fang
    Leng, Ning
    Gasch, Audrey P.
    Thomson, James A.
    Stewart, Ron M.
    Newton, Michael
    Kendziorski, Christina
    NATURE METHODS, 2017, 14 (06) : 584 - +
  • [32] Quantifying the clusterness and trajectoriness of single-cell RNA-seq data
    Lim, Hong Seo
    Qiu, Peng
    PLOS COMPUTATIONAL BIOLOGY, 2024, 20 (02)
  • [33] Evaluating imputation methods for single-cell RNA-seq data
    Cheng, Yi
    Ma, Xiuli
    Yuan, Lang
    Sun, Zhaoguo
    Wang, Pingzhang
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [34] Challenges in unsupervised clustering of single-cell RNA-seq data
    Kiselev, Vladimir Yu
    Andrews, Tallulah S.
    Hemberg, Martin
    NATURE REVIEWS GENETICS, 2019, 20 (05) : 273 - 282
  • [35] GSE: Graph similarity enhancement algorithm for single-cell RNA-seq data clustering
    Bu, Shugui
    Guo, Lilu
    Li, Rongyuan
    Lu, Jianbo
    Zhu, Xiaoshu
    2019 4TH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION PROCESSING (ICIIP 2019), 2019, : 406 - 410
  • [36] Testing for Phylogenetic Signal in Single-Cell RNA-Seq Data
    Moravec, Jiri C.
    Lanfear, Robert
    Spector, David L.
    Diermeier, Sarah D.
    Gavryushkin, Alex
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2023, 30 (04) : 518 - 537
  • [37] Locality Sensitive Imputation for Single-Cell RNA-Seq Data
    Moussa, Marmar
    Mandoiu, Ion I.
    BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2018, 2018, 10847 : 347 - 360
  • [38] Supervised Adversarial Alignment of Single-Cell RNA-seq Data
    Ge, Songwei
    Wang, Haohan
    Alavi, Amir
    Xing, Eric
    Bar-Joseph, Ziv
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2021, 28 (05) : 501 - 513
  • [39] Phylogenetic inference from single-cell RNA-seq data
    Xuan Liu
    Jason I. Griffiths
    Isaac Bishara
    Jiayi Liu
    Andrea H. Bild
    Jeffrey T. Chang
    Scientific Reports, 13
  • [40] Phylogenetic inference from single-cell RNA-seq data
    Liu, Xuan
    Griffiths, Jason I.
    Bishara, Isaac
    Liu, Jiayi
    Bild, Andrea H.
    Chang, Jeffrey T.
    SCIENTIFIC REPORTS, 2023, 13 (01)