Comparative Analysis of Supervised Cell Type Detection in Single-Cell RNA-seq Data

Authors
Vasighizaker, Akram [1]
Hora, Sheena [1]
Trivedi, Yash [1]
Rueda, Luis [1]
Affiliations
[1] Univ Windsor, Sch Comp Sci, Windsor, ON, Canada
Keywords
Cell type identification; scRNA-seq data analysis; Marker gene identification; Feature selection; Classification
DOI
10.1007/978-3-031-07802-6_28
Chinese Library Classification number
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
Single-cell RNA sequencing (scRNA-seq) technology has been widely applied in biological research and drug discovery. Identifying cell types is an essential step before any in-depth investigation of single-cell function for pathological purposes. Several unsupervised learning methods have recently been developed to identify cell types. However, annotating the resulting clusters with the correct cell types requires considerable effort using marker genes. Because annotated datasets are scarce, supervised techniques have not been commonly used in scRNA-seq studies. Classification methods, on the other hand, use feature selection algorithms to improve prediction accuracy by finding the most informative features among the many present in high-dimensional datasets. Hence, to automate the annotation of cell type clusters, we can take advantage of classification models. This article evaluates the performance of three state-of-the-art supervised classification methods, namely support vector machine, k-nearest neighbor, and random forest, combined with three feature selection methods, namely Chi-squared, information gain, and ANOVA F-value. The results of applying the nine combinations of these methods to three standard scRNA-seq datasets show that support vector machine combined with information gain outperforms the other combinations. Moreover, we investigated reference gene sets and found 11 of the 20 highly variable genes in two different pancreas gene sets, which supports our findings. This article sheds some light on the potential of marker gene identification to improve the automatic identification of cell types.
Pages: 333-345
Number of pages: 13
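
The abstract names information gain as the feature selection method that worked best with a support vector machine. As a rough illustration of what information-gain ranking means for gene selection, the following minimal sketch scores each gene by how much knowing whether it is expressed reduces the entropy of the cell type labels. It is not the authors' implementation: the binarization threshold, the function names (`entropy`, `information_gain`, `top_k_genes`), the gene names, and the toy counts are all assumptions made up for this example.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())


def information_gain(expression, labels, threshold=0.0):
    """Information gain of one gene after binarizing its expression.

    The threshold is an illustrative assumption, not taken from the paper.
    """
    on = [lab for x, lab in zip(expression, labels) if x > threshold]
    off = [lab for x, lab in zip(expression, labels) if x <= threshold]
    # Weighted entropy of the labels within the "expressed" and "not expressed" groups.
    conditional = sum(
        len(part) / len(labels) * entropy(part) for part in (on, off) if part
    )
    return entropy(labels) - conditional


def top_k_genes(matrix, gene_names, labels, k):
    """Rank genes by information gain and return the names of the k best."""
    scores = {
        gene: information_gain([row[j] for row in matrix], labels)
        for j, gene in enumerate(gene_names)
    }
    return sorted(scores, key=scores.get, reverse=True)[:k]


# Toy counts, made up for illustration: rows are cells, columns are genes.
gene_names = ["gene_a", "gene_b", "gene_c"]
matrix = [
    [5, 0, 1],
    [7, 0, 0],
    [0, 3, 1],
    [0, 4, 0],
]
labels = ["alpha", "alpha", "beta", "beta"]

selected = top_k_genes(matrix, gene_names, labels, k=2)
print(selected)
```

In this toy data, `gene_a` and `gene_b` each separate the two cell types perfectly while `gene_c` carries no information about them, so only the first two are kept. In a pipeline like the one the abstract describes, the selected columns would then be passed to a classifier such as a support vector machine.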