FSCAM: CAM-Based Feature Selection for Clustering scRNA-seq

被引:4
|
作者
Wang, Yan [1 ]
Gao, Jie [1 ]
Xuan, Chenxu [1 ]
Guan, Tianhao [1 ]
Wang, Yujie [1 ]
Zhou, Gang [1 ]
Ding, Tao [2 ]
机构
[1] Jiangnan Univ, Sch Sci, Wuxi 214122, Jiangsu, Peoples R China
[2] Newcastle Univ, Sch Math Stat & Phys, Newcastle Upon Tyne NE1 7RU, Tyne & Wear, England
基金
中国国家自然科学基金;
关键词
Single-cell RNA sequencing data (scRNA-seq data); Feature selection; Convex analysis of mixtures (CAM); Many-to-many" relationship; Single-cell clustering algorithm; UNSUPERVISED FEATURE-SELECTION; CELL;
D O I
10.1007/s12539-021-00495-8
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Cell type determination based on transcriptome profiles is a key application of single-cell RNA sequencing (scRNA-seq). It is usually achieved through unsupervised clustering. Good feature selection is capable of improving the clustering accuracy and is a crucial component of single-cell clustering pipelines. However, most current single-cell feature selection methods are univariable filter methods ignoring gene dependency. Even the multivariable filter methods developed in recent years only consider "one-to-many" relationship between genes. In this paper, a novel single-cell feature selection method based on convex analysis of mixtures (FSCAM) is proposed, which takes into account "many-to-many" relationship. Compared to the previous "one-to-many" methods, FSCAM selects genes with a combination of relevancy, redundancy and completeness. Pertinent benchmarking is conducted on the real datasets to validate the superiority of FSCAM. Through plugging into the framework of partition around medoids (PAM) clustering, a single-cell clustering algorithm based on FSCAM method (SCC_FSCAM) is further developed. Comparing SCC_FSCAM with existing advanced clustering algorithms, the results show that our algorithm has advantages in both internal criteria (clustering number) and external criteria (adjusted Rand index) and has a good stability. [GRAPHICS] .
引用
收藏
页码:394 / 408
页数:15
相关论文
共 50 条
  • [1] FSCAM: CAM-Based Feature Selection for Clustering scRNA-seq
    Yan Wang
    Jie Gao
    Chenxu Xuan
    Tianhao Guan
    Yujie Wang
    Gang Zhou
    Tao Ding
    Interdisciplinary Sciences: Computational Life Sciences, 2022, 14 : 394 - 408
  • [2] Intrinsic entropy model for feature selection of scRNA-seq data
    Li, Lin
    Tang, Hui
    Xia, Rui
    Dai, Hao
    Liu, Rui
    Chen, Luonan
    JOURNAL OF MOLECULAR CELL BIOLOGY, 2022, 14 (02)
  • [3] A framework for scRNA-seq data clustering based on multi-view feature integration
    Li, Feng
    Liu, Yang
    Liu, Jinxing
    Ge, Daohui
    Shang, Junliang
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 89
  • [4] RFCell: A Gene Selection Approach for scRNA-seq Clustering Based on Permutation and Random Forest
    Zhao, Yuan
    Fang, Zhao-Yu
    Lin, Cui-Xiang
    Deng, Chao
    Xu, Yun-Pei
    Li, Hong-Dong
    FRONTIERS IN GENETICS, 2021, 12
  • [5] scFED: Clustering Identifying Cell Types of scRNA-Seq Data Based on Feature Engineering Denoising
    Liu, Yang
    Li, Feng
    Shang, Junliang
    Liu, Jinxing
    Wang, Juan
    Ge, Daohui
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2023, 15 (04) : 590 - 601
  • [6] GNN-based embedding for clustering scRNA-seq data
    Ciortan, Madalina
    Defrance, Matthieu
    BIOINFORMATICS, 2022, 38 (04) : 1037 - 1044
  • [7] scSFCL:Deep clustering of scRNA-seq data with subspace feature confidence learning
    Meng, Xiaokun
    Zhang, Yuanyuan
    Xu, Xiaoyu
    Zhang, Kaihao
    Feng, Baoming
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2025, 114
  • [8] scFED: Clustering Identifying Cell Types of scRNA-Seq Data Based on Feature Engineering Denoising
    Yang Liu
    Feng Li
    Junliang Shang
    Jinxing Liu
    Juan Wang
    Daohui Ge
    Interdisciplinary Sciences: Computational Life Sciences, 2023, 15 : 590 - 601
  • [9] Boosting scRNA-seq data clustering by cluster-aware feature weighting
    Rui-Yi Li
    Jihong Guan
    Shuigeng Zhou
    BMC Bioinformatics, 22
  • [10] Boosting scRNA-seq data clustering by cluster-aware feature weighting
    Li, Rui-Yi
    Guan, Jihong
    Zhou, Shuigeng
    BMC BIOINFORMATICS, 2021, 22 (SUPPL 6)