High-Order Correlation Integration for Single-Cell or Bulk RNA-seq Data Analysis

被引:8
|
作者
Tang, Hui [1 ]
Zeng, Tao [1 ]
Chen, Luonan [1 ,2 ,3 ,4 ]
机构
[1] Univ Chinese Acad Sci, CAS Ctr Excellence Mol Cell Sci, Inst Biochem & Cell Biol, Shanghai Inst Biol Sci,Key Lab Syst Biol,Chinese, Shanghai, Peoples R China
[2] Chinese Acad Sci, CAS Ctr Excellence Anim Evolut & Genet, Kunming, Yunnan, Peoples R China
[3] ShanghaiTech Univ, Sch Life Sci & Technol, Shanghai, Peoples R China
[4] Shanghai Res Ctr Brain Sci & Brain Inspired Intel, Shanghai, Peoples R China
来源
FRONTIERS IN GENETICS | 2019年 / 10卷
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
high-order; integration; clustering; single-cell; bulk data analysis; GENE-EXPRESSION; SIGNALING PATHWAYS; DISCOVERY; MODULES; HETEROGENEITY; EMBRYOS; COLON; MAPK;
D O I
10.3389/fgene.2019.00371
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Quantifying or labeling the sample type with high quality is a challenging task, which is a key step for understanding complex diseases. Reducing noise pollution to data and ensuring the extracted intrinsic patterns in concordance with the primary data structure are important in sample clustering and classification. Here we propose an effective data integration framework named as HCI (High-order Correlation Integration), which takes an advantage of high-order correlation matrix incorporated with pattern fusion analysis (PFA), to realize high-dimensional data feature extraction. On the one hand, the high-order Pearson's correlation coefficient can highlight the latent patterns underlying noisy input datasets and thus improve the accuracy and robustness of the algorithms currently available for sample clustering. On the other hand, the PFA can identify intrinsic sample patterns efficiently from different input matrices by optimally adjusting the signal effects. To validate the effectiveness of our new method, we firstly applied HCI on four single-cell RNA-seq datasets to distinguish the cell types, and we found that HCI is capable of identifying the prior-known cell types of single-cell samples from scRNA-seq data with higher accuracy and robustness than other methods under different conditions. Secondly, we also integrated heterogonous omics data from TCGA datasets and GEO datasets including bulk RNA-seq data, which outperformed the other methods at identifying distinct cancer subtypes. Within an additional case study, we also constructed the mRNA-miRNA regulatory network of colorectal cancer based on the feature weight estimated from HCI, where the differentially expressed mRNAs and miRNAs were significantly enriched in well-known functional sets of colorectal cancer, such as KEGG pathways and IPA disease annotations. All these results supported that HCI has extensive flexibility and applicability on sample clustering with different types and organizations of RNA-seq data.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Practical Compass of Single-Cell RNA-Seq Analysis
    Okada, Hiroyuki
    Chung, Ung-il
    Hojo, Hironori
    CURRENT OSTEOPOROSIS REPORTS, 2024, 22 (05) : 433 - 440
  • [42] Embracing the dropouts in single-cell RNA-seq analysis
    Peng Qiu
    Nature Communications, 11
  • [43] Integrating Single-Cell RNA-Seq and Bulk RNA-Seq Data to Explore the Key Role of Fatty Acid Metabolism in Breast Cancer
    Chen, Yongxing
    Wu, Wei
    Jin, Chenxin
    Cui, Jiaxue
    Diao, Yizhuo
    Wang, Ruiqi
    Xu, Rongxuan
    Yao, Zhihan
    Li, Xiaofeng
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2023, 24 (17)
  • [44] Integration of electrophysiological recordings with single-cell RNA-seq data identifies neuronal subtypes
    Fuzik, Janos
    Zeisel, Amit
    Mate, Zoltan
    Calvigioni, Daniela
    Yanagawa, Yuchio
    Szabo, Gabor
    Linnarsson, Sten
    Harkany, Tibor
    NATURE BIOTECHNOLOGY, 2016, 34 (02) : 175 - +
  • [45] Integrating single-cell RNA-Seq and bulk RNA-Seq data to explore the key role of fatty acid metabolism in hepatocellular carcinoma
    Dai, Hua
    Tao, Xin
    Shu, Yuansen
    Liu, Fanrong
    Cheng, Xiaoping
    Li, Xiushen
    Shu, Bairui
    Luo, Hongcheng
    Chen, Xuxiang
    Cheng, Zhaorui
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [46] SINGLE-CELL ANALYSIS From single-cell RNA-seq to transcriptional regulation
    La Manno, Gioele
    NATURE BIOTECHNOLOGY, 2019, 37 (12) : 1421 - 1422
  • [47] Complementing single-cell RNA-seq using bulk transcriptional profiles
    Haynes, Winston A.
    Vallania, Francesco
    Khatri, Purvesh
    2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 1446 - 1450
  • [48] Embracing the dropouts in single-cell RNA-seq analysis
    Qiu, Peng
    NATURE COMMUNICATIONS, 2020, 11 (01)
  • [49] Bayesian correlation is a robust gene similarity measure for single-cell RNA-seq data
    Sanchez-Taltavull, Daniel
    Perkins, Theodore J.
    Dommann, Noelle
    Melin, Nicolas
    Keogh, Adrian
    Candinas, Daniel
    Stroka, Deborah
    Beldi, Guido
    NAR GENOMICS AND BIOINFORMATICS, 2020, 2 (01)
  • [50] Noise regularization removes correlation artifacts in single-cell RNA-seq data preprocessing
    Zhang, Ruoyu
    Atwal, Gurinder S.
    Lim, Wei Keat
    PATTERNS, 2021, 2 (03):