SMGR: a joint statistical method for integrative analysis of single-cell multi-omics data

被引:34
|
作者
Song, Qianqian [1 ,2 ]
Zhu, Xuewei [3 ]
Jin, Lingtao [4 ]
Chen, Minghan [5 ]
Zhang, Wei [1 ,2 ]
Su, Jing [6 ,7 ]
机构
[1] Atrium Hlth Wake Forest Baptist, Wake Forest Baptist Comprehens Canc Ctr, Ctr Canc Genom & Precis Oncol, Winston Salem, NC 27157 USA
[2] Wake Forest Sch Med, Dept Canc Biol, Winston Salem, NC 27157 USA
[3] Wake Forest Sch Med, Dept Internal Med, Sect Mol Med, Winston Salem, NC 27101 USA
[4] UT Hlth San Antonio, Dept Mol Med, San Antonio, TX 78229 USA
[5] Wake Forest Univ, Dept Comp Sci, Winston Salem, NC 27109 USA
[6] Indiana Univ Sch Med, Dept Biostat & Hlth Data Sci, Indianapolis, IN 46202 USA
[7] Wake Forest Sch Med, Sect Gerontol & Geriatr Med, Winston Salem, NC 27157 USA
关键词
VALIDATION; ACTIVATION; REVEALS; MODELS; NOISE; STATE;
D O I
10.1093/nargab/lqac056
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Unravelling the regulatory programs from single-cell multi-omics data has long been one of the major challenges in genomics, especially in the current emerging single-cell field. Currently there is a huge gap between fast-growing single-cell multi-omics data and effective methods for the integrative analysis of these inherent sparse and heterogeneous data. In this study, we have developed a novel method, Single-cell Multi-omics Gene co-Regulatory algorithm (SMGR), to detect coherent functional regulatory signals and target genes from the joint single-cell RNA-sequencing (scRNA-seq) and single-cell assay for transposase-accessible chromatin using sequencing (scATAC-seq) data obtained from different samples. Given that scRNA-seq and scATAC-seq data can be captured by zero-inflated Negative Binomial distribution, we utilize a generalized linear regression model to identify the latent representation of consistently expressed genes and peaks, thus enables the identification of co-regulatory programs and the elucidation of regulating mechanisms. Results from both simulation and experimental data demonstrate that SMGR outperforms the existing methods with considerably improved accuracy. To illustrate the biological insights of SMGR, we apply SMGR to mixed-phenotype acute leukemia (MPAL) and identify the MPAL-specific regulatory program with significant peak-gene links, which greatly enhance our understanding of the regulatory mechanisms and potential targets of this complex tumor.
引用
收藏
页数:13
相关论文
共 50 条
  • [11] Clustering single-cell multi-omics data with MoClust
    Yuan, Musu
    Chen, Liang
    Deng, Minghua
    BIOINFORMATICS, 2023, 39 (01)
  • [12] Intricacies of single-cell multi-omics data integration
    Rautenstrauch, Pia
    Vlot, Anna Hendrika Cornelia
    Saran, Sepideh
    Ohler, Uwe
    TRENDS IN GENETICS, 2022, 38 (02) : 128 - 139
  • [13] TiMEG: an integrative statistical method for partially missing multi-omics data
    Sarmistha Das
    Indranil Mukhopadhyay
    Scientific Reports, 11
  • [14] TiMEG: an integrative statistical method for partially missing multi-omics data
    Das, Sarmistha
    Mukhopadhyay, Indranil
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [15] Clustering of single-cell multi-omics data with a multimodal deep learning method
    Xiang Lin
    Tian Tian
    Zhi Wei
    Hakon Hakonarson
    Nature Communications, 13
  • [16] Clustering of single-cell multi-omics data with a multimodal deep learning method
    Lin, Xiang
    Tian, Tian
    Wei, Zhi
    Hakonarson, Hakon
    NATURE COMMUNICATIONS, 2022, 13 (01)
  • [17] Scbean: a python']python library for single-cell multi-omics data analysis
    Zhang, Haohui
    Wang, Yuwei
    Lian, Bin
    Wang, Yiran
    Li, Xingyi
    Wang, Tao
    Shang, Xuequn
    Yang, Hui
    Aziz, Ahmad
    Hu, Jialu
    BIOINFORMATICS, 2024, 40 (02)
  • [18] Integration of single-cell multi-omics data by regression analysis on unpaired observations
    Qiuyue Yuan
    Zhana Duren
    Genome Biology, 23
  • [19] Paired single-cell multi-omics data integration with Mowgli
    Huizing, Geert-Jan
    Deutschmann, Ina Maria
    Peyre, Gabriel
    Cantini, Laura
    NATURE COMMUNICATIONS, 2023, 14 (01)
  • [20] Unsupervised manifold alignment for single-cell multi-omics data
    Singh, Ritambhara
    Demetci, Pinar
    Bonora, Giancarlo
    Ramani, Vijay
    Lee, Choli
    Fang, He
    Duan, Zhijun
    Deng, Xinxian
    Shendure, Jay
    Disteche, Christine
    Noble, William Stafford
    ACM-BCB 2020 - 11TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2020,