SLIVER: Unveiling large scale gene regulatory networks of single-cell transcriptomic data through causal structure learning and modules aggregation

被引:1
|
作者
Jiang H. [1 ]
Wang Y. [1 ]
Yin C. [1 ]
Pan H. [2 ]
Chen L. [1 ]
Feng K. [1 ]
Chang Y. [1 ,3 ,4 ]
Sun H. [1 ,3 ,4 ]
机构
[1] School of Artificial Intelligence, Jilin University, Changchun
[2] College of Software, Jilin University, Changchun
[3] International Center of Future Science, Jilin University, Changchun
[4] Engineering Research Center of Knowledge-Driven Human-Machine Intelligence, MOE
基金
中国国家自然科学基金;
关键词
Causal discovery; Directed acyclic graphs; Gene regulatory network; Large-scale network; Latent low-dimensional space;
D O I
10.1016/j.compbiomed.2024.108690
中图分类号
学科分类号
摘要
Prevalent Gene Regulatory Network (GRN) construction methods rely on generalized correlation analysis. However, in biological systems, regulation is essentially a causal relationship that cannot be adequately captured solely through correlation. Therefore, it is more reasonable to infer GRNs from a causal perspective. Existing causal discovery algorithms typically rely on Directed Acyclic Graphs (DAGs) to model causal relationships, but it often requires traversing the entire network, which result in computational demands skyrocketing as the number of nodes grows and make causal discovery algorithms only suitable for small networks with one or two hundred nodes or fewer. In this study, we propose the SLIVER (cauSaL dIscovery Via dimEnsionality Reduction) algorithm which integrates causal structural equation model and graph decomposition. SLIVER introduces a set of factor nodes, serving as abstractions of different functional modules to integrate the regulatory relationships between genes based on their respective functions or pathways, thus reducing the GRN to the product of two low-dimensional matrices. Subsequently, we employ the structural causal model (SCM) to learn the GRN within the gene node space, enforce the DAG constraint in the low-dimensional space, and guide each factor to aggregate various functions through cosine similarity. We evaluate the performance of the SLIVER algorithm on 12 real single cell transcriptomic datasets, and demonstrate it outperforms other 12 widely used methods both in GRN inference performance and computational resource usage. The analysis of the gene information integrated by factor nodes also demonstrate the biological explanation of factor nodes in GRNs. We apply it to scRNA-seq of Type 2 diabetes mellitus to capture the transcriptional regulatory structural changes of β cells under high insulin demand. © 2024 Elsevier Ltd
引用
收藏
相关论文
共 50 条
  • [21] Coupled Single-Cell CRISPR Screening and Epigenomic Profiling Reveals Causal Gene Regulatory Networks
    Rubin, Adam J.
    Parker, Kevin R.
    Satpathy, Ansuman T.
    Qi, Yanyan
    Wu, Beijing
    Ong, Alvin J.
    Mumbach, Maxwell R.
    Ji, Andrew L.
    Kim, Daniel S.
    Cho, Seung Woo
    Zarnegar, Brian J.
    Greenleaf, William J.
    Chang, Howard Y.
    Khavari, Paul A.
    CELL, 2019, 176 (1-2) : 361 - +
  • [22] Inferring gene regulatory networks from single-cell data: a mechanistic approach
    Herbach, Ulysse
    Bonnaffoux, Arnaud
    Espinasse, Thibault
    Gandrillon, Olivier
    BMC SYSTEMS BIOLOGY, 2017, 11
  • [23] Integrated Pipelines for Inferring Gene Regulatory Networks from Single-Cell Data
    Chen, Aimin
    Zhou, Tianshou
    Tian, Tianhai
    CURRENT BIOINFORMATICS, 2022, 17 (07) : 559 - 564
  • [24] How to build regulatory networks from single-cell gene expression data
    Pratapa, Aditya
    Jalihal, Amogh P.
    Law, Jeffrey N.
    Bharadwaj, Aditya
    Murali, T. M.
    ACM-BCB 2020 - 11TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2020,
  • [25] Learning interpretable cellular and gene signature embeddings from single-cell transcriptomic data
    Zhao, Yifan
    Cai, Huiyu
    Zhang, Zuobai
    Tang, Jian
    Li, Yue
    NATURE COMMUNICATIONS, 2021, 12 (01)
  • [26] Finding regulatory modules through large-scale gene-expression data analysis
    Kloster, M
    Tang, C
    Wingreen, NS
    BIOINFORMATICS, 2005, 21 (07) : 1172 - 1179
  • [27] Gene knockout inference with variational graph autoencoder learning single-cell gene regulatory networks
    Yang, Yongjian
    Li, Guanxun
    Zhong, Yan
    Xu, Qian
    Chen, Bo-Jia
    Lin, Yu-Te
    Chapkin, Robert S.
    Cai, James J.
    NUCLEIC ACIDS RESEARCH, 2023, 51 (13) : 6578 - 6592
  • [28] SCANPY: large-scale single-cell gene expression data analysis
    Wolf, F. Alexander
    Angerer, Philipp
    Theis, Fabian J.
    GENOME BIOLOGY, 2018, 19
  • [29] SCANPY: large-scale single-cell gene expression data analysis
    F. Alexander Wolf
    Philipp Angerer
    Fabian J. Theis
    Genome Biology, 19
  • [30] Inferring Causal Gene Regulatory Networks from Coupled Single-Cell Expression Dynamics Using Scribe
    Qiu, Xiaojie
    Rahimzamani, Arman
    Wang, Li
    Ren, Bingcheng
    Mao, Qi
    Durham, Timothy
    McFaline-Figueroa, Jose L.
    Saunders, Lauren
    Trapnell, Cole
    Kannan, Sreeram
    CELL SYSTEMS, 2020, 10 (03) : 265 - +