Forseti: a mechanistic and predictive model of the splicing status of scRNA-seq reads

被引:0
|
作者
He, Dongze [1 ,2 ]
Gao, Yuan [1 ,2 ]
Chan, Spencer Skylar [3 ]
Quintana-Parrilla, Natalia [4 ]
Patro, Rob [1 ,3 ]
机构
[1] Univ Maryland, Ctr Bioinformat & Computat Biol, College Pk, MD 20742 USA
[2] Univ Maryland, Program Computat Biol Bioinformat & Genom, College Pk, MD 20742 USA
[3] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
[4] Univ Puerto Rico, Dept Biol, Mayaguez Campus, Mayaguez, PR 00682 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
D O I
10.1093/bioinformatics/btae207
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Short-read single-cell RNA-sequencing (scRNA-seq) has been used to study cellular heterogeneity, cellular fate, and transcriptional dynamics. Modeling splicing dynamics in scRNA-seq data is challenging, with inherent difficulty in even the seemingly straightforward task of elucidating the splicing status of the molecules from which sequenced fragments are drawn. This difficulty arises, in part, from the limited read length and positional biases, which substantially reduce the specificity of the sequenced fragments. As a result, the splicing status of many reads in scRNA-seq is ambiguous because of a lack of definitive evidence. We are therefore in need of methods that can recover the splicing status of ambiguous reads which, in turn, can lead to more accuracy and confidence in downstream analyses. Results: We develop Forseti, a predictive model to probabilistically assign a splicing status to scRNA-seq reads. Our model has two key components. First, we train a binding affinity model to assign a probability that a given transcriptomic site is used in fragment generation. Second, we fit a robust fragment length distribution model that generalizes well across datasets deriving from different species and tissue types. Forseti combines these two trained models to predict the splicing status of the molecule of origin of reads by scoring putative fragments that associate each alignment of sequenced reads with proximate potential priming sites. Using both simulated and experimental data, we show that our model can precisely predict the splicing status of many reads and identify the true gene origin of multi-gene mapped reads.
引用
收藏
页码:i297 / i306
页数:10
相关论文
共 50 条
  • [41] Establishment of a prognostic model based on m6A regulatory factors and stemness of hepatocellular carcinoma using RNA-seq data and scRNA-seq data
    Liang, Yan
    Chen, Sen
    Xie, Jinghe
    Yan, Guanrong
    Guo, Tingting
    Li, Tianyang
    Liu, Shoupei
    Zeng, Weiping
    Zhang, Shuai
    Ma, Keqiang
    Chen, Honglin
    Ou, Yimeng
    Wang, Bailin
    Gu, Weili
    Duan, Yuyou
    JOURNAL OF CANCER RESEARCH AND CLINICAL ONCOLOGY, 2023, 149 (14) : 12881 - 12896
  • [42] Establishment of a prognostic model based on m6A regulatory factors and stemness of hepatocellular carcinoma using RNA-seq data and scRNA-seq data
    Yan Liang
    Sen Chen
    Jinghe Xie
    Guanrong Yan
    Tingting Guo
    Tianyang Li
    Shoupei Liu
    Weiping Zeng
    Shuai Zhang
    Keqiang Ma
    Honglin Chen
    Yimeng Ou
    Bailin Wang
    Weili Gu
    Yuyou Duan
    Journal of Cancer Research and Clinical Oncology, 2023, 149 : 12881 - 12896
  • [43] Unveiling mitophagy-mediated molecular heterogeneity and development of a risk signature model for colorectal cancer by integrated scRNA-seq and bulk RNA-seq analysis
    Gao, Han
    Zou, Qi
    Ma, Linyun
    Cai, Keyu
    Sun, Yi
    Lu, Li
    Ren, Donglin
    Hu, Bang
    GASTROENTEROLOGY REPORT, 2023, 11
  • [44] scRNA-seq of diffuse-type gastric cancer mouse model reveals a novel interaction between neutrophils and cancer cells
    Kakiuchi, Miwako
    Kokubo, Haruki
    Komura, Daisuke
    Katoh, Hiroto
    Ishikawa, Shumpei
    CANCER SCIENCE, 2023, 114 : 1784 - 1784
  • [45] Establishment of an ovarian cancer exhausted CD8+T cells-related genes model by integrated analysis of scRNA-seq and bulk RNA-seq
    Hua, Tian
    Liu, Deng-xiang
    Zhang, Xiao-chong
    Li, Shao-teng
    Wu, Jian-lei
    Zhao, Qun
    Chen, Shu-bo
    EUROPEAN JOURNAL OF MEDICAL RESEARCH, 2024, 29 (01) : 358
  • [46] ScRNA-seq And Flow Cytometry Based Analysis Reveals That Chronic Heart Failure Associates With Enhanced Activation And Clonal Expansion Of T Cells With Predictive Autoreactive Capacity
    Abplanalp, Wesley T.
    Merten, Maximilian
    Cremer, Sebastian
    Rasper, Tina
    Holz, Kathrin
    Schuhmacher, Bianca
    Macinkovic, Igor
    Tombor, Lukas
    John, David
    Zeiher, Andreas M.
    Dimmeler, Stefanie
    CIRCULATION RESEARCH, 2023, 133
  • [47] Divergent Fibroblast Composition, Differential Gene Expression, and Pathway Enrichment in ScRNA-seq to Explain Sexual Dimorphism During the Development of Emphysema in the Murine Model
    Islam, M.
    Zhang, Z.
    Burdick, M.
    Manichaikul, A.
    Shim, Y. M.
    AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2023, 207
  • [48] ScRNA-seq and flow cytometry based analysis reveals that chronic heart failure associates with enhanced activation and clonal expansion of T cells with predictive autoreactive capacity
    Abplanalp, W.
    Merten, M.
    Cremer, S.
    Rasper, T.
    Holz, K.
    Schuhmacher, B.
    Macinckovic, I.
    Tombor, L. S.
    John, D.
    Hoffmann, J.
    Zeiher, A. M.
    Dimmeler, S.
    EUROPEAN HEART JOURNAL, 2023, 44
  • [49] SUMA: a lightweight machine learning model-powered shared nearest neighbour-based clustering application interface for scRNA-Seq data
    Karakurt, Hamza Umut
    Pir, Pinar
    TURKISH JOURNAL OF BIOLOGY, 2023, 47 (06)
  • [50] Single-Cell Transcriptomics of Bone Marrow Stromal Cells in Diversity Outbred Mice: A Model for Population-Level scRNA-Seq Studies
    Dillard, Luke J.
    Rosenow, Will T.
    Calabrese, Gina M.
    Mesner, Larry D.
    Al-Barghouthi, Basel M.
    Abood, Abdullah
    Farber, Emily A.
    Onengut-Gumuscu, Suna
    Tommasini, Steven M.
    Horowitz, Mark A.
    Rosen, Clifford J.
    Yao, Lutian
    Qin, Ling
    Farber, Charles R.
    JOURNAL OF BONE AND MINERAL RESEARCH, 2023, 38 (09) : 1350 - 1363