An information-theoretic approach to single cell sequencing analysis

Cited by: 3
Authors
Casey, Michael J. [1, 2]
Fliege, Joerg [1]
Sanchez-Garcia, Ruben J. [1, 2, 3]
MacArthur, Ben D. [1, 2, 3, 4]
Affiliations
[1] Univ Southampton, Math Sci, Southampton, England
[2] Univ Southampton, Inst Life Sci, Southampton, England
[3] Alan Turing Inst, London, England
[4] Univ Southampton, Fac Med, Ctr Human Dev Stem Cells & Regenerat, Southampton, England
Funding
UK Engineering and Physical Sciences Research Council (EPSRC)
Keywords
RNA-SEQ; INFERENCE; ALGORITHM; NOISE;
DOI
10.1186/s12859-023-05424-8
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: Single-cell sequencing (sc-Seq) experiments are producing increasingly large data sets. However, large data sets do not necessarily contain large amounts of information.

Results: Here, we formally quantify the information obtained from a sc-Seq experiment and show that it corresponds to an intuitive notion of gene expression heterogeneity. We demonstrate a natural relation between our notion of heterogeneity and that of cell type, decomposing heterogeneity into that component attributable to differential expression between cell types (inter-cluster heterogeneity) and that remaining (intra-cluster heterogeneity). We test our definition of heterogeneity as the objective function of a clustering algorithm, and show that it is a useful descriptor for gene expression patterns associated with different cell types.

Conclusions: Thus, our definition of gene heterogeneity leads to a biologically meaningful notion of cell type, as groups of cells that are statistically equivalent with respect to their patterns of gene expression. Our measure of heterogeneity, and its decomposition into inter- and intra-cluster, is non-parametric, intrinsic, unbiased, and requires no additional assumptions about expression patterns. Based on this theory, we develop an efficient method for the automatic unsupervised clustering of cells from sc-Seq data, and provide an R package implementation.
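
The decomposition of heterogeneity into inter-cluster and intra-cluster parts can be illustrated with a small sketch. The abstract does not state the formula, so the code below assumes one plausible formalization: the heterogeneity of a gene is taken to be the Kullback-Leibler divergence of its transcript distribution across cells from the uniform distribution. Under that assumption, the chain rule for relative entropy splits the total exactly into a between-cluster term and a mass-weighted within-cluster term. The function names, the toy counts, and the cluster labels are all hypothetical, and this is not the authors' R package.

```python
import math


def heterogeneity(counts):
    """KL divergence (in nats) of the per-cell transcript share from uniform.

    Assumed formalization, not quoted from the abstract. Cells with zero
    counts contribute nothing, following the convention 0 * log(0) = 0.
    """
    total = sum(counts)
    n = len(counts)
    return sum((c / total) * math.log(n * c / total) for c in counts if c > 0)


def decompose(counts, labels):
    """Split the total heterogeneity into (inter-cluster, intra-cluster) parts."""
    total = sum(counts)
    n = len(counts)
    groups = {}
    for c, label in zip(counts, labels):
        groups.setdefault(label, []).append(c)

    inter = 0.0
    intra = 0.0
    for members in groups.values():
        mass = sum(members) / total   # share of transcripts in this cluster
        if mass == 0:
            continue                  # an empty cluster contributes nothing
        share = len(members) / n      # share of cells in this cluster
        inter += mass * math.log(mass / share)
        intra += mass * heterogeneity(members)
    return inter, intra


# Toy data: one gene measured in eight cells, with two hypothetical clusters.
counts = [0, 1, 0, 2, 9, 12, 8, 10]
labels = ["a", "a", "a", "a", "b", "b", "b", "b"]

total_h = heterogeneity(counts)
inter_h, intra_h = decompose(counts, labels)
print(f"total={total_h:.4f} inter={inter_h:.4f} intra={intra_h:.4f}")
```

In this sketch the two parts always sum to the total, and a gene expressed equally in every cell has zero heterogeneity. A clustering that captures most of the total in the inter-cluster term leaves little unexplained variation within clusters, which is the sense in which such a quantity could serve as a clustering objective.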
Pages: 24