The (in)dependence of single-cell data inferences on model constructs

被引:0
|
作者
Grgicak, Catherine M. [1 ,2 ]
Slooten, Klaas [3 ,4 ]
Cowell, Robert G. [5 ]
Bhembe, Qhawe [2 ]
Lun, Desmond S. [2 ,6 ]
机构
[1] Rutgers State Univ, Dept Chem, Program Forens Sci, Camden, NJ 08102 USA
[2] Rutgers State Univ, Ctr Computat & Integrat Biol, Camden, NJ 08102 USA
[3] Netherlands Forens Inst, POB 24044, NL-2490 AA The Hague, Netherlands
[4] Vrije Univ Amsterdam, De Boelelaan 1081, NL-1081 HV Amsterdam, Netherlands
[5] City Univ London, London, England
[6] Rutgers State Univ, Dept Comp Sci, 315 Penn St R306A, Camden, NJ 08102 USA
关键词
Forensic DNA; Single-cell forensics; Single-cell genetics; Single-cell inference; Likelihood ratio; Probabilistic genotyping; EESCIt; TD; DCM; LR calibration; DEVELOPMENTAL VALIDATION; DNA MIXTURES; LOW-TEMPLATE; MULTIPLEX; PROPOSITIONS; SYSTEM; NUMBER; LEVEL;
D O I
10.1016/j.fsigen.2024.103220
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Recent developments in single-cell analysis have revolutionized basic research and have garnered the attention of the forensic domain. Though single-cell analysis is not new to forensics, the ways in which these data can be generated and interpreted are. Modern interpretation strategies report likelihood ratios that rely on a model of the world that is a simplification of it. It is, therefore, plausible that different reasonable models will assign noticeably different weights of evidence (WoEs) to some of these data, resulting in inconsistent reports and protracted reviews of that evidence, potentially across years. With one goal of research being to identify and understand sources of inconsistencies during early stages, we undertake a study that evaluates WoE at the limit of one single-cell electropherogram (scEPG) across three architecturally distinct probabilistic models. The three are named EESCIt (Evidentiary Evaluation of Single Cells), TD (Top-Down), and DCM (Discrete Cell Model). To do this, we performance test the three models on a set of 996 individual scEPGs and conduct one H-1-true, i.e., true contributor, and 201 H-2-true, i.e., false contributor, tests, per scEPG. With the 201,192 outcomes per model, we confirm that scEPGs well resolve the hypotheses, regardless of what model was applied. We also observe that WoEs increase, on average, by 1 for every 1000 RFU of total intensity added until a plateau near the logarithm of the inverse of the random match probability is reached at ca. 22,000 RFU. By querying WoE calibration for each model, we determine if the evidence is over- or under-stated for any one of them. We find that for WoE >= -1 hardly any calibration discrepancy is observed. There were rare instances, however, for which WoEs that were <= -1 too strongly pointed in the negative direction, though H-1 was true. This was the result of five scEPGs that not only exhibited extreme signal in stutter positions, but also carried little information in other loci. These findings show that all three models appropriately stated WoEs for scEPGs when reporting positive WoE, and the two continuous model's WoE reasonably represented the findings when WoE < -1 for most loci. To further explore, we continued with paired analyses that evaluated the agreement in WoE, per scEPG, across models. Unlike unpaired analyses, this evaluation determines if well performing models return equivalent results for the same scEPG. The paired analysis was summarized by way of intraclass correlations, which were at least 0.99997. Further, we found that 762 of 996 WoEs were within a range of 3 orders of magnitude of each other, though many of these were associated with WoEs that were large, i.e., > 9, in the first instance. When we more closely focus on scEPGs giving ranges >= 3, but whose WoE <= 9 for at least one of the models, we find there are 21 of them. When we perform a locus-by-locus investigation of these 21 and of the five scEPGs returning too strong negative WoE for true contributors we find that extreme stutter is usually the cause of the challenges. To ameliorate differences in predicting rare, though impactful, events we proffer interpretive adaptions that extend beyond manually addressing the phenomena. With the WoE being calibrated within their relevant regions across EESCIt, TD and DCM, we categorize each as meeting the pillar of legitimacy for single-cell data within their intended WoE ranges.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Semisupervised Generative Autoencoder for Single-Cell Data
    Trung Ngo Trong
    Mehtonen, Juha
    Gonzalez, Gerardo
    Kramer, Roger
    Hautamaki, Ville
    Heinaniemi, Merja
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2020, 27 (08) : 1190 - 1203
  • [32] scPerturb: harmonized single-cell perturbation data
    Peidli, Stefan
    Green, Tessa D.
    Shen, Ciyue
    Gross, Torsten
    Min, Joseph
    Garda, Samuele
    Yuan, Bo
    Schumacher, Linus J.
    Taylor-King, Jake P.
    Marks, Debora S.
    Luna, Augustin
    Bluethgen, Nils
    Sander, Chris
    NATURE METHODS, 2024, 21 (03) : 531 - 540
  • [33] scPerturb: harmonized single-cell perturbation data
    Stefan Peidli
    Tessa D. Green
    Ciyue Shen
    Torsten Gross
    Joseph Min
    Samuele Garda
    Bo Yuan
    Linus J. Schumacher
    Jake P. Taylor-King
    Debora S. Marks
    Augustin Luna
    Nils Blüthgen
    Chris Sander
    Nature Methods, 2024, 21 : 531 - 540
  • [34] Statistical analysis of single-cell protein data
    Fridley, Brooke L.
    Vandekar, Simon
    Chervoneva, Inna
    Wrobel, Julia
    Ma, Siyuan
    BIOCOMPUTING 2024, PSB 2024, 2024, : 654 - 660
  • [35] Single-cell Transcriptome Study as Big Data
    Pingjian Yu
    Wei Lin
    Genomics,Proteomics & Bioinformatics, 2016, (01) : 21 - 30
  • [36] A review on integration methods for single-cell data
    Pan D.
    Li H.
    Liu H.
    Sun X.
    1600, West China Hospital, Sichuan Institute of Biomedical Engineering (38): : 1010 - 1017
  • [37] Benchmark of Data Integration in Single-Cell Proteomics
    Gong, Yaguo
    Dai, Yangbo
    Wu, Qibiao
    Guo, Li
    Yao, Xiaojun
    Yang, Qingxia
    ANALYTICAL CHEMISTRY, 2025, 97 (02) : 1254 - 1263
  • [38] Integrated analysis of multimodal single-cell data
    Hao, Yuhan
    Hao, Stephanie
    Andersen-Nissen, Erica
    Mauck, William M. I. I. I. I. I. I.
    Zheng, Shiwei
    Butler, Andrew
    Lee, Maddie J.
    Wilk, Aaron J.
    Darby, Charlotte
    Zager, Michael
    Hoffman, Paul
    Stoeckius, Marlon
    Papalexi, Efthymia
    Mimitou, Eleni P.
    Jain, Jaison
    Srivastava, Avi
    Stuart, Tim
    Fleming, Lamar M.
    Yeung, Bertrand
    Rogers, Angela J.
    McElrath, Juliana M.
    Blish, Catherine A.
    Gottardo, Raphael
    Smibert, Peter
    Satija, Rahul
    CELL, 2021, 184 (13) : 3573 - +
  • [39] Cell motility in a new single-cell wound model
    Ohtera, K
    Luo, ZP
    Couvreur, PJJ
    An, KN
    IN VITRO CELLULAR & DEVELOPMENTAL BIOLOGY-ANIMAL, 2001, 37 (07) : 414 - 418
  • [40] Cell motility in a new single-cell wound model
    Ohtera K.
    Luo Z.-P.
    Couvreur P.J.J.
    An K.-N.
    In Vitro Cellular & Developmental Biology - Animal, 2001, 37 (7) : 414 - 418