A method for positive forensic identification of samples from extremely low-coverage sequence data

Cited by: 13

Authors:

Vohr, Samuel H. [1]

Najar, Carlos Fernando Buen Abad [2]

Shapiro, Beth [3]

Green, Richard E. [1]

Affiliations:

[1] Univ Calif Santa Cruz, Dept Biomol Engn, Santa Cruz, CA 95064 USA

[2] Univ Nacl Autonoma Mexico, Fac Ciencias, Mexico City 04510, DF, Mexico

[3] Univ Calif Santa Cruz, Dept Ecol & Evolutionary Biol, Santa Cruz, CA 95064 USA

Source:

BMC GENOMICS | 2015 / Vol. 16

Keywords:

Forensics; Ancient DNA; Genomics; GENOME SEQUENCE; DNA; ANCIENT; ENRICHMENT; HAPLOTYPE;

DOI:

10.1186/s12864-015-2241-6

Chinese Library Classification (CLC) number:

Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];

Discipline classification codes:

071005 ; 0836 ; 090102 ; 100705 ;

Abstract:

Background: Determining whether two DNA samples originate from the same individual is difficult when the amount of retrievable DNA is limited. This is often the case for ancient, historic, and forensic samples. The most widely used approaches rely on amplification of a defined panel of multi-allelic markers and comparison to similar data from other samples. When the amount of retrievable DNA is low, these approaches fail. Results: We describe a new method for assessing whether shotgun DNA sequence data from two samples are consistent with originating from the same or different individuals. Our approach makes use of the large catalogs of single nucleotide polymorphism (SNP) markers to maximize the chances of observing potentially discriminating alleles. We further reduce the amount of data required by taking advantage of patterns of linkage disequilibrium modeled by a reference panel of haplotypes to indirectly compare observations at pairs of linked SNPs. Using both coalescent simulations and real sequencing data from modern and ancient sources, we show that this approach is robust with respect to the reference panel and has power to detect positive identity from DNA libraries with less than 1% random and non-overlapping genome coverage in each sample. Conclusion: We present a powerful new approach that can determine whether DNA from two samples originated from the same individual even when only minute quantities of DNA are recoverable from each.


Pages: 11
