HaploCart: Human mtDNA haplogroup classification using a pangenomic reference graph human mtDNA haplogroup inference

被引:6
|
作者
Rubin, Joshua Daniel [1 ]
Vogel, Nicola Alexandra [1 ]
Gopalakrishnan, Shyam [2 ]
Sackett, Peter Wad [1 ]
Renaud, Gabriel [1 ]
机构
[1] Tech Univ Denmark, Dept Hlth Technol, Lyngby, Denmark
[2] Univ Copenhagen, Sect Hologen, Copenhagen, Denmark
关键词
MITOCHONDRIAL-DNA HAPLOGROUPS; SEQUENCE; ASSOCIATION; GENOME; RISK;
D O I
10.1371/journal.pcbi.1011148
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Author summaryPangenome graphs are powerful and relatively nascent data structures for representing an entire collection of genomic sequences and their homology. Here we present HaploCart, a tool which leverages the power of pangenomics, in conjunction with maximum-likelihood estimation, to improve human mtDNA haplotype inference on single-source samples (i.e. the sample is not a mixture of multiple contributors, be they human or contaminant). In this context, mapping to many reference genomes at once vastly reduces the Eurocentric bias inherent in contemporary methods, and also improves haplotyping performance at low coverage depths. We show that HaploCart is far more accurate than competing programs on simulated and empirical datasets, and reports clade-level posterior probabilities that accurately reflect confidence in our phylogenetic assignments. Our work can easily be generalized to other haploid markers and suggests that pangenome-based approaches combined with Bayesian methods show promise for improving inference and mitigating ethnicity-related bias in a large class of bioinformatics problems involving sequencing data. Current mitochondrial DNA (mtDNA) haplogroup classification tools map reads to a single reference genome and perform inference based on the detected mutations to this reference. This approach biases haplogroup assignments towards the reference and prohibits accurate calculations of the uncertainty in assignment. We present HaploCart, a probabilistic mtDNA haplogroup classifier which uses a pangenomic reference graph framework together with principles of Bayesian inference. We demonstrate that our approach significantly outperforms available tools by being more robust to lower coverage or incomplete consensus sequences and producing phylogenetically-aware confidence scores that are unbiased towards any haplogroup. HaploCart is available both as a command-line tool and through a user-friendly web interface. The C++ program accepts as input consensus FASTA, FASTQ, or GAM files, and outputs a text file with the haplogroup assignments of the samples along with the level of confidence in the assignments. Our work considerably reduces the amount of data required to obtain a confident mitochondrial haplogroup assignment.
引用
收藏
页数:27
相关论文
共 47 条
  • [1] mtDNA haplogroup and single nucleotide polymorphisms structure human microbiome communities
    Jun Ma
    Cristian Coarfa
    Xiang Qin
    Penelope E Bonnen
    Aleksandar Milosavljevic
    James Versalovic
    Kjersti Aagaard
    BMC Genomics, 15
  • [2] mtDNA haplogroup and single nucleotide polymorphisms structure human microbiome communities
    Ma, Jun
    Coarfa, Cristian
    Qin, Xiang
    Bonnen, Penelope E.
    Milosavljevic, Aleksandar
    Versalovic, James
    Aagaard, Kjersti
    BMC GENOMICS, 2014, 15
  • [3] Human mtDNA site-specific variability values can act as haplogroup markers
    Accetturo, Matteo
    Santamaria, Monica
    Lascaro, Daniela
    Rubino, Francesco
    Achilli, Alessandro
    Torroni, Antonio
    Tommaseo-Ponzetta, Mila
    Attimonelli, Marcella
    HUMAN MUTATION, 2006, 27 (09) : 965 - 974
  • [4] Haplogroup Classification of Korean Cattle Breeds Based on Sequence Variations of mtDNA Control Region
    Kim, Jae-Hwan
    Lee, Seong-Su
    Kim, Seung Chang
    Choi, Seong-Bok
    Kim, Su-Hyun
    Lee, Chang Woo
    Jung, Kyoung-Sub
    Kim, Eun Sung
    Choi, Young-Sun
    Kim, Sung-Bok
    Kim, Woo Hyun
    Cho, Chang-Yeon
    ASIAN-AUSTRALASIAN JOURNAL OF ANIMAL SCIENCES, 2016, 29 (05): : 624 - 630
  • [5] An update to MitoTool: Using a new scoring system for faster mtDNA haplogroup determination
    Fan, Long
    Yao, Yong-Gang
    MITOCHONDRION, 2013, 13 (04) : 360 - 363
  • [6] Lineage-specific selection in human mtDNA:: Lack of polymorphisms in a segment of MTND5 gene in haplogroup J
    Moilanen, JS
    Finnilä, S
    Majamaa, K
    MOLECULAR BIOLOGY AND EVOLUTION, 2003, 20 (12) : 2132 - 2142
  • [7] mtDNA as a tool for identification of human remains - Identification using mtDNA
    Lutz, S
    Weisser, HJ
    Heizmann, J
    Pollak, S
    INTERNATIONAL JOURNAL OF LEGAL MEDICINE, 1996, 109 (04) : 205 - 209
  • [8] The co-occurrence of mtDNA mutations on different oxidative phosphorylation subunits, not detected by haplogroup analysis, affects human longevity and is population specific
    Raule, Nicola
    Sevini, Federica
    Li, Shengting
    Barbieri, Annalaura
    Tallaro, Federica
    Lomartire, Laura
    Vianello, Dario
    Montesanto, Alberto
    Moilanen, Jukka S.
    Bezrukov, Vladyslav
    Blanche, Helene
    Hervonen, Antti
    Christensen, Kaare
    Deiana, Luca
    Gonos, Efstathios S.
    Kirkwood, Tom B. L.
    Kristensen, Peter
    Leon, Alberta
    Pelicci, Pier Giuseppe
    Poulain, Michel
    Rea, Irene M.
    Remacle, Jose
    Robine, Jean Marie
    Schreiber, Stefan
    Sikora, Ewa
    Slagboom, Peternella Eline
    Spazzafumo, Liana
    Stazi, Maria Antonietta
    Toussaint, Olivier
    Vaupel, James W.
    Rose, Giuseppina
    Majamaa, Kari
    Perola, Markus
    Johnson, Thomas E.
    Bolund, Lars
    Yang, Huanming
    Passarino, Giuseppe
    Franceschi, Claudio
    AGING CELL, 2014, 13 (03) : 401 - 407
  • [9] Bayesian coalescent inference of major human mitochondrial DNA haplogroup expansions in Africa
    Atkinson, Quentin D.
    Gray, Russell D.
    Drummond, Alexei J.
    PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2009, 276 (1655) : 367 - 373
  • [10] Differential mitochondrial and cellular responses between H vs. J mtDNA haplogroup-containing human RPE transmitochondrial cybrid cells
    Panvini, Ana Rubin
    Gvritishvili, Anzor
    Galvan, Hannah
    Nashine, Sonali R.
    Atilano, Shari R.
    Kenney, M. Cristina
    Tombran-Tink, Joyce
    EXPERIMENTAL EYE RESEARCH, 2022, 219