Mutation parameters from DNA sequence data using graph theoretic measures on lineage trees

被引:7
|
作者
Magori-Cohen, Reuma
Louzoun, Yoram
Kleinstein, Steven H. [1 ]
机构
[1] Princeton Univ, Dept Comp Sci, Princeton, NJ 08544 USA
[2] Bar Ilan Univ, Dept Math, IL-52900 Ramat Gan, Israel
关键词
D O I
10.1093/bioinformatics/btl239
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: B cells responding to antigenic stimulation can fine-tune their binding properties through a process of affinity maturation composed of somatic hypermutation, affinity-selection and clonal expansion. The mutation rate of the B cell receptor DNA sequence, and the effect of these mutations on affinity and specificity, are of critical importance for understanding immune and autoimmune processes. Unbiased estimates of these properties are currently lacking due to the short time-scales involved and the small numbers of sequences available. Results: We have developed a bioinformatic method based on a maximum likelihood analysis of phylogenetic lineage trees to estimate the parameters of a B cell clonal expansion model, which includes somatic hypermutation with the possibility of lethal mutations. Lineage trees are created from clonally related B cell receptor DNA sequences. Important links between tree shapes and underlying model parameters are identified using mutual information. Parameters are estimated using a likelihood function based on the joint distribution of several tree shapes, without requiring a priori knowledge of the number of generations in the clone ( which is not available for rapidly dividing populations in vivo). A systematic validation on synthetic trees produced by a mutating birth-death process simulation shows that our estimates are precise and robust to several underlying assumptions. These methods are applied to experimental data from autoimmune mice to demonstrate the existence of hypermutating B cells in an unexpected location in the spleen.
引用
收藏
页码:E332 / E340
页数:9
相关论文
共 50 条
  • [1] A graph theoretic approach to the analysis of DNA sequencing data
    Berno, AJ
    [J]. GENOME RESEARCH, 1996, 6 (02): : 80 - 91
  • [2] Data Lineage Temporally Using a Graph Database
    Pokorny, Jaroslav
    Sykora, Jan
    Valenta, Michal
    [J]. 11TH INTERNATIONAL CONFERENCE ON MANAGEMENT OF DIGITAL ECOSYSTEMS (MEDES), 2019, : 285 - 291
  • [3] ACCURACY OF PHYLOGENETIC TREES ESTIMATED FROM DNA-SEQUENCE DATA
    SOURDIS, J
    KRIMBAS, C
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 1987, 4 (02) : 159 - 166
  • [4] Clustering gene expression data using a graph-theoretic approach: an application of minimum spanning trees
    Xu, Y
    Olman, V
    Xu, D
    [J]. BIOINFORMATICS, 2002, 18 (04) : 536 - 545
  • [5] Estimating species trees using multiple-allele DNA sequence data
    Liu, Liang
    Pearl, Dennis K.
    Brumfield, Robb T.
    Edwards, Scott V.
    [J]. EVOLUTION, 2008, 62 (08) : 2080 - 2091
  • [6] Validation of Transgenic rodent gene mutation assays using DNA sequence data
    Douglas, G. R.
    Soper, L. M.
    Singer, T. M.
    [J]. ENVIRONMENTAL AND MOLECULAR MUTAGENESIS, 2008, 49 (07) : 559 - 559
  • [7] Reconstructing complex lineage trees from scRNA-seq data using MERLoT
    Parra, R. Gonzalo
    Papadopoulos, Nikolaos
    Ahumada-Arranz, Laura
    El Kholtei, Jakob
    Mottelson, Noah
    Horokhovsky, Yehor
    Treutlein, Barbara
    Soeding, Johannes
    [J]. NUCLEIC ACIDS RESEARCH, 2019, 47 (17) : 8961 - 8974
  • [8] A DISTANCE METRIC ON BINARY-TREES USING LATTICE-THEORETIC MEASURES
    PALLO, JM
    [J]. INFORMATION PROCESSING LETTERS, 1990, 34 (03) : 113 - 116
  • [9] DnaSP, DNA sequence polymorphism: An interactive program for estimating population genetics parameters from DNA sequence data
    Rozas, J
    Rozas, R
    [J]. COMPUTER APPLICATIONS IN THE BIOSCIENCES, 1995, 11 (06): : 621 - 625
  • [10] CALCULATION OF EVOLUTIONARY TREES FROM SEQUENCE DATA
    KLOTZ, LC
    KOMAR, N
    BLANKEN, RL
    MITCHELL, RM
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1979, 76 (09) : 4516 - 4520