Gene duplication;
Structured coalescent;
Gamma distribution;
Site frequency spectrum;
Unequal recombination;
DUPLICATION;
EVOLUTION;
ADAPTATION;
D O I:
10.1016/j.tpb.2023.08.001
中图分类号:
Q14 [生态学(生物生态学)];
学科分类号:
071012 ;
0713 ;
摘要:
The Structured Coalescent was introduced to describe the coalescent process in spatially subdivided populations with migration. Here, we re-interpret migration routes of individuals in the original model as "migration routes"of single genes in tandemly arranged gene arrays. A gene copy may change its position within the array via unequal recombination. Hence, in a coalescent framework, two copies sampled from two chromosomes may coalesce only if they are at exactly homologous positions. Otherwise, one or multiple recombination events have to occur before they can coalesce, thereby increasing mean coalescence time and expected genetic diversity among the copies in a gene array. We explicitly calculate the transition probabilities on these routes backward in time. We simulate the structured coalescent with migration and coalescence rates informed by the unequal recombination process of gene copies. With this novel interpretation of population structure models we determine coalescence times and expected genetic diversity in samples of orthologous and paralogous copies from a gene family. As a case study, we discuss the site frequency spectrum of a small gene family in the two scenarios of high and of no gene copy number variation among individuals. These examples underline the significance of our model, since standard test-statistics may lead to misinterpretations when analyzing sequence data of multi-copy genes due to their different expected genetic diversity. (c) 2023 The Author(s). Published by Elsevier Inc. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
机构:
Univ Teknol MARA, Inst Med Mol Biotechnol, Fac Med, Jalan Hosp, Sungai Buloh 47000, MalaysiaUniv Leicester, Dept Genet, Leicester LE1 7RH, Leics, England