Evaluation of sequencing strategies for whole-genome imputation with hybrid peeling

被引:15
|
作者
Ros-Freixedes, Roger [1 ,2 ,3 ]
Whalen, Andrew [1 ,2 ]
Gorjanc, Gregor [1 ,2 ]
Mileham, Alan J. [4 ]
Hickey, John M. [1 ,2 ]
机构
[1] Univ Edinburgh, Roslin Inst, Easter Bush, Midlothian, Scotland
[2] Univ Edinburgh, Royal Dick Sch Vet Studies, Easter Bush, Midlothian, Scotland
[3] Univ Lleida, Agrotecnio Ctr, Dept Ciencia Anim, Lleida, Spain
[4] Genus Plc, 1525 River Rd, De Forest, WI 53532 USA
基金
“创新英国”项目; 英国生物技术与生命科学研究理事会;
关键词
GENOTYPE IMPUTATION; COMPLEX TRAITS; MISSING GENOTYPES; SELECTION; ANIMALS; PREDICTION; INFERENCE; MILLIONS; DESIGN; PHASE;
D O I
10.1186/s12711-020-00537-7
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
Background For assembling large whole-genome sequence datasets for routine use in research and breeding, the sequencing strategy should be adapted to the methods that will be used later for variant discovery and imputation. In this study, we used simulation to explore the impact that the sequencing strategy and level of sequencing investment have on the overall accuracy of imputation using hybrid peeling, a pedigree-based imputation method that is well suited for large livestock populations. Methods We simulated marker array and whole-genome sequence data for 15 populations with simulated or real pedigrees that had different structures. In these populations, we evaluated the effect on imputation accuracy of seven methods for selecting which individuals to sequence, the generation of the pedigree to which the sequenced individuals belonged, the use of variable or uniform coverage, and the trade-off between the number of sequenced individuals and their sequencing coverage. For each population, we considered four levels of investment in sequencing that were proportional to the size of the population. Results Imputation accuracy depended greatly on pedigree depth. The distribution of the sequenced individuals across the generations of the pedigree underlay the performance of the different methods used to select individuals to sequence and it was critical for achieving high imputation accuracy in both early and late generations. Imputation accuracy was highest with a uniform coverage across the sequenced individuals of 2x rather than variable coverage. An investment equivalent to the cost of sequencing 2% of the population at 2x provided high imputation accuracy. The gain in imputation accuracy from additional investment decreased with larger populations and higher levels of investment. However, to achieve the same imputation accuracy, a proportionally greater investment must be used in the smaller populations compared to the larger ones. Conclusions Suitable sequencing strategies for subsequent imputation with hybrid peeling involve sequencing similar to 2% of the population at a uniform coverage 2x, distributed preferably across all generations of the pedigree, except for the few earliest generations that lack genotyped ancestors. Such sequencing strategies are beneficial for generating whole-genome sequence data in populations with deep pedigrees of closely related individuals.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Evaluation of sequencing strategies for whole-genome imputation with hybrid peeling
    Roger Ros-Freixedes
    Andrew Whalen
    Gregor Gorjanc
    Alan J. Mileham
    John M. Hickey
    [J]. Genetics Selection Evolution, 52
  • [2] Whole-genome sequencing strategies
    [J]. Stein, Richard, 1600, Mary Ann Liebert Inc. (34):
  • [3] Accuracy of whole-genome sequence imputation using hybrid peeling in large pedigreed livestock populations
    Roger Ros-Freixedes
    Andrew Whalen
    Ching-Yi Chen
    Gregor Gorjanc
    William O. Herring
    Alan J. Mileham
    John M. Hickey
    [J]. Genetics Selection Evolution, 52
  • [4] Accuracy of whole-genome sequence imputation using hybrid peeling in large pedigreed livestock populations
    Ros-Freixedes, Roger
    Whalen, Andrew
    Chen, Ching-Yi
    Gorjanc, Gregor
    Herring, William O.
    Mileham, Alan J.
    Hickey, John M.
    [J]. GENETICS SELECTION EVOLUTION, 2020, 52 (01)
  • [5] Evaluation of Whole-Genome Sequence Imputation Strategies in Korean Hanwoo Cattle
    Nawaz, Muhammad Yasir
    Bernardes, Priscila Arrigucci
    Savegnago, Rodrigo Pelicioni
    Lim, Dajeong
    Lee, Seung Hwan
    Gondro, Cedric
    [J]. ANIMALS, 2022, 12 (17):
  • [6] Whole-genome sequencing
    Morris, Huw R.
    Houlden, Henry
    Polke, James
    [J]. PRACTICAL NEUROLOGY, 2021, 21 (04) : 322 - +
  • [7] The Whole-Genome Sequencing and Hybrid Assembly of Mytilus coruscus
    Li, Ronghua
    Zhang, Weijia
    Lu, Junkai
    Zhang, Zhouyi
    Mu, Changkao
    Song, Weiwei
    Migaud, Herve
    Wang, Chunlin
    Bekaert, Michael
    [J]. FRONTIERS IN GENETICS, 2020, 11
  • [8] Whole-Genome Sequencing Coupled to Imputation Discovers Genetic Signals for Anthropometric Traits
    Tachmazidou, Ioanna
    Suveges, Daniel
    Min, Josine L.
    Ritchie, Graham R. S.
    Steinberg, Julia
    Walter, Klaudia
    Iotchkova, Valentina
    Schwartzentruber, Jeremy
    Huang, Jie
    Memari, Yasin
    McCarthy, Shane
    Crawford, Andrew A.
    Bombieri, Cristina
    Cocca, Massimiliano
    Farmaki, Aliki-Eleni
    Gaunt, Tom R.
    Jousilahti, Pekka
    Kooijman, Marjolein N.
    Lehne, Benjamin
    Malerba, Giovanni
    Mannisto, Satu
    Matchan, Angela
    Medina-Gomez, Carolina
    Metrustry, Sarah J.
    Nag, Abhishek
    Ntalla, Ioanna
    Paternoster, Lavinia
    Rayner, Nigel W.
    Sala, Cinzia
    Scott, William R.
    Shihab, Hashem A.
    Southam, Lorraine
    St Pourcain, Beate
    Traglia, Michela
    Trajanoska, Katerina
    Zaza, Gialuigi
    Zhang, Weihua
    Artigas, Maria S.
    Bansal, Narinder
    Benn, Marianne
    Chen, Zhongsheng
    Danecek, Petr
    Lin, Wei-Yu
    Locke, Adam
    Luan, Jian'an
    Manning, Alisa K.
    Mulas, Antonella
    Sidore, Carlo
    Tybjaerg-Hansen, Anne
    Varbo, Anette
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2017, 100 (06) : 865 - 884
  • [9] Interpreting Whole-Genome Sequencing
    Grody, Wayne W.
    Vilain, Eric
    Nelson, Stanley F.
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2014, 312 (03): : 296 - 296
  • [10] Whole-genome sequencing in pharmacogeneticson
    Urban, Thomas J.
    [J]. PHARMACOGENOMICS, 2013, 14 (04) : 345 - 348