scalepopgen: Bioinformatic Workflow Resources Implemented in Nextflow for Comprehensive Population Genomic Analyses

被引:0
|
作者
Upadhyay, Maulik [1 ]
Pogorevc, Neza [1 ]
Medugorac, Ivica [1 ]
机构
[1] Ludwig Maximilians Univ Munchen, Dept Vet Sci, Populat Genom Grp, D-82152 Martinsried, Germany
关键词
Nextflow; population genomics; signature of selection; workflows; RECENT POSITIVE SELECTION; CATTLE; VISUALIZATION; ASSOCIATION; ADAPTATION; PROGRAM; HISTORY; TOOLS; SCANS;
D O I
10.1093/molbev/msae057
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Population genomic analyses such as inference of population structure and identifying signatures of selection usually involve the application of a plethora of tools. The installation of tools and their dependencies, data transformation, or series of data preprocessing in a particular order sometimes makes the analyses challenging. While the usage of container-based technologies has significantly resolved the problems associated with the installation of tools and their dependencies, population genomic analyses requiring multistep pipelines or complex data transformation can greatly be facilitated by the application of workflow management systems such as Nextflow and Snakemake. Here, we present scalepopgen, a collection of fully automated workflows that can carry out widely used population genomic analyses on the biallelic single nucleotide polymorphism data stored in either variant calling format files or the plink-generated binary files. scalepopgen is developed in Nextflow and can be run locally or on high-performance computing systems using either Conda, Singularity, or Docker. The automated workflow includes procedures such as (i) filtering of individuals and genotypes; (ii) principal component analysis, admixture with identifying optimal K-values; (iii) running TreeMix analysis with or without bootstrapping and migration edges, followed by identification of an optimal number of migration edges; (iv) implementing single-population and pair-wise population comparison-based procedures to identify genomic signatures of selection. The pipeline uses various open-source tools; additionally, several Python and R scripts are also provided to collect and visualize the results. The tool is freely available at https://github.com/Popgen48/scalepopgen.
引用
收藏
页数:9
相关论文
共 8 条
  • [1] Improved genomic resources and new bioinformatic workflow for the carcinogenic parasite Clonorchis sinensis: Biotechnological implications
    Wang, Daxi
    Korhonen, Pasi K.
    Gasser, Robin B.
    Young, Neil D.
    BIOTECHNOLOGY ADVANCES, 2018, 36 (04) : 894 - 904
  • [2] ATAV: a comprehensive platform for population-scale genomic analyses
    Zhong Ren
    Gundula Povysil
    Joseph A. Hostyk
    Hongzhu Cui
    Nitin Bhardwaj
    David B. Goldstein
    BMC Bioinformatics, 22
  • [3] ATAV: a comprehensive platform for population-scale genomic analyses
    Ren, Zhong
    Povysil, Gundula
    Hostyk, Joseph A.
    Cui, Hongzhu
    Bhardwaj, Nitin
    Goldstein, David B.
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [4] ATAV: a comprehensive platform for population-scale genomic analyses
    Ren, Zhong (zhong.ren@hotmail.com), 1600, BioMed Central Ltd (22):
  • [5] Genomic resources for population analyses of an invasive insect pest Oryctes rhinoceros
    Filipovic, Igor
    SCIENTIFIC DATA, 2023, 10 (01)
  • [6] Genomic resources for population analyses of an invasive insect pest Oryctes rhinoceros
    Igor Filipović
    Scientific Data, 10
  • [7] Comprehensive genomic analyses associate UGT8 variants with musical ability in a Mongolian population
    Park, Hansoo
    Lee, Seungbok
    Kim, Hyun-Jin
    Ju, Young Seok
    Shin, Jong-Yeon
    Hong, Dongwan
    von Grotthuss, Marcin
    Lee, Dong-Sung
    Park, Changho
    Kim, Jennifer Hayeon
    Kim, Boram
    Yoo, Yun Joo
    Cho, Sung-Il
    Sung, Joohon
    Lee, Charles
    Kim, Jong-Il
    Seo, Jeong-Sun
    JOURNAL OF MEDICAL GENETICS, 2012, 49 (12) : 747 - 752
  • [8] Comprehensive genomic analyses of Vigna unguiculata provide insights into population differentiation and the genetic basis of key agricultural traits
    Pan, Lei
    Liu, Minghui
    Kang, Yan
    Mei, Xiang
    Hu, Gege
    Bao, Chun
    Zheng, Yu
    Zhao, Huixia
    Chen, Chanyou
    Wang, Nian
    PLANT BIOTECHNOLOGY JOURNAL, 2023, 21 (07) : 1426 - 1439