AccuCalc: A Python']Python Package for Accuracy Calculation in GWAS

被引:4
|
作者
Biova, Jana [1 ]
Dietz, Nicholas [2 ]
Chan, Yen On [3 ,4 ]
Joshi, Trupti [3 ,4 ,5 ,6 ]
Bilyeu, Kristin [7 ]
Skrabisova, Maria [1 ]
机构
[1] Palacky Univ Olomouc, Fac Sci, Dept Biochem, Olomouc 78371, Czech Republic
[2] Univ Missouri, Div Plant Sci, Columbia, MO 65201 USA
[3] Univ Missouri, Christopher S Bond Life Sci Ctr, Columbia, MO 65212 USA
[4] Univ Missouri, MU Data Sci & Informat Inst, Columbia, MO 65212 USA
[5] Univ Missouri, Dept Elect Engn & Comp Sci, Columbia, MO 65212 USA
[6] Univ Missouri, Sch Med, Dept Hlth Management & Informat, Columbia, MO 65212 USA
[7] Univ Missouri, USDA ARS, Plant Genet Res Unit, Columbia, MO 65211 USA
关键词
!text type='python']python[!/text] package; GWAS; accuracy; causative mutation; SP2CM; Manhattan plot;
D O I
10.3390/genes14010123
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The genome-wide association study (GWAS) is a popular genomic approach that identifies genomic regions associated with a phenotype and, thus, aims to discover causative mutations (CM) in the genes underlying the phenotype. However, GWAS discoveries are limited by many factors and typically identify associated genomic regions without the further ability to compare the viability of candidate genes and actual CMs. Therefore, the current methodology is limited to CM identification. In our recent work, we presented a novel approach to an empowered "GWAS to Genes" strategy that we named Synthetic phenotype to causative mutation (SP2CM). We established this strategy to identify CMs in soybean genes and developed a web-based tool for accuracy calculation (AccuTool) for a reference panel of soybean accessions. Here, we describe our further development of the tool that extends its utilization for other species and named it AccuCalc. We enhanced the tool for the analysis of datasets with a low-frequency distribution of a rare phenotype by automated formatting of a synthetic phenotype and added another accuracy-based GWAS evaluation criterion to the accuracy calculation. We designed AccuCalc as a Python package for GWAS data analysis for any user-defined species-independent variant calling format (vcf) or HapMap format (hmp) as input data. AccuCalc saves analysis outputs in user-friendly tab-delimited formats and also offers visualization of the GWAS results as Manhattan plots accentuated by accuracy. Under the hood of Python, AccuCalc is publicly available and, thus, can be used conveniently for the SP2CM strategy utilization for every species.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] ADOpy: a python']python package for adaptive design optimization
    Yang, Jaeyeong
    Pitt, Mark A.
    Ahn, Woo-Young
    Myung, Jay I.
    BEHAVIOR RESEARCH METHODS, 2021, 53 (02) : 874 - 897
  • [32] serpentTools: A Python']Python Package for Expediting Analysis with Serpent
    Johnson, Andrew E.
    Kotlyar, Dan
    Terlizzi, Stefano
    Ridley, Gavin
    NUCLEAR SCIENCE AND ENGINEERING, 2020, 194 (11) : 1016 - 1024
  • [33] pyjeo: A Python']Python Package for the Analysis of Geospatial Data
    Kempeneers, Pieter
    Pesek, Ondrej
    De Marchi, Davide
    Soille, Pierre
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2019, 8 (10)
  • [34] pymetamodels: A Python']Python package for metamodeling and design automation
    Escribano, Nicolas
    Bielsa, Jose Manuel
    Lahuerta, Francisco
    SOFTWAREX, 2024, 26
  • [35] Pytearcat: PYthon']PYthon TEnsor AlgebRa calCulATor A python']python package for general relativity and tensor calculus
    San Martin, M.
    Sureda, J.
    ASTRONOMY AND COMPUTING, 2022, 39
  • [36] PyVisualFields: A Python']Python Package for Visual Field Analysis
    Eslami, Mohammad
    Kazeminasab, Saber
    Sharma, Vishal
    Li, Yangjiani
    Fazli, Mojtaba
    Wang, Mengyu
    Zebardast, Nazlee
    Elze, Tobias
    TRANSLATIONAL VISION SCIENCE & TECHNOLOGY, 2023, 12 (02):
  • [37] salmon: A Symbolic Linear Regression Package for Python']Python
    Boyd, Alex
    Sun, Dennis L.
    JOURNAL OF STATISTICAL SOFTWARE, 2024, 108 (08): : 1 - 26
  • [38] A variable selection package driving Netica with Python']Python
    Beuzen, Tomas
    Simmons, Joshua
    ENVIRONMENTAL MODELLING & SOFTWARE, 2019, 115 : 1 - 5
  • [39] NetPlotBrain: A Python']Python package for visualizing networks and brains
    Fanton, Silvia
    Thompson, William Hedley
    NETWORK NEUROSCIENCE, 2023, 7 (02) : 461 - 477
  • [40] CoClust: A Python']Python Package for Co-Clustering
    Role, Francois
    Morbieu, Stanislas
    Nadif, Mohamed
    JOURNAL OF STATISTICAL SOFTWARE, 2019, 88 (07): : 1 - 29