The Allele Catalog Tool: a web-based interactive tool for allele discovery and analysis

被引:4
|
作者
Chan, Yen On [1 ,2 ]
Dietz, Nicholas [3 ]
Zeng, Shuai [4 ]
Wang, Juexin [2 ,4 ]
Flint-Garcia, Sherry [5 ]
Salazar-Vidal, M. Nancy [3 ,6 ]
Skrabisova, Maria [7 ]
Bilyeu, Kristin [5 ]
Joshi, Trupti [1 ,2 ,4 ,8 ]
机构
[1] Univ Missouri Columbia, MU Inst Data Sci & Informat, Columbia, MO 65211 USA
[2] Univ Missouri Columbia, Christopher S Bond Life Sci Ctr, Columbia, MO 65211 USA
[3] Univ Missouri Columbia, Div Plant Sci & Technol, Columbia, MO USA
[4] Univ Missouri Columbia, Dept Elect Engn & Comp Sci, Columbia, MO 65211 USA
[5] USDA ARS, Plant Genet Res Unit, Columbia, MO 65211 USA
[6] Univ Calif Davis, Dept Evolut & Ecol, Davis, CA USA
[7] Palacky Univ Olomouc, Fac Sci, Dept Biochem, Olomouc, Czech Republic
[8] Univ Missouri Columbia, Dept Hlth Management & Informat, Columbia, MO 65211 USA
关键词
Variant Calling Pipeline; Allele Catalog Pipeline; Allele Catalog Tool; Alleles in Gene; Data Visualization; SEED DORMANCY; ARABIDOPSIS; ADAPTATION; DOG1;
D O I
10.1186/s12864-023-09161-3
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background The advancement of sequencing technologies today has made a plethora of whole-genome re-sequenced (WGRS) data publicly available. However, research utilizing the WGRS data without further configuration is nearly impossible. To solve this problem, our research group has developed an interactive Allele Catalog Tool to enable researchers to explore the coding region allelic variation present in over 1,000 re-sequenced accessions each for soybean, Arabidopsis, and maize. Results The Allele Catalog Tool was designed originally with soybean genomic data and resources. The Allele Catalog datasets were generated using our variant calling pipeline (SnakyVC) and the Allele Catalog pipeline (AlleleCatalog). The variant calling pipeline is developed to parallelly process raw sequencing reads to generate the Variant Call Format (VCF) files, and the Allele Catalog pipeline takes VCF files to perform imputations, functional effect predictions, and assemble alleles for each gene to generate curated Allele Catalog datasets. Both pipelines were utilized to generate the data panels (VCF files and Allele Catalog files) in which the accessions of the WGRS datasets were collected from various sources, currently representing over 1,000 diverse accessions for soybean, Arabidopsis, and maize individually. The main features of the Allele Catalog Tool include data query, visualization of results, categorical filtering, and download functions. Queries are performed from user input, and results are a tabular format of summary results by categorical description and genotype results of the alleles for each gene. The categorical information is specific to each species; additionally, available detailed meta-information is provided in modal popups. The genotypic information contains the variant positions, reference or alternate genotypes, the functional effect classes, and the amino-acid changes of each accession. Besides that, the results can also be downloaded for other research purposes. Conclusions The Allele Catalog Tool is a web-based tool that currently supports three species: soybean, Arabidopsis, and maize. The Soybean Allele Catalog Tool is hosted on the SoyKB website, while the Allele Catalog Tool for Arabidopsis and maize is hosted on the KBCommons website. Researchers can use this tool to connect variant alleles of genes with meta-information of species.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] The Allele Catalog Tool: a web-based interactive tool for allele discovery and analysis
    Yen On Chan
    Nicholas Dietz
    Shuai Zeng
    Juexin Wang
    Sherry Flint-Garcia
    M. Nancy Salazar-Vidal
    Mária Škrabišová
    Kristin Bilyeu
    Trupti Joshi
    [J]. BMC Genomics, 24
  • [2] HaploSNPer: a web-based allele and SNP detection tool
    Tang, Jifeng
    Leunissen, Jack A. M.
    Voorrips, Roeland E.
    van der Linden, C. Gerard
    Vosman, Ben
    [J]. BMC GENETICS, 2008, 9 (1)
  • [3] HaploSNPer: a web-based allele and SNP detection tool
    Jifeng Tang
    Jack AM Leunissen
    Roeland E Voorrips
    C Gerard van der Linden
    Ben Vosman
    [J]. BMC Genetics, 9
  • [4] CausalMGM: an interactive web-based causal discovery tool
    Ge, Xiaoyu
    Raghu, Vineet K.
    Chrysanthis, Panos K.
    Benos, Panayiotis, V
    [J]. NUCLEIC ACIDS RESEARCH, 2020, 48 (W1) : W597 - W602
  • [5] An Interactive Web-Based Fatigue Analysis Tool
    Kujawski, Daniel
    [J]. 3RD INTERNATIONAL CONFERENCE ON STRUCTURAL INTEGRITY (ICSI 2019), 2019, 17 : 742 - 749
  • [6] An interactive web-based tool for fatigue analysis and life prediction
    Kujawski, Daniel
    Sree, Phani C. R.
    Abburi, Deepak
    Kuok, Joshua T. L.
    [J]. FATIGUE DESIGN 2015, INTERNATIONAL CONFERENCE PROCEEDINGS, 6TH EDITION, 2015, 133 : 299 - 308
  • [7] An interactive web-based teaching tool for hematopathology
    Reddy, V. V. B.
    Bradley, K. T.
    Webber, B. A.
    Raju, N. D.
    Varma, V. A.
    [J]. MODERN PATHOLOGY, 2008, 21 : 105A - 106A
  • [8] An interactive web-based teaching tool for hematopathology
    Reddy, H. V. V.
    Bradley, K. T.
    Webber, B. A.
    Raju, N. D.
    Varma, V. A.
    [J]. LABORATORY INVESTIGATION, 2008, 88 : 105A - 106A
  • [9] geneSurv: An interactive web-based tool for survival analysis in genomics research
    Korkmaz, Selcuk
    Goksuluk, Dincer
    Zararsiz, Gokmen
    Karahan, Sevilay
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2017, 89 : 487 - 496
  • [10] An interactive, Web-based tool for learning anatomic landmarks
    Hallgren, RC
    Parkhurst, PE
    Monson, CL
    Crewe, NM
    [J]. ACADEMIC MEDICINE, 2002, 77 (03) : 263 - 265