The Allele Catalog Tool: a web-based interactive tool for allele discovery and analysis

Cited by: 4
Authors
Chan, Yen On [1, 2]
Dietz, Nicholas [3]
Zeng, Shuai [4]
Wang, Juexin [2, 4]
Flint-Garcia, Sherry [5]
Salazar-Vidal, M. Nancy [3, 6]
Skrabisova, Maria [7]
Bilyeu, Kristin [5]
Joshi, Trupti [1, 2, 4, 8]
Affiliations
[1] Univ Missouri Columbia, MU Inst Data Sci & Informat, Columbia, MO 65211 USA
[2] Univ Missouri Columbia, Christopher S Bond Life Sci Ctr, Columbia, MO 65211 USA
[3] Univ Missouri Columbia, Div Plant Sci & Technol, Columbia, MO USA
[4] Univ Missouri Columbia, Dept Elect Engn & Comp Sci, Columbia, MO 65211 USA
[5] USDA ARS, Plant Genet Res Unit, Columbia, MO 65211 USA
[6] Univ Calif Davis, Dept Evolut & Ecol, Davis, CA USA
[7] Palacky Univ Olomouc, Fac Sci, Dept Biochem, Olomouc, Czech Republic
[8] Univ Missouri Columbia, Dept Hlth Management & Informat, Columbia, MO 65211 USA
Keywords
Variant Calling Pipeline; Allele Catalog Pipeline; Allele Catalog Tool; Alleles in Gene; Data Visualization; SEED DORMANCY; ARABIDOPSIS; ADAPTATION; DOG1;
DOI
10.1186/s12864-023-09161-3
Chinese Library Classification number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Discipline classification code
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
Background: Advances in sequencing technologies have made a plethora of whole-genome re-sequenced (WGRS) data publicly available. However, WGRS data are nearly impossible to use in research without further configuration. To solve this problem, our research group has developed an interactive Allele Catalog Tool that enables researchers to explore the coding-region allelic variation present in over 1,000 re-sequenced accessions each for soybean, Arabidopsis, and maize.
Results: The Allele Catalog Tool was originally designed with soybean genomic data and resources. The Allele Catalog datasets were generated using our variant calling pipeline (SnakyVC) and the Allele Catalog pipeline (AlleleCatalog). The variant calling pipeline processes raw sequencing reads in parallel to generate Variant Call Format (VCF) files. The Allele Catalog pipeline takes the VCF files, performs imputation and functional effect prediction, and assembles the alleles for each gene to generate curated Allele Catalog datasets. Both pipelines were used to generate the data panels (VCF files and Allele Catalog files), in which the accessions of the WGRS datasets were collected from various sources and currently represent over 1,000 diverse accessions each for soybean, Arabidopsis, and maize. The main features of the Allele Catalog Tool are data query, visualization of results, categorical filtering, and download functions. Queries are performed from user input, and results are presented in tabular form: summary results by categorical description, and genotype results of the alleles for each gene. The categorical information is specific to each species; where available, detailed meta-information is provided in modal popups. The genotypic information contains the variant positions, the reference or alternate genotypes, the functional effect classes, and the amino-acid changes for each accession. The results can also be downloaded for other research purposes.
Conclusions: The Allele Catalog Tool is a web-based tool that currently supports three species: soybean, Arabidopsis, and maize. The Soybean Allele Catalog Tool is hosted on the SoyKB website, while the Allele Catalog Tool for Arabidopsis and maize is hosted on the KBCommons website. Researchers can use this tool to connect variant alleles of genes with meta-information of species.
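The allele assembly step mentioned in the abstract can be illustrated with a minimal sketch. It is not the AlleleCatalog pipeline's code: the function name `assemble_alleles`, the input layout, and the sample accessions are all hypothetical, and imputation and functional effect prediction are left out. It assumes an allele of a gene is the combination of genotypes observed at that gene's variant positions, and it groups accessions that share the same combination.

```python
from collections import defaultdict


def assemble_alleles(positions, genotypes):
    """Group accessions by their combined genotype across one gene's variant positions.

    positions: ordered list of variant positions in the gene (hypothetical input).
    genotypes: dict mapping accession name -> dict mapping position -> genotype string.
    Returns a dict mapping allele (tuple of genotypes) -> sorted list of accessions.
    """
    catalog = defaultdict(list)
    for accession, calls in genotypes.items():
        # Missing calls are kept as "." here; this sketch does not impute them.
        allele = tuple(calls.get(pos, ".") for pos in positions)
        catalog[allele].append(accession)
    return {allele: sorted(accs) for allele, accs in catalog.items()}


# Made-up example data: two variant positions, three accessions.
positions = [101, 250]
genotypes = {
    "acc_A": {101: "G", 250: "T"},
    "acc_B": {101: "G", 250: "T"},
    "acc_C": {101: "A", 250: "T"},
}

catalog = assemble_alleles(positions, genotypes)
for allele, accessions in catalog.items():
    print(allele, accessions)
```

Here `acc_A` and `acc_B` fall under the same allele because they agree at both positions, while `acc_C` forms its own allele. A table of this shape is what would then be joined to per-accession meta-information.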
Pages: 14