BLASTGrabber: a bioinformatic tool for visualization, analysis and sequence selection of massive BLAST data

被引:6
|
作者
Neumann, Ralf Stefan [1 ,2 ]
Kumar, Surendra [1 ,2 ]
Haverkamp, Thomas Hendricus Augustus [3 ]
Shalchian-Tabrizi, Kamran [1 ,2 ]
机构
[1] Univ Oslo, Sect Genet & Evolutionary Biol EVOGENE, Oslo, Norway
[2] Univ Oslo, CEDE, Oslo, Norway
[3] Univ Oslo, Dept Biosci, Ctr Ecol & Evolutionary Synth, Oslo, Norway
来源
BMC BIOINFORMATICS | 2014年 / 15卷
关键词
Analysis; BLAST; High-throughput; Taxonomy; Text-mining; Visualization; DISCOVERY; TAXONOMY; OUTPUT;
D O I
10.1186/1471-2105-15-128
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Advances in sequencing efficiency have vastly increased the sizes of biological sequence databases, including many thousands of genome-sequenced species. The BLAST algorithm remains the main search engine for retrieving sequence information, and must consequently handle data on an unprecedented scale. This has been possible due to high-performance computers and parallel processing. However, the raw BLAST output from contemporary searches involving thousands of queries becomes ill-suited for direct human processing. Few programs attempt to directly visualize and interpret BLAST output; those that do often provide a mere basic structuring of BLAST data. Results: Here we present a bioinformatics application named BLASTGrabber suitable for high-throughput sequencing analysis. BLASTGrabber, being implemented as a Java application, is OS-independent and includes a user friendly graphical user interface. Text or XML-formatted BLAST output files can be directly imported, displayed and categorized based on BLAST statistics. Query names and FASTA headers can be analysed by text-mining. In addition to visualizing sequence alignments, BLAST data can be ordered as an interactive taxonomy tree. All modes of analysis support selection, export and storage of data. A Java interface-based plugin structure facilitates the addition of customized third party functionality. Conclusion: The BLASTGrabber application introduces new ways of visualizing and analysing massive BLAST output data by integrating taxonomy identification, text mining capabilities and generic multi-dimensional rendering of BLAST hits. The program aims at a non-expert audience in terms of computer skills; the combination of new functionalities makes the program flexible and useful for a broad range of operations.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] A visualization tool for exploratory analysis of cyclic multivariate data
    Matthew O. Ward
    Benjamin N. Lipchak
    Metrika, 2000, 51 : 27 - 37
  • [22] A visualization tool for exploratory analysis of cyclic multivariate data
    Ward, MO
    Lipchak, BN
    METRIKA, 2000, 51 (01) : 27 - 37
  • [23] WebScope: A New Tool for Fusion Data Analysis and Visualization
    杨飞
    党宁宁
    肖炳甲
    Plasma Science and Technology, 2010, (02) : 253 - 256
  • [24] Visualization Tool for Comparative Analysis of Seabird Movement Data
    Onuki, Tomoya
    Misue, Kazuo
    2022 26TH INTERNATIONAL CONFERENCE INFORMATION VISUALISATION (IV), 2022, : 280 - 286
  • [25] A Visualization Tool for Eye Tracking Data Analysis in the Web
    Menges, Raphael
    Kramer, Sophia
    Hill, Stefan
    Nisslmueller, Marius
    Kumar, Chandan
    Staab, Steffen
    ETRA 2020 SHORT PAPERS: ACM SYMPOSIUM ON EYE TRACKING RESEARCH & APPLICATIONS, 2020,
  • [26] CMDnavigator: A tool for interactive analysis and visualization of peptide data
    Bayden, Alexander
    Diller, David
    Audie, Joseph
    Diller, Kyle
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 253
  • [27] WebScope: A New Tool for Fusion Data Analysis and Visualization
    Yang Fei
    Dang Ningning
    Xiao Bingjia
    PLASMA SCIENCE & TECHNOLOGY, 2010, 12 (02) : 253 - 256
  • [28] Phenostat: visualization and statistical tool for analysis of phenotyping data
    Reuveni, Eli
    Carola, Valeria
    Al Banchaabouchi, Mumna
    Rosenthal, Nadia
    Hancock, John M.
    Gross, Cornelius
    MAMMALIAN GENOME, 2007, 18 (09) : 677 - 681
  • [29] Selection sequences - Interactive analysis of massive data sets
    Theus, M
    Hofmann, H
    Wilhelm, AFX
    MINING AND MODELING MASSIVE DATA SETS IN SCIENCE, ENGINEERING, AND BUSINESS WITH A SUBTHEME IN ENVIRONMENTAL STATISTICS, 1997, 29 (01): : 439 - 444
  • [30] compreheNGSive - A Visualization Tool for Prioritizing Variants from Next Generation Sequence Data
    Bigelow, Alex
    Meyer, Miriah
    Camp, Nicola J.
    GENETIC EPIDEMIOLOGY, 2012, 36 (07) : 734 - 734