Visibiome: an efficient microbiome search engine based on a scalable, distributed architecture

Cited by: 1
Authors
Azman, Syafiq Kamarul [1]
Anwar, Muhammad Zohaib [2]
Henschel, Andreas [1]
Affiliations
[1] Masdar Inst Sci & Technol, Dept Elect Engn & Comp Sci, Abu Dhabi, U Arab Emirates
[2] Aarhus Univ, Dept Environm Sci, Frederiksborgvej 399, Roskilde, Denmark
Source
BMC BIOINFORMATICS | 2017 / Vol. 18
Keywords
Microbiome; Microbial diversity; Search engine; ALGORITHM; UNIFRAC;
DOI
10.1186/s12859-017-1763-0
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: Given the current influx of 16S rRNA profiles of microbiota samples, it is conceivable that large numbers of them will eventually be available for search, comparison and contextualization with respect to novel samples. This process facilitates the identification of similar compositional features in microbiota elsewhere and can therefore help to understand the driving factors of microbial community assembly.

Results: We present Visibiome, a microbiome search engine that can perform exhaustive, phylogeny-based similarity search and contextualization of user-provided samples against a comprehensive dataset of 16S rRNA profiles from diverse environments, while tackling several computational challenges. In order to scale to high demands, we developed a distributed system that combines web framework technology, task queueing and scheduling, cloud computing and a dedicated database server. To further ensure speed and efficiency, we have deployed Nearest Neighbor search algorithms, capable of sublinear searches in high-dimensional metric spaces, in combination with an optimized Earth Mover's Distance-based implementation of weighted UniFrac. The search also incorporates pairwise (adaptive) rarefaction and, optionally, 16S rRNA copy number correction. The result for a query microbiome sample is its contextualization against a comprehensive database of microbiome samples from a diverse range of environments, visualized through a rich set of interactive figures and diagrams, including barchart-based compositional comparisons and a ranking of the closest matches in the database.

Conclusions: Visibiome is a convenient, scalable and efficient framework to search microbiomes against a comprehensive database of environmental samples. The search engine leverages a popular but computationally expensive, phylogeny-based distance metric, while providing numerous advantages over the current state-of-the-art tool.
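To make the distance mentioned in the abstract more concrete, the following is a minimal sketch of unnormalized weighted UniFrac computed as an Earth Mover's Distance on a tree: each branch contributes its length times the absolute difference in the relative abundance carried beneath it in the two samples. It is not Visibiome's optimized implementation. The toy phylogeny `TREE`, the OTU names, and the function names are all hypothetical and chosen only for illustration; rarefaction and copy number correction are not shown.

```python
from collections import defaultdict

# Hypothetical toy phylogeny (an assumption, not from the paper):
# child node -> (parent node, length of the branch leading to the parent).
TREE = {
    "otu1": ("n1", 1.0),
    "otu2": ("n1", 1.0),
    "otu3": ("root", 2.0),
    "n1": ("root", 0.5),
}


def relative_abundance(counts):
    """Convert raw OTU counts into proportions that sum to 1."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sample has no reads")
    return {otu: c / total for otu, c in counts.items()}


def branch_mass(profile, tree):
    """For each branch, sum the proportions of all leaves below it.

    OTUs that are absent from the tree are silently ignored in this sketch.
    """
    mass = defaultdict(float)
    for otu, proportion in profile.items():
        node = otu
        while node in tree:  # walk from the leaf up to the root
            mass[node] += proportion
            node = tree[node][0]
    return mass


def weighted_unifrac(counts_a, counts_b, tree=TREE):
    """Unnormalized weighted UniFrac: sum of length * |mass_a - mass_b| per branch."""
    mass_a = branch_mass(relative_abundance(counts_a), tree)
    mass_b = branch_mass(relative_abundance(counts_b), tree)
    return sum(
        length * abs(mass_a[node] - mass_b[node])
        for node, (_parent, length) in tree.items()
    )


sample_a = {"otu1": 5, "otu2": 5}
sample_b = {"otu3": 10}
print(weighted_unifrac(sample_a, sample_b))  # 3.5
```

In this toy case all of sample A's mass has to travel from `otu1` and `otu2` to `otu3`, a path length of 3.5 for each half of the mass, which matches the branch-wise sum. Because the per-branch formula needs only a single pass over the tree, it avoids solving a general transport problem.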
Pages: 15