Visibiome: an efficient microbiome search engine based on a scalable, distributed

被引:1
|
作者
Azman, Syafiq Kamarul [1 ]
Anwar, Muhammad Zohaib [2 ]
Henschel, Andreas [1 ]
机构
[1] Masdar Inst Sci & Technol, Dept Elect Engn & Comp Sci, Abu Dhabi, U Arab Emirates
[2] Aarhus Univ, Dept Environm Sci, Frederiksborgvej 399, Roskilde, Denmark
来源
BMC BIOINFORMATICS | 2017年 / 18卷
关键词
Microbiome; Microbial diversity; Search engine; ALGORITHM; UNIFRAC;
D O I
10.1186/s12859-017-1763-0
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Given the current influx of 16S rRNA profiles of microbiota samples, it is conceivable that large amounts of them eventually are available for search, comparison and contextualization with respect to novel samples. This process facilitates the identification of similar compositional features in microbiota elsewhere and therefore can help to understand driving factors for microbial community assembly. Results: We present Visibiome, a microbiome search engine that can perform exhaustive, phylogeny based similarity search and contextualization of user-provided samples against a comprehensive dataset of 16S rRNA profiles environments, while tackling several computational challenges. In order to scale to high demands, we developed a distributed system that combines web framework technology, task queueing and scheduling, cloud computing and a dedicated database server. To further ensure speed and efficiency, we have deployed Nearest Neighbor search algorithms, capable of sublinear searches in high-dimensional metric spaces in combination with an optimized Earth Mover Distance based implementation of weighted UniFrac. The search also incorporates pairwise (adaptive) rarefaction and optionally, 16S rRNA copy number correction. The result of a query microbiome sample is the contextualization against a comprehensive database of microbiome samples from a diverse range of environments, visualized through a rich set of interactive figures and diagrams, including barchart-based compositional comparisons and ranking of the closest matches in the database. Conclusions: Visibiome is a convenient, scalable and efficient framework to search microbiomes against a comprehensive database of environmental samples. The search engine leverages a popular but computationally expensive, phylogeny based distance metric, while providing numerous advantages over the current state of the art tool.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] A Scalable and Efficient Correlation Engine to Detect Multi-step Attacks in Distributed Systems
    Lanoe, David
    Hurfin, Michel
    Totel, Eric
    [J]. 2018 IEEE 37TH INTERNATIONAL SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS (SRDS), 2018, : 31 - 40
  • [22] Scalable hybrid search on distributed databases
    Kim, J
    Fox, G
    [J]. COMPUTATIONAL SCIENCE - ICCS 2005, PT 3, 2005, 3516 : 431 - 438
  • [23] Efficient parallelised search engine based on virtual cluster
    Hung, Che Lun
    Lin, Chun-Yuan
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2016, 12 (01) : 53 - 57
  • [24] Reliable distributed search engine based on multiple meta servers
    Sato, N
    Udagawa, M
    Uehara, M
    Sakai, Y
    Mori, H
    [J]. FIRST INTERNATIONAL SYMPOSIUM ON CYBER WORLDS, PROCEEDINGS, 2002, : 79 - 84
  • [25] Distributed Search Engine Architecture Based On Topic Specific Searches
    Abudaqqa, Yousra
    Patel, Ahmed
    [J]. INTERNATIONAL CONFERENCE ON MATHEMATICS, ENGINEERING AND INDUSTRIAL APPLICATIONS 2014 (ICOMEIA 2014), 2015, 1660
  • [26] Design and Implementation of Distributed Image Search Engine Based on Content
    Sun Huabin
    Liu Xiaoxia
    [J]. FBIE: 2008 INTERNATIONAL SEMINAR ON FUTURE BIOMEDICAL INFORMATION ENGINEERING, PROCEEDINGS, 2008, : 64 - 66
  • [27] A topic-based and distributed search engine for business intelligence
    Fei, Yulian
    Zhu, Anding
    Wang, Guangmin
    [J]. DCABES 2006 PROCEEDINGS, VOLS 1 AND 2, 2006, : 913 - 917
  • [28] CellAtlasSearch: a scalable search engine for single cells
    Srivastava, Divyanshu
    Iyer, Arvind
    Kumar, Vibhor
    Sengupta, Debarka
    [J]. NUCLEIC ACIDS RESEARCH, 2018, 46 (W1) : W141 - W147
  • [29] A Scalable Movie Recommendation System on Search Engine
    Shi, Mingruo
    [J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON MECHATRONICS ENGINEERING AND INFORMATION TECHNOLOGY (ICMEIT 2017), 2017, 70 : 205 - 208
  • [30] Scalable and Efficient Comparison-based Search without Features
    Chumbalov, Daniyar
    Maystre, Lucas
    Grossglauser, Matthias
    [J]. 25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,