RabbitKSSD: accelerating genome distance estimation on modern multi-core architectures

被引:1
|
作者
Xu, Xiaoming [1 ]
Yin, Zekun [1 ]
Yan, Lifeng [1 ]
Yi, Huiguang [2 ]
Wang, Hua [1 ]
Schmidt, Bertil [3 ]
Liu, Weiguo [1 ]
机构
[1] Shandong Univ, Sch Software, Jinan, Peoples R China
[2] Chinese Acad Agr Sci, Agr Genom Inst, Genome Anal Lab,Minist Agr, Guangdong Lab Lingnan Modern Agr,Shenzhen Branch, Shenzhen, Peoples R China
[3] Johannes Gutenberg Univ Mainz, Inst Comp Sci, Mainz, Germany
关键词
D O I
10.1093/bioinformatics/btad695
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
We propose RabbitKSSD, a high-speed genome distance estimation tool. Specifically, we leverage load-balanced task partitioning, fast I/O, efficient intermediate result accesses, and high-performance data structures to improve overall efficiency. Our performance evaluation demonstrates that RabbitKSSD achieves speedups ranging from 5.7 x to 19.8 x over Kssd for the time-consuming sketch generation and distance computation on commonly used workstations. In addition, it significantly outperforms Mash, BinDash, and Dashing2. Moreover, RabbitKSSD can efficiently perform all-vs-all distance computation for all RefSeq complete bacterial genomes (455 GB in FASTA format) in just 2 min on a 64-core workstation.
引用
收藏
页数:4
相关论文
共 50 条
  • [21] Cache Efficiency and Scalability on Multi-core Architectures
    Mueller, Thomas
    Trinitis, Carsten
    Smajic, Jasmin
    PARALLEL COMPUTING TECHNOLOGIES, 2011, 6873 : 88 - +
  • [22] Bias Scheduling in Heterogeneous Multi-core Architectures
    Koufaty, David
    Reddy, Dheeraj
    Hahn, Scott
    EUROSYS'10: PROCEEDINGS OF THE EUROSYS 2010 CONFERENCE, 2010, : 125 - 138
  • [23] Leveraging Multi-Core Computing Architectures in Avionics
    Nowotsch, Jan
    Paulitsch, Michael
    2012 NINTH EUROPEAN DEPENDABLE COMPUTING CONFERENCE (EDCC 2012), 2012, : 132 - 143
  • [24] An Optimization for MapReduce Frameworks in Multi-core Architectures
    Ferreira, Tharso
    Espinosa, Antonio
    Carlos Moure, Juan
    Hernandez, Porfidio
    2013 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, 2013, 18 : 2587 - 2590
  • [25] Challenges for Timing Analysis of Multi-Core Architectures
    Reineke, Jan
    ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2017, (248): : 4 - +
  • [26] Impact of thermal constraints on multi-core architectures
    Li, Yingmin
    Lee, Benjamin
    Brooks, David
    Hu, Zhigang
    Skadron, Kevin
    2006 PROCEEDINGS 10TH INTERSOCIETY CONFERENCE ON THERMAL AND THERMOMECHANICAL PHENOMENA IN ELECTRONICS SYSTEMS, VOLS 1 AND 2, 2006, : 132 - +
  • [27] A framework for power estimation and reduction in multi-core architectures using basic block approach
    Babu, M. Rajasekhara
    Krishna, P. Venkata
    Khalid, M.
    INTERNATIONAL JOURNAL OF COMMUNICATION NETWORKS AND DISTRIBUTED SYSTEMS, 2013, 10 (01) : 40 - 51
  • [28] Accelerating the Reorthogonalization of Singular Vectors with a Multi-core Processor
    Toyokawa, Hiroki
    Ishigami, Hiroyuki
    Kimura, Kinji
    Takata, Masami
    Nakamura, Yoshimasa
    HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2012, 2013, 7851 : 379 - 390
  • [29] Accelerating the MMD algorithm using Multi-core Environments
    Schloesser, Michael
    Herpers, Rainer
    Kent, Kenneth B.
    2011 IEEE PACIFIC RIM CONFERENCE ON COMMUNICATIONS, COMPUTERS AND SIGNAL PROCESSING (PACRIM), 2011, : 340 - 345
  • [30] Accelerating sequential programs on commodity multi-core processors
    Zhang, Yuanming
    Xiao, Gang
    Baba, Takanobu
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2014, 74 (04) : 2257 - 2265