RabbitKSSD: accelerating genome distance estimation on modern multi-core architectures

被引:1
|
作者
Xu, Xiaoming [1 ]
Yin, Zekun [1 ]
Yan, Lifeng [1 ]
Yi, Huiguang [2 ]
Wang, Hua [1 ]
Schmidt, Bertil [3 ]
Liu, Weiguo [1 ]
机构
[1] Shandong Univ, Sch Software, Jinan, Peoples R China
[2] Chinese Acad Agr Sci, Agr Genom Inst, Genome Anal Lab,Minist Agr, Guangdong Lab Lingnan Modern Agr,Shenzhen Branch, Shenzhen, Peoples R China
[3] Johannes Gutenberg Univ Mainz, Inst Comp Sci, Mainz, Germany
关键词
D O I
10.1093/bioinformatics/btad695
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
We propose RabbitKSSD, a high-speed genome distance estimation tool. Specifically, we leverage load-balanced task partitioning, fast I/O, efficient intermediate result accesses, and high-performance data structures to improve overall efficiency. Our performance evaluation demonstrates that RabbitKSSD achieves speedups ranging from 5.7 x to 19.8 x over Kssd for the time-consuming sketch generation and distance computation on commonly used workstations. In addition, it significantly outperforms Mash, BinDash, and Dashing2. Moreover, RabbitKSSD can efficiently perform all-vs-all distance computation for all RefSeq complete bacterial genomes (455 GB in FASTA format) in just 2 min on a 64-core workstation.
引用
收藏
页数:4
相关论文
共 50 条
  • [41] High Performance Global Illumination on Multi-core Architectures
    Padron, Emilio J.
    Amor, Margarita
    Doallo, Ramon
    Boo, Montserrat
    PROCEEDINGS OF THE PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, 2009, : 93 - +
  • [42] Sparse matrix operations on several multi-core architectures
    Trinitis, Carsten
    Kuestner, Tilman
    Weidendorfer, Josef
    Smajic, Jasmin
    JOURNAL OF SUPERCOMPUTING, 2011, 57 (02): : 132 - 140
  • [43] Automated Tuning in Parallel Sorting on Multi-core Architectures
    Lin, Haibo
    Li, Chao
    Wang, Qian
    Zhao, Yi
    Pan, Ninghe
    Zhuang, Xiaotong
    Shao, Ling
    EURO-PAR 2010 PARALLEL PROCESSING, PT I, 2010, 6271 : 14 - +
  • [44] MODELS FOR PARALLEL WORKFLOW PROCESSING ON MULTI-CORE ARCHITECTURES
    Rauber, Thomas
    Ruenger, Gudula
    ICEIS 2008: PROCEEDINGS OF THE TENTH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL ISAS-2: INFORMATION SYSTEMS ANALYSIS AND SPECIFICATION, VOL 2, 2008, : 220 - +
  • [45] Optimizing process creation and execution on multi-core architectures
    Kulkarni, Abhishek
    Ionkov, Latchesar
    Lang, Michael
    Lumsdaine, Andrew
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2013, 27 (02): : 147 - 161
  • [46] Automatic Parallelization of Simulink Models for Multi-core Architectures
    Tuncali, Cumhur Erkan
    Fainekos, Georgios
    Lee, Yann-Hang
    2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 964 - 971
  • [47] Performance issues in emerging homogeneous multi-core architectures
    Kayi, Abdullah
    El-Ghazawi, Tarek
    Newby, Gregory B.
    SIMULATION MODELLING PRACTICE AND THEORY, 2009, 17 (09) : 1485 - 1499
  • [48] Practice of Parallelizing Network Applications on Multi-core Architectures
    Wang, Junchang
    Cheng, Haipeng
    Hua, Bei
    Tang, Xinan
    ICS'09: PROCEEDINGS OF THE 2009 ACM SIGARCH INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, 2009, : 204 - 213
  • [49] A parallel algorithm for coverage optimization on multi-core architectures
    Wei, Ran
    Murray, Alan T.
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2016, 30 (03) : 432 - 450
  • [50] Hardware Supported Time Synchronization in Multi-Core Architectures
    Lynch, Elizabeth Whitaker
    Riley, George F.
    PADS 2009: 23RD WORKSHOP ON PRINCIPLES OF ADVANCED AND DISTRIBUTED SIMULATION, PROCEEDINGS, 2009, : 88 - 94