A performance comparison of data and memory allocation strategies for sequence aligners on NUMA architectures

被引:0
|
作者
Josefina Lenis
Miquel Angel Senar
机构
[1] Universitat Autònoma de Barcelona,Computer Architecture and Operating Systems Department
来源
Cluster Computing | 2017年 / 20卷
关键词
NUMA; Data partitioning; Memory system performance; Genomic aligners; BWA; Bowtie; GEM; SNAP;
D O I
暂无
中图分类号
学科分类号
摘要
Over the last several years, many sequence alignment tools have appeared and become popular for the fast evolution of next generation sequencing technologies. Obviously, researchers that use such tools are interested in getting maximum performance when they execute them in modern infrastructures. Today’s NUMA (Non-uniform memory access) architectures present major challenges in getting such applications to achieve good scalability as more processors/cores are used. The memory system in NUMA systems shows a high complexity and may be the main cause for the loss of an application’s performance. The existence of several memory banks in NUMA systems implies a logical increase in latency associated with the accesses of a given processor to a remote bank. This phenomenon is usually attenuated by the application of strategies that tend to increase the locality of memory accesses. However, NUMA systems may also suffer from contention problems that can occur when concurrent accesses are concentrated on a reduced number of banks. Sequence alignment tools use large data structures to contain reference genomes to which all reads are aligned. Therefore, these tools are very sensitive to performance problems related to the memory system. The main goal of this study is to explore the trade-offs between data locality and data dispersion in NUMA systems. We have performed experiments with several popular sequence alignment tools on two widely available NUMA systems to assess the performance of different memory allocation policies and data partitioning strategies. We find that there is not one method that is best in all cases. However, we conclude that memory interleaving is the memory allocation strategy that provides the best performance when a large number of processors and memory banks are used. In the case of data partitioning, the best results are usually obtained when the number of partitions used is greater, sometimes combined with an interleave policy.
引用
收藏
页码:1909 / 1924
页数:15
相关论文
共 42 条
  • [1] A performance comparison of data and memory allocation strategies for sequence aligners on NUMA architectures
    Lenis, Josefina
    Angel Senar, Miquel
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2017, 20 (03): : 1909 - 1924
  • [2] Optimized Execution Strategies for Sequence Aligners on NUMA Architectures
    Lenis, Josefina
    Senar, Miquel Angel
    [J]. EURO-PAR 2016: PARALLEL PROCESSING WORKSHOPS, 2017, 10104 : 492 - 503
  • [3] Strategies for Dynamic Memory Allocation in Hybrid Architectures
    Bertels, Peter
    Heirman, Wim
    Stroobandt, Dirk
    [J]. CF'09: CONFERENCE ON COMPUTING FRONTIERS & WORKSHOPS, 2009, : 217 - 220
  • [4] Evaluation of memory performance in NUMA architectures using Stochastic Reward Nets
    Entezari-Maleki, Reza
    Cho, Younghyun
    Egger, Bernhard
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, 144 : 172 - 188
  • [5] AdaptMD: Balancing Space and Performance in NUMA Architectures With Adaptive Memory Deduplication
    Yao, Lulu
    Li, Yongkun
    Lee, Patrick P. C.
    Wang, Xiaoyang
    Xu, Yinlong
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2024, 73 (06) : 1588 - 1602
  • [6] Using Data Dependencies to Improve Task-Based Scheduling Strategies on NUMA Architectures
    Virouleau, Philippe
    Broquedis, Francois
    Gautier, Thierry
    Rastello, Fabrice
    [J]. EURO-PAR 2016: PARALLEL PROCESSING, 2016, 9833 : 531 - 544
  • [7] Achieving High Performance With TCP Over 40 GbE on NUMA Architectures for CMS Data Acquisition
    Bawej, Tomasz
    Behrens, Ulf
    Branson, James
    Chaze, Olivier
    Cittolin, Sergio
    Darlea, Georgiana-Lavinia
    Deldicque, Christian
    Dobson, Marc
    Dupont, Aymeric
    Erhan, Samim
    Forrest, Andrew
    Gigi, Dominique
    Glege, Frank
    Gomez-Ceballos, Guillelmo
    Gomez-Reino, Robert
    Hegeman, Jeroen
    Holzner, Andre
    Masetti, Lorenzo
    Meijers, Frans
    Meschi, Emilio
    Mommsen, Remigius K.
    Morovic, Srecko
    Nunez-Barranco-Fernandez, Carlos
    O'Dell, Vivian
    Orsini, Luciano
    Paus, Christoph
    Petrucci, Andrea
    Pieri, Marco
    Racz, Attila
    Sakulin, Hannes
    Schwick, Christoph
    Stieger, Benjamin
    Sumorok, Konstanty
    Veverka, Jan
    Wakefield, Christopher C.
    Zejdl, Petr
    [J]. IEEE TRANSACTIONS ON NUCLEAR SCIENCE, 2015, 62 (03) : 1091 - 1098
  • [8] NUMA-aware Scheduling and Memory Allocation for data-flow task-parallel Applications
    Drebes, Andi
    Pop, Antoniu
    Heydemann, Karine
    Drach, Nathalie
    Cohen, Albert
    [J]. ACM SIGPLAN NOTICES, 2016, 51 (08) : 391 - 392
  • [9] Achieving High Performance with TCP over 40GbE on NUMA Architectures for CMS Data Acquisition
    Bawej, Tomasz
    Behrens, Ulf
    Branson, James
    Chaze, Olivier
    Cittolin, Sergio
    Darlea, Georgiana-Lavinia
    Deldicque, Christian
    Dobson, Marc
    Dupont, Aymeric
    Erhan, Samim
    Forrest, Andrew
    Gigi, Dominique
    Glege, Frank
    Gomez-Ceballos, Guillelmo
    Gomez-Reino, Robert
    Hegeman, Jeroen
    Holzner, Andre
    Masetti, Lorenzo
    Meijers, Frans
    Meschi, Emilio
    Mommsen, Remigius K.
    Morovic, Srecko
    Nunez-Barranco-Fernandez, Carlos
    O'Dell, Vivian
    Orsini, Luciano
    Paus, Christoph
    Petrucci, Andrea
    Pieri, Marco
    Racz, Attila
    Sakulin, Hannes
    Schwick, Christoph
    Stieger, Benjamin
    Sumorok, Konstanty
    Veverka, Jan
    Wakefield, Christopher C.
    Zejdl, Petr
    [J]. 2014 19TH IEEE-NPSS REAL TIME CONFERENCE (RT), 2014,
  • [10] Performance comparison of switch architectures with Shareable Parallel Memory Modules
    Kumar, S
    Munoz, A
    [J]. IEEE COMMUNICATIONS LETTERS, 2005, 9 (11) : 1015 - 1017