FIFO-based Hardware Sorters for High Bandwidth Memory

被引:0
|
作者
Nakano, Koji [1 ]
Ito, Yasuaki [1 ]
Bordim, Jacir L. [2 ]
机构
[1] Hiroshima Univ, Dept Informat Engn, Kagamiyama 1-4-1, Higashihiroshima 7398527, Japan
[2] Univ Brasilia, Dept Comp Sci, BR-70910900 Brasilia, DF, Brazil
关键词
parallel sorting algorithms; hardware sorter; high bandwidth memory; burst memory access; big data analysis;
D O I
10.1109/IPDPSW.2019.00112
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The main contribution of this paper is to show efficient FIFO-based hardware sorters that sort n elements with w bits each stored in a high bandwidth memory with modest access latency. We assume that each address of the high bandwidth memory can store p elements of w bits each, which can be read or written at the same time. The access latency l of the high bandwidth memory is assumed to take l clock cycles to access p elements in a specified address. Furthermore, burst mode is supported and k (>= 1) consecutive addresses can be accessed in k + l - 1 clock cycles in a pipeline fashion. However, if k addresses are not consecutive, kl clock cycles are necessary to access all of them. Clearly, all n elements arranged n/p addresses can be duplicated in 2(n/p + l - 1) clock cycles. We present two types of hardware sorters that sort n = rc elements stored in an r x c matrix of the high bandwidth memory. We first develop Three-Pass-Sort and Four-Pass-Sort that sort an r x c matrix by reading from and witting in it three times and four times, respectively. We implement these two algorithms using FIFO-based mergers that can be configured as pairwise mode and sliding mode. Our hardware sorter based on Three-Pass-Sort runs in 6n/p + 3c(2)/p(2)l + O(c/p (l + log r) + r) clock cycles using a circuit of size O(rwp) provided that r >= c(2). Also, our hardware sorter based on Four-Pass-Sort runs in 8n/p + 2c(2)l + O(cl + log r + p) clock cycles using a circuit of size O(rw).
引用
收藏
页码:663 / 672
页数:10
相关论文
共 50 条
  • [1] A Hardware Design Generator of High-Performance FIFO-based Linear Insertion Streaming Sorters
    Petrovic, Marija L.
    Milovanovic, Vladimir M.
    [J]. 2023 30TH INTERNATIONAL CONFERENCE ON MIXED DESIGN OF INTEGRATED CIRCUITS AND SYSTEM, MIXDES, 2023, : 79 - 82
  • [2] Design of memory efficient FIFO-based merge sorter
    Kim, Youngil
    Choi, Seungdo
    Song, Yong Ho
    [J]. IEICE ELECTRONICS EXPRESS, 2018, 15 (05):
  • [3] A FIFO-based architecture for high speed image compression
    Masoudnia, A
    Sarbazi-Azad, H
    Boussakta, S
    [J]. ICECS 2001: 8TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS, VOLS I-III, CONFERENCE PROCEEDINGS, 2001, : 221 - 224
  • [4] Service provisioning policy for FIFO-based GFR scheme
    Fujitsu Lab Ltd, Kawasaki, Japan
    [J]. IEEE ATM Workshop, Proceedings, 1999, : 31 - 36
  • [5] A simple FIFO-based scheme for differentiated loss guarantees
    Huang, Yaqing
    Guerin, Roch
    [J]. COMPUTER NETWORKS, 2007, 51 (04) : 1133 - 1150
  • [6] A simple FIFO-based scheme for differentiated loss guarantees
    Huang, YQ
    Guérin, R
    [J]. 2004 TWELFTH IEEE INTERNATIONAL WORKSHOP ON QUALITY OF SERVICE, 2004, : 96 - 105
  • [7] A FIFO-BASED MULTICAST NETWORK AND ITS USE IN MULTICOMPUTERS
    MOONA, R
    RAJARAMAN, V
    [J]. MICROPROCESSORS AND MICROSYSTEMS, 1991, 15 (10) : 543 - 547
  • [8] A FIFO-based buffer management approach for the ATM GFR services
    Chan, CT
    Hu, SC
    Wang, PC
    Chen, YC
    [J]. IEEE COMMUNICATIONS LETTERS, 2000, 4 (06) : 205 - 207
  • [9] SUPPORTING DISTRIBUTED OBJECTS IN FIFO-BASED MESSAGE-PASSING SYSTEMS
    CHANG, WT
    TSENG, CC
    [J]. JOURNAL OF OBJECT-ORIENTED PROGRAMMING, 1995, 7 (09): : 56 - &
  • [10] Array processor featuring an effective FIFO-based data stream management
    Miyazaki, Toshiaki
    Nomoto, Yuusuke
    Sato, Yuka
    Sedukhin, Stanislav G.
    [J]. 2008 IEEE 8TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2008, : 255 - 260