A High Throughput Parallel Hash Table on FPGA using XOR-based Memory

被引:18
|
作者
Zhang, Ruizhi [1 ]
Wijeratne, Sasindu [1 ]
Yang, Yang [1 ]
Kuppannagari, Sanmukh R. [1 ]
Prasanna, Viktor K. [1 ]
机构
[1] Univ Southern Calif, Dept Elect & Comp Engn, Los Angeles, CA 90007 USA
基金
美国国家科学基金会;
关键词
Parallel Hash table; FPGA Acceleration;
D O I
10.1109/hpec43674.2020.9286199
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Hash table is a fundamental data structure for quick search and retrieval of data. It is a key component in complex graph analytics and AI/ML applications. State-of-the-art parallel hash table implementations either make some simplifying assumptions such as supporting only a subset of hash table operations or employ optimizations that lead to performance that is highly data dependent and in the worst case can be similar to a sequential implementation. In contrast, in this work we develop a dynamic hash table that supports all the hash table queries - search, insert, delete, update, while allowing us to support p parallel queries (p > 1) per clock cycle via p processing engines (PEs) in the worst case i.e. the performance is data agnostic. We achieve this by implementing novel XOR based multi-ported block memories on FPGAs. Additionally, we develop a technique to optimize the memory requirement of the hash table if the ratio of search to insert/update/delete queries is known beforehand. We implement our design on state-of-the-art FPGA devices. Our design is scalable to 16 PEs and supports throughput up to 5926 MOPS. It matches the throughput of the state-of-the-art hash table design - FASTHash, which only supports search and insert operations. Comparing with the best FPGA design that supports the same set of operations, our hash table achieves up to 12.3 x speedup.
引用
收藏
页数:7
相关论文
共 50 条
  • [21] Performance of an Intuitive Hash Table in Shared-Memory Parallel Programs
    Cischke, Christopher
    HIGH PERFORMANCE COMPUTING SYMPOSIUM 2013 (HPC 2013) - 2013 SPRING SIMULATION MULTI-CONFERENCE (SPRINGSIM'13), 2013, 45 (06): : 10 - 14
  • [22] Accelerating XOR-Based Erasure Coding using Program Optimization Techniques
    Uezato, Yuya
    SC21: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2021,
  • [23] XOR-based visual secret sharing scheme using pixel vectorization
    Kannojia, Suresh Prasad
    Kumar, Jasvant
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (10) : 14609 - 14635
  • [24] In-Memory Checkpointing for MPI Programs by XOR-Based Double-Erasure Codes
    Wang, Gang
    Liu, Xiaoguang
    Li, Ang
    Zhang, Fan
    RECENT ADVANCES IN PARALLEL VIRTUAL MACHINE AND MESSAGE PASSING INTERFACE, PROCEEDINGS, 2009, 5759 : 84 - 93
  • [25] Enhanced XOR-Based Progressive Visual Secret Sharing Using Multiple Decryptions
    Sachan, Vishal Singh
    Yadav, Mainejar
    Ranvijay
    ADVANCES IN VLSI, COMMUNICATION, AND SIGNAL PROCESSING, 2020, 587 : 949 - 962
  • [26] XOR-based progressive visual secret sharing using generalized random grids
    Chao, Her-Chang
    Fan, Tzuo-Yau
    DISPLAYS, 2017, 49 : 6 - 15
  • [27] A Chunk-Based Hash Table Caching Method for In-Memory Hash Joins
    Wei, Xing
    Hu, Huiqi
    Zhou, Xuan
    Zhou, Aoying
    WEB INFORMATION SYSTEMS ENGINEERING, WISE 2020, PT II, 2020, 12343 : 376 - 389
  • [28] FPGA implementation of AES algorithm for high throughput using folded parallel architecture
    Rahimunnisa, K.
    Karthigaikumar, P.
    Rasheed, Soumiya
    Jayakumar, J.
    SureshKumar, S.
    SECURITY AND COMMUNICATION NETWORKS, 2014, 7 (11) : 2225 - 2236
  • [29] High throughput Parallel Montgomery Modular Exponentiation on FPGA
    Nadjia, Anane
    Mohamed, Anane
    2014 9TH INTERNATIONAL DESIGN & TEST SYMPOSIUM (IDT), 2014, : 225 - 230
  • [30] Ultra High Throughput Implementations for MD5 Hash Algorithm on FPGA
    Wang, Yuliang
    Zhao, Qiuxia
    Jiang, Liehui
    Shao, Yi
    HIGH PERFORMANCE COMPUTING AND APPLICATIONS, 2010, 5938 : 433 - 441