Scalable Suffix Sorting on a Multicore Machine

Cited by: 6
Authors
Xie, Jing Yi [1]
Nong, Ge [1]
Lao, Bin [2]
Xu, Wentao [1]
Affiliations
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Guangdong, Peoples R China
[2] Guangdong Univ Foreign Studies, Sch Informat Sci & Technol, Guangzhou 510420, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Sorting; Random access memory; Indexes; Multicore processing; Arrays; Task analysis; Big Data; Suffix sorting; algorithm design; multicore computer; ARRAY CONSTRUCTION;
DOI
10.1109/TC.2020.2972546
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
A number of methods have been proposed for suffix sorting in internal memory (RAM) and in external memory (hard disks). The current best results in both settings are achieved by several algorithms that apply the induced sorting (IS) method in various ways. While these algorithms are efficient, the internal-memory ones differ considerably in design from the external-memory ones. A scalable IS method that can be applied to suffix sorting in both internal and external memory is highly desirable. This article proposes a blockwise IS method to facilitate pipelined access in internal memory and sequential I/Os in external memory. It describes in detail an algorithm that uses this method in a 4-stage pipeline with multiple threads, where the threads parallelize not only the pipelined stages of consecutive blocks but also the tasks within each stage wherever possible. In experiments on a set of realistic and artificial datasets, this algorithm achieves better overall time and space performance than the existing best results from pSACAK, pDSS and pKS. Besides sorting suffixes in internal memory in linear time, the proposed method can be ported to external memory to sort very large numbers of suffixes with linear I/O complexity.
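For readers unfamiliar with the terms in the abstract, the following is a minimal sketch of what suffix sorting produces and of the L/S suffix classification on which induced sorting is commonly based. It is not the blockwise, pipelined algorithm of the article: the function names are hypothetical, the suffix array is built by a naive comparison sort purely as a reference, and the sentinel character `"\0"` is an assumption of this sketch.

```python
def classify_suffix_types(text: str) -> str:
    """Mark each suffix of text + sentinel as 'L' (larger) or 'S' (smaller).

    Suffix i is L-type if it is lexicographically larger than suffix i+1,
    and S-type otherwise. The sentinel suffix is S-type by convention.
    """
    s = text + "\0"  # assumed sentinel, smaller than every other character
    n = len(s)
    types = ["S"] * n
    # Scan right to left: the type of suffix i depends only on s[i], s[i+1]
    # and the already-known type of suffix i+1.
    for i in range(n - 2, -1, -1):
        if s[i] > s[i + 1] or (s[i] == s[i + 1] and types[i + 1] == "L"):
            types[i] = "L"
    return "".join(types)


def lms_positions(types: str) -> list[int]:
    """Return leftmost-S positions: an S-type suffix preceded by an L-type one."""
    return [i for i in range(1, len(types))
            if types[i] == "S" and types[i - 1] == "L"]


def naive_suffix_array(text: str) -> list[int]:
    """Reference suffix array via direct comparison sort (not linear time)."""
    s = text + "\0"
    return sorted(range(len(s)), key=lambda i: s[i:])


if __name__ == "__main__":
    types = classify_suffix_types("banana")
    print(types)                         # LSLSLLS
    print(lms_positions(types))          # [1, 3, 6]
    print(naive_suffix_array("banana"))  # [6, 5, 3, 1, 0, 4, 2]
```

Induced sorting methods typically sort only the leftmost-S suffixes directly and then derive the order of the remaining suffixes from them by scanning; the naive sort above is included only to show the target output.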
Pages: 1364 - 1375
Page count: 12
Related papers
50 in total
  • [1] Fast In-Place Suffix Sorting on a Multicore Computer
    Lao, Bin
    Nong, Ge
    Chan, Wai Hong
    Xie, Jing Yi
    IEEE TRANSACTIONS ON COMPUTERS, 2018, 67 (12) : 1737 - 1749
  • [2] Fast induced sorting suffixes on a multicore machine
    Lao, Bin
    Nong, Ge
    Chan, Wai Hong
    Pan, Yi
    JOURNAL OF SUPERCOMPUTING, 2018, 74 (07) : 3468 - 3485
  • [3] Fast induced sorting suffixes on a multicore machine
    Bin Lao
    Ge Nong
    Wai Hong Chan
    Yi Pan
    The Journal of Supercomputing, 2018, 74 : 3468 - 3485
  • [4] Faster suffix sorting
    Larsson, N. Jesper
    Sadakane, Kunihiko
    THEORETICAL COMPUTER SCIENCE, 2007, 387 (03) : 258 - 272
  • [5] In-place suffix sorting
    Franceschini, G.
    Muthukrishnan, S.
    AUTOMATA, LANGUAGES AND PROGRAMMING, PROCEEDINGS, 2007, 4596 : 533 - +
  • [6] Faster Sparse Suffix Sorting
    Tomohiro, I
    Karkkainen, Juha
    Kempa, Dominik
    31ST INTERNATIONAL SYMPOSIUM ON THEORETICAL ASPECTS OF COMPUTER SCIENCE (STACS 2014), 2014, 25 : 386 - 396
  • [7] An Efficient Algorithm for Suffix Sorting
    Peng, Zhan
    Wang, Yuping
    Xue, Xingsi
    Wei, Jingxuan
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2016, 30 (06)
  • [8] Grammar Compression by Induced Suffix Sorting
    Nunes D.S.N.
    Louza F.A.
    Gog S.
    Ayala-Rincón M.
    Navarro G.
    ACM Journal of Experimental Algorithmics, 2022, 27 (02)
  • [9] Optimal In-Place Suffix Sorting
    Li, Zhize
    Li, Jian
    Huo, Hongwei
    STRING PROCESSING AND INFORMATION RETRIEVAL, SPIRE 2018, 2018, 11147 : 268 - 284
  • [10] Succinct suffix sorting in external memory
    Han, Ling Bo
    Wu, Yi
    Nong, Ge
    INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (01)