A Parallel Quicksort Algorithm on Manycore Processors in Sunway TaihuLight

被引:0
|
作者
Ren, Siyuan [1 ]
Xu, Shizhen [1 ]
Yang, Guangwen [1 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
来源
关键词
D O I
10.1007/978-3-319-93713-7_61
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper we present a highly efficient parallel quicksort algorithm on SW26010, a heterogeneous manycore processor that makes Sunway TaihuLight the Top-One supercomputer in the world. Motivated by the software-cache and on-chip communication design of SW26010, we propose a two-phase quicksort algorithm, with the first counting elements and the second moving elements. To make the best of such manycore architecture, we design a decentralized workflow, further optimize the memory access and balance the workload. Experiments show that our algorithm scales efficiently to 64 cores of SW26010, achieving more than 32X speedup for int32 elements on all kinds of data distributions. The result outperforms the strong scaling one of Intel TBB (Threading Building Blocks) version of quicksort on x86-64 architecture.
引用
收藏
页码:647 / 653
页数:7
相关论文
共 50 条
  • [1] swPTS: an efficient parallel Thomas split algorithm for tridiagonal systems on Sunway manycore processors
    Min Tian
    Qi Liu
    Jingshan Pan
    Ying Gou
    Zanjun Zhang
    [J]. The Journal of Supercomputing, 2024, 80 : 4682 - 4706
  • [2] swPTS: an efficient parallel Thomas split algorithm for tridiagonal systems on Sunway manycore processors
    Tian, Min
    Liu, Qi
    Pan, Jingshan
    Gou, Ying
    Zhang, Zanjun
    [J]. JOURNAL OF SUPERCOMPUTING, 2024, 80 (04): : 4682 - 4706
  • [3] Heterogeneous Parallel Algorithm Design and Performance Optimization for WENO on the Sunway TaihuLight Supercomputer
    Huang, Jianqiang
    Han, Wentao
    Wang, Xiaoying
    Chen, Wenguang
    [J]. TSINGHUA SCIENCE AND TECHNOLOGY, 2020, 25 (01) : 56 - 67
  • [4] Heterogeneous Parallel Algorithm Design and Performance Optimization for WENO on the Sunway TaihuLight Supercomputer
    Jianqiang Huang
    Wentao Han
    Xiaoying Wang
    Wenguang Chen
    [J]. Tsinghua Science and Technology, 2020, 25 (01) : 56 - 67
  • [5] Implementation and Optimization of AES Algorithm on the Sunway TaihuLight
    Chen, Yuedan
    Li, Kenli
    Fei, Xiongwei
    Quan, Zhe
    Li, Keqin
    [J]. 2016 17TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES (PDCAT), 2016, : 256 - 261
  • [6] Regional Ocean Model Parallel Optimization in Sunway TaihuLight
    Wu, Qi
    Ni, Yufang
    Huang, Xiaomeng
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2019, 56 (07): : 1556 - 1566
  • [7] A Novel Acceleration Method for DGTD Algorithm on Sunway TaihuLight
    Chen, Geng
    Zhao, Lei
    Yu, Wenhua
    Ren, Hu
    Fu, Haohuan
    [J]. PROCEEDINGS OF THE 2018 IEEE 7TH ASIA-PACIFIC CONFERENCE ON ANTENNAS AND PROPAGATION (APCAP), 2018, : 153 - 154
  • [8] SW-AES: Accelerating AES Algorithm on the Sunway TaihuLight
    Li, Liandeng
    Fang, Jiarui
    Jiang, Jinlei
    Gan, Lin
    Zheng, Weijie
    Fu, Haohuan
    Yang, Guanwen
    [J]. 2017 15TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS AND 2017 16TH IEEE INTERNATIONAL CONFERENCE ON UBIQUITOUS COMPUTING AND COMMUNICATIONS (ISPA/IUCC 2017), 2017, : 1204 - 1211
  • [9] swCaffe: a Parallel Framework for Accelerating Deep Learning Applications on Sunway TaihuLight
    Li, Liandeng
    Fang, Jiarui
    Fu, Haohuan
    Jiang, Jinlei
    Zhao, Wenlai
    He, Conghui
    You, Xin
    Yang, Guangwen
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2018, : 413 - 422
  • [10] A PARALLEL QUICKSORT ALGORITHM
    BROWN, T
    XIONG, RB
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1993, 19 (02) : 83 - 89