In-Place Data Sliding Algorithms for Many-Core Architectures

Cited by: 10
Authors
Gomez-Luna, Juan [1]
Chang, Li-Wen [2]
Hwu, Wen-Mei W. [2]
Sung, I-Jui [3]
Guil, Nicolas [4]
Affiliations
[1] Univ Cordoba, Comp Architecture & Elect, Cordoba, Spain
[2] Univ Illinois, Elect & Comp Engn, Urbana, IL 61801 USA
[3] MulticoreWare Inc, Champaign, IL USA
[4] Univ Malaga, Comp Architecture, Malaga, Spain
Keywords
in-place; stream compaction; relational algebra
DOI
10.1109/ICPP.2015.30
Chinese Library Classification
TP3 [Computing technology, computer technology]
Subject classification code
0812
Abstract
In-place data manipulation is very desirable in many-core architectures with limited on-board memory. This paper deals with the in-place implementation of a class of primitives that perform data movements in one direction. We call these primitives Data Sliding (DS) algorithms. Notable among them are relational algebra primitives (such as select and unique), padding to insert empty elements in a data structure, and stream compaction to reduce memory requirements. Their in-place implementation in a bulk synchronous parallel model, such as that of GPUs, is especially challenging because it is difficult to synchronize threads executing on different compute units. Using a novel adjacent work-group synchronization technique, we propose two algorithmic schemes for regular and irregular DS algorithms. With a set of five benchmarks, we validate our approaches and compare them to the state-of-the-art implementations of these benchmarks. Our regular DS algorithms achieve up to 9.11x and 73.25x the throughput of their competitors on NVIDIA and AMD GPUs, respectively. Our irregular DS algorithms outperform the NVIDIA Thrust library by up to 3.24x on the three most recent generations of NVIDIA GPUs.
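As a rough illustration of what "data movements in one direction" means, the sequential Python sketch below shows one irregular DS primitive (stream compaction) and one regular DS primitive (padding), both done in place. It is not the paper's GPU scheme and does not model adjacent work-group synchronization; the function names `compact_in_place` and `pad_rows_in_place` and their parameters are assumptions made for this example only.

```python
def compact_in_place(data, is_kept):
    """Slide kept elements toward the front; return the new logical length.

    Every element moves left or stays put, so the write index never
    passes the read index and no unread element is overwritten.
    """
    write = 0
    for read in range(len(data)):
        value = data[read]
        if is_kept(value):
            data[write] = value
            write += 1
    return write


def pad_rows_in_place(data, rows, cols, padded_cols, fill=0):
    """Widen a row-major rows x cols matrix to padded_cols columns.

    The matrix occupies the first rows * cols slots of `data`, which must
    already have room for rows * padded_cols elements. Every element moves
    right or stays put, so the traversal runs from the last element back
    to the first to avoid overwriting unread sources.
    """
    assert padded_cols >= cols
    assert len(data) >= rows * padded_cols
    for r in range(rows - 1, -1, -1):
        for c in range(padded_cols - 1, -1, -1):
            if c < cols:
                data[r * padded_cols + c] = data[r * cols + c]
            else:
                data[r * padded_cols + c] = fill
    return data


# Stream compaction: drop zeros, keep the relative order of the rest.
stream = [3, 0, 5, 0, 0, 7]
kept = compact_in_place(stream, lambda x: x != 0)
compacted = stream[:kept]

# Padding: a 2 x 3 matrix gains one empty column per row.
matrix = [1, 2, 3, 4, 5, 6, None, None]
padded = pad_rows_in_place(matrix, rows=2, cols=3, padded_cols=4)
```

In this single-threaded version the traversal order alone guarantees that a destination is never read after it has been written. On a GPU, where tiles are handled by different work-groups, that ordering has to be enforced explicitly, which is the synchronization difficulty the abstract refers to.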
Pages: 210-219
Number of pages: 10