Scalable energy-efficient parallel sorting on a fine-grained many-core processor array

Cited by: 0
Authors
Stillmaker, Aaron [1, 2]
Bohnenstiehl, Brent [1]
Stillmaker, Lucas [1]
Baas, Bevan [1]
Affiliations
[1] Univ Calif Davis, Elect & Comp Engn Dept, One Shields Ave, Davis, CA 95616 USA
[2] Calif State Univ, Elect & Comp Engn Dept, Fresno 2320 E San Ramon Ave, Fresno, CA 93740 USA
Funding
U.S. National Science Foundation
Keywords
Parallel processing; External sorting; Scalable sorting; Fine-grained many-core; Processor array; PLATFORM; ASAP; NM;
DOI
10.1016/j.jpdc.2019.12.011
Chinese Library Classification (CLC) number
TP301 [Theory and methods]
Discipline classification code
081202
Abstract
Three parallel sorting applications and two list output protocols for the first phase of an external sort are executed on a fine-grained many-core processor array that contains no algorithm-specific hardware and acts as a co-processor, across a variety of array sizes. Results are generated using a cycle-accurate model based on measured data from a fabricated many-core chip and are simulated for different processor array sizes. The data show that the most energy-efficient first-phase many-core sort requires over 65x lower energy than the GNU C++ standard library sort (std::sort) performed on an Intel laptop-class processor and over 105x lower energy than a radix sort running on an Nvidia GPU. In addition, the highest-throughput first-phase many-core sort is over 9.8x faster than the std::sort and over 14x faster than the radix sort. Both phases of a 10 GB external sort require a 6.2x lower energy×time (energy-delay) product than the std::sort and an over 13x lower energy×time product than the radix sort. © 2019 Elsevier Inc. All rights reserved.
Pages: 32-47
Number of pages: 16
Related papers
  • [1] Fine-Grained Energy-Efficient Sorting on a Many-Core Processor Array
    Stillmaker, Aaron
    Stillmaker, Lucas
    Baas, Bevan
    [J]. PROCEEDINGS OF THE 2012 IEEE 18TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS 2012), 2012, : 652 - 659
  • [2] Thermal Management of a Many-Core Processor under Fine-Grained Parallelism
    Keceli, Fuat
    Moreshet, Tali
    Vishkin, Uzi
    [J]. EURO-PAR 2011: PARALLEL PROCESSING WORKSHOPS, PT I, 2012, 7155 : 249 - 259
  • [3] Display Stream Compression Decoders for Fine-Grained Many-Core Processor Arrays
    Wu, Shifu
    Baas, Bevan M.
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2021, 68 (05) : 1730 - 1734
  • [4] Study on Fine-grained Synchronization in Many-Core Architecture
    Yu, Lei
    Liu, Zhiyong
    Fan, Dongrui
    Song, Fenglong
    Zhang, Junchao
    Yuan, Nan
    [J]. SNPD 2009: 10TH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCES, NETWORKING AND PARALLEL DISTRIBUTED COMPUTING, PROCEEDINGS, 2009, : 524 - 529
  • [5] A Fine-Grained Parallel Particle Swarm Optimization on Many-core and Multi-core Architectures
    Nedjah, Nadia
    Calazan, Rogerio de Moraes
    Mourelle, Luiza de Macedo
    [J]. PARALLEL COMPUTING TECHNOLOGIES (PACT 2017), 2017, 10421 : 215 - 224
  • [6] A High-Performance Parallel CAVLC Encoder on a Fine-Grained Many-Core System
    Xiao, Zhibin
    Baas, Bevan
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN, 2008, : 248 - 254
  • [7] AsAP: A fine-grained many-core platform for DSP applications
    Baas, Bevan
    Yu, Zhiyi
    Meeuwsen, Michael
    Sattari, Omar
    Apperson, Ryan
    Work, Eric
    Webb, Jeremy
    Lai, Michael
    Mohsenin, Tinoosh
    Truong, Dean
    Cheung, Jason
    [J]. IEEE MICRO, 2007, 27 (02) : 34 - 45
  • [8] Energy-efficient canonical Huffman decoders on many-core processor arrays and FPGAs
    Sarangi, Satyabrata
    Baas, Bevan
    [J]. INTEGRATION-THE VLSI JOURNAL, 2023, 88 : 156 - 165
  • [9] Enabling Extremely Fine-grained Parallelism via Scalable Concurrent Queues on Modern Many-core Architectures
    Nookala, Poornima
    Dinda, Peter
    Hale, Kyle C.
    Chard, Kyle
    Raicu, Ioan
    [J]. 29TH INTERNATIONAL SYMPOSIUM ON THE MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS 2021), 2021, : 65 - 72
  • [10] XGRID: A Scalable Many-Core Embedded Processor
    Gunes, Volkan
    Givargis, Tony
    [J]. 2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 1143 - 1146