Two Simple Tricks for Fast Cache-Aware Parallel Particle Swarm Optimization

被引:0
|
作者
Hajewski, Jeff [1 ]
Oliveira, Suely [1 ]
机构
[1] Univ Iowa, Dept Comp Sci, Iowa City, IA 52242 USA
关键词
Particle Swarm Optimization; Data Oriented Design; Parallel PSO; CORE;
D O I
10.1109/cec.2019.8790219
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Particle Swarm Optimization is an example of a trivially parallelizable algorithm where good performance gains can be achieved through the use of a few OpenMP pragmas. Writing an efficient parallel PSO algorithm, however, is much more challenging because although particle updates can occur independently, they rely on a shared global state (the globally best particle). The difficulty of maintaining this global state can be seen in the large body of work studying the parallelization of PSO - almost uniformly these algorithms rely on a global synchronization step, which can result in idle cores and reduced parallel efficiency. In this work, we explore two techniques for implementing a fast, cache-aware parallel PSO algorithm: batching the creation of the random weights and reducing critical section contention via a relaxed consistency guarantee. Our technique shows impressive performance improvements over prior work, seeing more than 60% speed-up over naive parallelization and more than 10% speed-up over the cache-aware algorithm. This speed comes at a cost; while our method quickly reaches an approximate solution, it struggles in environments requiring a high level of resolution. Despite these trade-offs, our method is both easy to understand and implement and is widely transferable to other swarm intelligence algorithms.
引用
收藏
页码:1374 / 1381
页数:8
相关论文
共 50 条
  • [1] Cache-aware optimization of BAN applications
    Lei Ju
    Yun Liang
    Samarjit Chakraborty
    Tulika Mitra
    Abhik Roychoudhury
    Design Automation for Embedded Systems, 2009, 13 : 159 - 178
  • [2] Cache-aware optimization of BAN applications
    Ju, Lei
    Liang, Yun
    Chakraborty, Samarjit
    Mitra, Tulika
    Roychoudhury, Abhik
    DESIGN AUTOMATION FOR EMBEDDED SYSTEMS, 2009, 13 (03) : 159 - 178
  • [3] A Simple and Fast Particle Swarm Optimization
    Wang, Hui
    Wu, Zhijian
    Zeng, Sanyou
    Jiang, Dazhi
    Liu, Yong
    Wang, Jing
    Yang, Xianqiang
    JOURNAL OF MULTIPLE-VALUED LOGIC AND SOFT COMPUTING, 2010, 16 (06) : 611 - 629
  • [4] An overview of cache optimization techniques and cache-aware numerical algorithms
    Kowarschik, M
    Weiss, C
    ALGORITHMS FOR MEMORY HIERARCHIES: ADVANCED LECTURES, 2003, 2625 : 213 - 232
  • [5] Cache-Aware Dynamic Skewed Tree for Fast Memory Authentication
    Vig, Saru
    Lam, Siew-Kei
    Juneja, Rohan
    2021 26TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2021, : 402 - 407
  • [6] CareDedup: Cache-aware Deduplication for Reading Performance Optimization in Primary Storage
    Lin, Bin
    Li, Shanshan
    Liao, Xiangke
    Liu, Xiaodong
    Zhang, Jing
    Jia, Zhouyang
    2016 IEEE FIRST INTERNATIONAL CONFERENCE ON DATA SCIENCE IN CYBERSPACE (DSC 2016), 2016, : 1 - 10
  • [7] A Rapid Cache-aware Procedure Positioning Optimization to Favor Incremental Development
    Mezzetti, Enrico
    Vardanega, Tullio
    2013 IEEE 19TH REAL-TIME AND EMBEDDED TECHNOLOGY AND APPLICATIONS SYMPOSIUM (RTAS), 2013, : 107 - 116
  • [8] Multiple query optimization by cache-aware middleware using query teamwork
    O'Gorman, K
    Agrawal, D
    El Abbadi, A
    18TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2002, : 274 - 274
  • [9] Cache-Aware Query Optimization in Multiapplication Sharing Wireless Sensor Networks
    Zhou, Zhangbing
    Zhao, Deng
    Hancke, Gerhard
    Shu, Lei
    Sun, Yunchuan
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2018, 48 (03): : 401 - 417
  • [10] A fast, cache-aware algorithm for the calculation of radiological paths exploiting subword parallelism
    Christiaens, M
    De Sutter, B
    De Bosschere, K
    Van Campenhout, J
    Lemahieu, I
    JOURNAL OF SYSTEMS ARCHITECTURE, 1999, 45 (10) : 781 - 790