Parallelizing Sequential Applications on Commodity Hardware using a Low-cost Software Transactional Memory

被引:17
|
作者
Mehrara, Mojtaba [1 ]
Hao, Jeff [1 ]
Hsu, Po-Chun [1 ]
Mahlke, Scott [1 ]
机构
[1] Univ Michigan, Adv Comp Architecture Lab, Ann Arbor, MI 48109 USA
关键词
Languages; Algorithms; Design; Performance; Software transactional memory; Automatic parallelization; Thread-level speculation; Loop level parallelism; Profile-guided optimization;
D O I
10.1145/1543135.1542495
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Multicore designs have emerged as the mainstream design paradigm for the microprocessor industry. Unfortunately, providing multiple cores does not directly translate into performance for most applications. The industry has already fallen short of the decades-old performance trend of doubling performance every 18 months. An attractive approach for exploiting multiple cores is to rely on tools, both compilers and runtime optimizers, to automatically extract threads from sequential applications. However, despite decades of research on automatic parallelization, most techniques are only effective in the scientific and data parallel domains where array dominated codes can be precisely analyzed by the compiler. Thread-level speculation offers the opportunity to expand parallelization to general-purpose programs, but at the cost of expensive hardware support. In this paper, we focus on providing low-overhead software support for exploiting speculative parallelism. We propose STMlite, a light-weight software transactional memory model that is customized to facilitate profile-guided automatic loop parallelization. STMlite eliminates a considerable amount of checking and locking overhead in conventional software transactional memory models by decoupling the commit phase from main transaction execution. Further, strong atomicity requirements for generic transactional memories are unnecessary within a stylized automatic parallelization framework. STMlite enables sequential applications to extract meaningful performance gains on commodity multicore hardware.
引用
收藏
页码:166 / 176
页数:11
相关论文
共 50 条
  • [1] Parallelizing Sequential Applications on Commodity Hardware using a Low-cost Software Transactional Memory
    Mehrara, Mojtaba
    Hao, Jeff
    Hsu, Po-Chun
    Mahlke, Scott
    [J]. PLDI'09 PROCEEDINGS OF THE 2009 ACM SIGPLAN CONFERENCE ON PROGRAMMING LANGUAGE DESIGN AND IMPLEMENTATION, 2009, : 166 - 176
  • [2] QuakeTM: Parallelizing a Complex Sequential Application Using Transactional Memory
    Gajinov, Vladimir
    Zyulkyarov, Ferad
    Unsal, Osman S.
    Cristal, Adrian
    Ayguade, Eduard
    Harris, Tim
    Valero, Mateo
    [J]. ICS'09: PROCEEDINGS OF THE 2009 ACM SIGARCH INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, 2009, : 126 - 135
  • [3] Hardware Acceleration of Transactional Memory on Commodity Systems
    Casper, Jared
    Oguntebi, Tayo
    Hong, Sungpack
    Bronson, Nathan G.
    Kozyrakis, Christos
    Olukotun, Kunle
    [J]. ACM SIGPLAN NOTICES, 2011, 46 (03) : 27 - 38
  • [4] Virtues and Limitations of Commodity Hardware Transactional Memory
    Diegues, Nuno
    Romano, Paolo
    Rodrigues, Luis
    [J]. PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT'14), 2014, : 3 - 14
  • [5] SOHO SOFTWARE FOLLOWS LOW-COST HARDWARE
    OMALLEY, C
    [J]. BYTE, 1993, 18 (11): : 32 - 32
  • [6] Indoor Localisation System Based On Low-Cost Commodity Hardware
    Apetroaie-Cristea, Mihaela
    Johnston, Steven J.
    Scott, Mark
    Cox, Simon J.
    [J]. UBICOMP'16 ADJUNCT: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING, 2016, : 13 - 16
  • [7] Hardware Support for Low-Cost Memory Safety
    Boivie, Rick
    Saileshwar, Gururaj
    Chen, Tong
    Segal, Benjamin
    Buyuktosunoglu, Alper
    [J]. 51ST ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS - SUPPLEMENTAL VOL (DSN 2021), 2021, : 57 - 60
  • [8] TxRace: Efficient Data Race Detection Using Commodity Hardware Transactional Memory
    Zhang, Tong
    Lee, Dongyoon
    Jung, Changhee
    [J]. ACM SIGPLAN NOTICES, 2016, 51 (04) : 159 - 173
  • [9] Parallelizing Stochastic Gradient Descent with Hardware Transactional Memory for Matrix Factorization
    Wu, Zhenwei
    Luo, Yingqi
    Lu, Kai
    Wang, Xiaoping
    [J]. 2018 3RD INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS ENGINEERING (ICISE), 2018, : 118 - 121
  • [10] Transactional memory: The hardware-software interface
    McDonald, Austen
    Carlstrom, Brian D.
    Chung, JaeWoong
    Minh, Chi Cao
    Chafi, Hassan
    Kozyrakis, Christos
    Olukotun, Kunle
    [J]. IEEE MICRO, 2007, 27 (01) : 67 - 76