LIME: A Framework for Debugging Load Imbalance in Multi-threaded Execution

Authors
Oh, Jungju [1]
Hughes, Christopher J. [2]
Venkataramani, Guru [3]
Prvulovic, Milos [1]
Affiliations
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
[2] Intel Corp, Santa Clara, CA 95054 USA
[3] George Washington Univ, Washington, DC 20052 USA
Funding
National Science Foundation (USA)
Keywords
Load imbalance; Performance debugging; Parallel section
DOI
Not available
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification codes
081202; 0835
Abstract
With the ubiquity of multi-core processors, software must make effective use of multiple cores to obtain good performance on modern hardware. One of the biggest roadblocks to this is load imbalance, or the uneven distribution of work across cores. We propose LIME, a framework for analyzing parallel programs and reporting the cause of load imbalance in application source code. This framework uses statistical techniques to pinpoint load imbalance problems stemming from both control flow issues (e.g., unequal iteration counts) and interactions between the application and hardware (e.g., unequal cache miss counts). We evaluate LIME on applications from widely used parallel benchmark suites, and show that LIME accurately reports the causes of load imbalance, their nature and origin in the code, and their relative importance.
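As a rough illustration of the kind of statistical reasoning the abstract describes, the sketch below measures how uneven per-thread execution times are and ranks candidate causes by how strongly their per-thread counts correlate with those times. This is not LIME's actual algorithm, which the abstract does not specify; the function names (`imbalance`, `pearson`, `rank_causes`), the event names, and the sample numbers are all hypothetical.

```python
from statistics import mean

def imbalance(times):
    """Extra time the slowest thread needs, as a fraction of the mean thread time."""
    return max(times) / mean(times) - 1.0

def pearson(xs, ys):
    """Pearson correlation coefficient; returns 0.0 if either series is constant."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    if sx == 0 or sy == 0:
        return 0.0
    return cov / (sx * sy)

def rank_causes(times, events):
    """Sort candidate events by |correlation| with per-thread time, strongest first."""
    scores = {name: pearson(counts, times) for name, counts in events.items()}
    return sorted(scores.items(), key=lambda kv: abs(kv[1]), reverse=True)

# Hypothetical measurements for one parallel section run on four threads.
times = [10.0, 12.0, 20.0, 11.0]              # per-thread execution time
events = {
    "loop_iterations": [100, 120, 200, 110],  # control-flow event count
    "cache_misses": [50, 48, 52, 49],         # hardware event count
}

print(f"imbalance: {imbalance(times):.2f}")
for name, r in rank_causes(times, events):
    print(f"{name}: r = {r:+.2f}")
```

In this made-up data the iteration counts track the thread times exactly, so they rank ahead of the cache miss counts. A simple correlation like this only suggests where to look; it does not establish the cause.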
Pages: 201-210
Page count: 10