Speedup Analysis of Data-parallel Applications on Multi-core NoCs

被引:5
|
作者
Chen, Xiaowen [1 ]
Lu, Zhonghai [2 ]
Jantsch, Axel [2 ]
Chen, Shuming [1 ]
机构
[1] Natl Univ Def Technol, Sch Comp Sci, Inst Microelect, Changsha 410073, Hunan, Peoples R China
[2] Royal Inst Technol, Dept Elect Comp & Software Syst, SE-10044 Stockholm, Sweden
关键词
speedup; communication; multi-core; NoC; AMDAHLS LAW;
D O I
10.1109/ASICON.2009.5351597
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As more computing cores are integrated onto a single chip, the effect of network communication latency is becoming more and more significant on Multi-core Network-on-Chips (NoCs). For data-parallel applications, we study the model of parallel speedup by including network communication latency in Amdahl's law. The speedup analysis considers the effect of network topology, network size, traffic model and computation/communication ratio. We also study the speedup efficiency. In our Multi-core NoC platform, a real data-parallel application, i.e. matrix multiplication, is used to validate the analysis. Our theoretical analysis and the application results show that the speedup improvement is nonlinear and the speedup efficiency decreases as the system size is scaled up. Such analysis can be used to guide architects and programmers to improve parallel processing efficiency by reducing network latency with optimized network design and increasing computation proportion in the program.
引用
收藏
页码:105 / +
页数:2
相关论文
共 50 条
  • [1] A branch-and-bound approach to scheduling of data-parallel tasks on multi-core architectures
    Liu, Yang
    Meng, Lin
    Taniguchi, Ittetsu
    Tomiyama, Hiroyuki
    INTERNATIONAL JOURNAL OF EMBEDDED SYSTEMS, 2020, 12 (01) : 125 - 135
  • [2] On the maturity of parallel applications for asymmetric multi-core processors
    Chronaki, Kallia
    Moreto, Miguel
    Casas, Marc
    Rico, Alejandro
    Badia, Rosa M.
    Ayguade, Eduard
    Valero, Mateo
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2019, 127 : 105 - 115
  • [3] Parallel Syntax Analysis on Multi-Core Machines
    Barve, Amit
    Joshi, Brijendra Kumar
    2014 INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2014, : 209 - 213
  • [4] Brief Announcement: Optimal Speedup on a Low-Degree Multi-Core Parallel Architecture (LoPRAM)
    Dorrigiv, Reza
    Lopez-Ortiz, Alejandro
    Salinger, Alejandro
    SPAA'08: PROCEEDINGS OF THE TWENTIETH ANNUAL SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES, 2008, : 185 - 187
  • [5] SDPA: An Optimizer for Program Analysis of Data-Parallel Applications
    Wang, Fei
    Shi, Xuanhua
    Yu, Dongxiao
    Ke, Zhixiang
    Jin, Hai
    Wu, Song
    IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2018, : 14 - 21
  • [6] Self-Recovering Parallel Applications in Multi-Core Systems
    Bizot, Gilles
    Avresky, Dimiter
    Chaix, Fabien
    Zergainoh, Nacer-Eddine
    Nicolaidis, Michael
    2011 10TH IEEE INTERNATIONAL SYMPOSIUM ON NETWORK COMPUTING AND APPLICATIONS (NCA), 2011,
  • [7] Efficient Parallel Execution of Streaming Applications on Multi-Core Processors
    Schuele, Tobias
    PROCEEDINGS OF THE 19TH INTERNATIONAL EUROMICRO CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING, 2011, : 231 - 238
  • [8] Parallel points-to analysis for multi-core machines
    School of Computer Science, Physics and Mathematics, Linnaeus University, 35195 Växjö, Sweden
    HiPEAC - Proc. Int. Conf. High Perform. Embedded Archit. Compilers, (45-54):
  • [9] Fast parallel lexical analysis on multi-core machines
    Barve A.
    Joshi B.K.
    Barve, Amit (barve.amit@gmail.com), 1600, Inderscience Enterprises Ltd., 29, route de Pre-Bois, Case Postale 856, CH-1215 Geneva 15, CH-1215, Switzerland (09): : 250 - 257
  • [10] A design methodology for data-parallel applications
    Nyland, LS
    Prins, JF
    Goldberg, A
    Mills, PH
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2000, 26 (04) : 293 - 314