Speedup Analysis of Data-parallel Applications on Multi-core NoCs

被引：5

作者：

Chen, Xiaowen ^{[1
]}

Lu, Zhonghai ^{[2
]}

Jantsch, Axel ^{[2
]}

Chen, Shuming ^{[1
]}

机构：

[1] Natl Univ Def Technol, Sch Comp Sci, Inst Microelect, Changsha 410073, Hunan, Peoples R China

[2] Royal Inst Technol, Dept Elect Comp & Software Syst, SE-10044 Stockholm, Sweden

来源：

2009 IEEE 8TH INTERNATIONAL CONFERENCE ON ASIC, VOLS 1 AND 2, PROCEEDINGS | 2009年

关键词：

speedup; communication; multi-core; NoC; AMDAHLS LAW;

D O I：

10.1109/ASICON.2009.5351597

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

As more computing cores are integrated onto a single chip, the effect of network communication latency is becoming more and more significant on Multi-core Network-on-Chips (NoCs). For data-parallel applications, we study the model of parallel speedup by including network communication latency in Amdahl's law. The speedup analysis considers the effect of network topology, network size, traffic model and computation/communication ratio. We also study the speedup efficiency. In our Multi-core NoC platform, a real data-parallel application, i.e. matrix multiplication, is used to validate the analysis. Our theoretical analysis and the application results show that the speedup improvement is nonlinear and the speedup efficiency decreases as the system size is scaled up. Such analysis can be used to guide architects and programmers to improve parallel processing efficiency by reducing network latency with optimized network design and increasing computation proportion in the program.

引用

页码：105 / +

页数：2

共 50 条

[1] A branch-and-bound approach to scheduling of data-parallel tasks on multi-core architectures
Liu, Yang
Meng, Lin
Taniguchi, Ittetsu
Tomiyama, Hiroyuki
INTERNATIONAL JOURNAL OF EMBEDDED SYSTEMS, 2020, 12 (01) : 125 - 135
[2] On the maturity of parallel applications for asymmetric multi-core processors
Chronaki, Kallia
Moreto, Miguel
Casas, Marc
Rico, Alejandro
Badia, Rosa M.
Ayguade, Eduard
Valero, Mateo
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2019, 127 : 105 - 115
[3] Parallel Syntax Analysis on Multi-Core Machines
Barve, Amit
Joshi, Brijendra Kumar
2014 INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2014, : 209 - 213
[4] Brief Announcement: Optimal Speedup on a Low-Degree Multi-Core Parallel Architecture (LoPRAM)
Dorrigiv, Reza
Lopez-Ortiz, Alejandro
Salinger, Alejandro
SPAA'08: PROCEEDINGS OF THE TWENTIETH ANNUAL SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES, 2008, : 185 - 187
[5] SDPA: An Optimizer for Program Analysis of Data-Parallel Applications
Wang, Fei
Shi, Xuanhua
Yu, Dongxiao
Ke, Zhixiang
Jin, Hai
Wu, Song
IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2018, : 14 - 21
[6] Self-Recovering Parallel Applications in Multi-Core Systems
Bizot, Gilles
Avresky, Dimiter
Chaix, Fabien
Zergainoh, Nacer-Eddine
Nicolaidis, Michael
2011 10TH IEEE INTERNATIONAL SYMPOSIUM ON NETWORK COMPUTING AND APPLICATIONS (NCA), 2011,
[7] Efficient Parallel Execution of Streaming Applications on Multi-Core Processors
Schuele, Tobias
PROCEEDINGS OF THE 19TH INTERNATIONAL EUROMICRO CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING, 2011, : 231 - 238
[8] Parallel points-to analysis for multi-core machines
School of Computer Science, Physics and Mathematics, Linnaeus University, 35195 Växjö, Sweden
HiPEAC - Proc. Int. Conf. High Perform. Embedded Archit. Compilers, (45-54):
[9] Fast parallel lexical analysis on multi-core machines
Barve A.
Joshi B.K.
Barve, Amit (barve.amit@gmail.com), 1600, Inderscience Enterprises Ltd., 29, route de Pre-Bois, Case Postale 856, CH-1215 Geneva 15, CH-1215, Switzerland (09): : 250 - 257
[10] A design methodology for data-parallel applications
Nyland, LS
Prins, JF
Goldberg, A
Mills, PH
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2000, 26 (04) : 293 - 314

← 1 2 3 4 5 →