Profile-Guided Parallel Task Extraction and Execution for Domain Specific Heterogeneous SoC

被引:2
|
作者
Chang, Liangliang [1 ]
Mack, Joshua [2 ]
Willis, Benjamin [1 ]
Chen, Xing [1 ]
Brunhaver, John [1 ]
Akoglu, Ali [2 ]
Chakrabarti, Chaitali [1 ]
机构
[1] Arizona State Univ, Sch Elect Comp & Energy Engn, Tempe, AZ 85287 USA
[2] Univ Arizona, Elect & Comp Engn Dept, Tucson, AZ 85721 USA
关键词
Task-level parallelism; dynamic profiling; heterogeneous SoC and runtime; parallelism detection;
D O I
10.1109/ISPA-BDCloud-SocialCom-SustainCom57177.2022.00121
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, we introduce a methodology for automatically transforming user applications in the radar and communication domain written in C/C++ based on dynamic profiling to a parallel representation targeted for a heterogeneous SoC. We present our approach for instrumenting the user application binary during the compilation process with barrier synchronization primitives that enable runtime system schedule and execute independent tasks concurrently over the available compute resources. We demonstrate the capabilities of our integrated compile time and runtime flow through task-level parallel and functionally correct execution of real-life applications. We perform validation of our integrated system by executing four distinct applications each carrying various degrees of task level parallelism over the Xeon-based multi-core homogeneous processor. We use the proposed compilation and code transformation methodology to re-target each application for execution on a heterogeneous SoC composed of three ARM cores and one FFT accelerator that is emulated on the Xilinx Zynq UltraScale+ platform. We demonstrate our runtime's ability to process application binary, dispatch independent tasks over the available compute resources of the emulated SoC on the Zynq FPGA based on three different scheduling heuristics. Finally we demonstrate execution of each application individually with task level parallelism on the Zynq FPGA and execution of workload scenarios composed of multiple instances of the same application as well as mixture of two distinct applications to demonstrate ability to realize both application and task level parallel execution. Our integrated approach offers a path forward for application developers to take full advantage of the target SoC without requiring users to become hardware and parallel programming experts.
引用
收藏
页码:913 / 920
页数:8
相关论文
共 18 条
  • [1] Profile-Guided Application Partitioning for Heterogeneous Reconfigurable Platforms
    Ostadzadeh, S. Arash
    Meeuws, Roel
    Ashraf, Imran
    Galuzzi, Carlo
    Bertels, Koen
    2012 16TH CSI INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND DIGITAL SYSTEMS (CADS), 2012, : 37 - 43
  • [2] Building application-specific operating systems: a profile-guided approach
    Yuan, Pengfei
    Guo, Yao
    Zhang, Lu
    Chen, Xiangqun
    Mei, Hong
    SCIENCE CHINA-INFORMATION SCIENCES, 2018, 61 (09)
  • [3] Building application-specific operating systems:a profile-guided approach
    Pengfei YUAN
    Yao GUO
    Lu ZHANG
    Xiangqun CHEN
    Hong MEI
    ScienceChina(InformationSciences), 2018, 61 (09) : 21 - 37
  • [4] Building application-specific operating systems: a profile-guided approach
    Pengfei Yuan
    Yao Guo
    Lu Zhang
    Xiangqun Chen
    Hong Mei
    Science China Information Sciences, 2018, 61
  • [5] Towards Profile-Guided Optimization for Safe and Efficient Parallel Stream Processing in Rust
    Sydow, Stefan
    Nabelsee, Mohannad
    Glesner, Sabine
    Herber, Paula
    2020 IEEE 32ND INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD 2020), 2020, : 289 - 296
  • [6] Parallel program execution on a heterogeneous PC cluster using task duplication
    Kwok, Yu-Kwong
    Proceedings of the Heterogeneous Computing Workshop, HCW, 2000, : 364 - 374
  • [7] Redox-Magnetohydrodynamics, Flat Flow Profile-Guided Enzyme Assay Detection: Toward Multiple, Parallel Analyses
    Sahore, Vishal
    Fritsch, Ingrid
    ANALYTICAL CHEMISTRY, 2014, 86 (19) : 9405 - 9411
  • [8] IMPLEMENTING DOMAIN-SPECIFIC LANGUAGES FOR HETEROGENEOUS PARALLEL COMPUTING
    Lee, HyoukJoong
    Brown, Kevin J.
    Sujeeth, Arvind K.
    Chafi, Hassan
    Olukotun, Kunle
    Rompf, Tiark
    Odersky, Martin
    IEEE MICRO, 2011, 31 (05) : 42 - 52
  • [9] Code generation for energy-efficient execution of dynamic streaming task graphs on parallel and heterogeneous platforms
    Litzinger, Sebastian
    Keller, Joerg
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (02):
  • [10] QoS Management on Heterogeneous Architecture for Multiprogrammed, Parallel, and Domain-Specific Applications
    Zhang, Ying
    Zhao, Li
    Illikkal, Ramesh
    Iyer, Ravi
    Herdrich, Andrew
    Peng, Lu
    IEEE SYSTEMS JOURNAL, 2017, 11 (04): : 2096 - 2107