Skeletons and asynchronous RPC for embedded data and task parallel image processing

被引:2
|
作者
Caarls, Wouter [1 ]
Jonker, Pieter
Corporaal, Henk
机构
[1] Delft Univ Technol, Quantitat Imaging Grp, Delft, Netherlands
[2] Eindhoven Univ Technol, Fac Elect Engn, NL-5600 MB Eindhoven, Netherlands
来源
关键词
design space exploration; heterogeneous architectures; constrained architectures; algorithmic skeletons; remote procedure call; futures; run-time scheduling;
D O I
10.1093/ietisy/e89-d.7.2036
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Developing embedded parallel image processing applications is usually a very hardware-dependent process, often using the single instruction multiple data (SIMD) paradigm, and requiring deep knowledge of the processors used. Furthermore, the application is tailored to a specific hardware platform, and if the chosen hardware does not meet the requirements, it must be rewritten for a new platform. We have proposed the use of design space exploration [9] to find the most suitable hardware platform for a certain application. This requires a hardware-independent program, and we use algorithmic skeletons [5] to achieve this, while exploiting the data parallelism inherent to low-level image processing. However, since different operations run best on different kinds of processors, we need to exploit task parallelism as well. This paper describes how we exploit task parallelism using an asynchronous remote procedure call (RPC) system, optimized for low-memory and sparsely connected systems such as smart cameras. It uses a futures [16]-like model to present a normal imperative C-interface to the user in which the skeleton calls are implicitly parallelized and pipelined. Simulation provides the task dependency graph and performance numbers for the mapping, which can be done at run time to facilitate data dependent branching. The result is an easy to program, platform independent framework which shields the user from the parallel implementation and mapping of his application, while efficiently utilizing on-chip memory and interconnect bandwidth.
引用
收藏
页码:2036 / 2043
页数:8
相关论文
共 50 条
  • [1] A data and task parallel image processing environment
    Nicolescu, C
    Jonker, P
    [J]. PARALLEL COMPUTING, 2002, 28 (7-8) : 945 - 965
  • [2] Parallel embedded processor architecture for FPGA-based image processing using parallel software skeletons
    Hanen Chenini
    Jean Pierre Dérutin
    Romuald Aufrère
    Roland Chapuis
    [J]. EURASIP Journal on Advances in Signal Processing, 2013 (1)
  • [3] Skeletons for parallel image processing:: an overview of the SKIPPER project
    Sérot, J
    Ginhac, D
    [J]. PARALLEL COMPUTING, 2002, 28 (12) : 1685 - 1708
  • [4] An embedded parallel system for image processing
    Duan, Zongtao
    Duan, Zongyuan
    Feng, Xingle
    [J]. 2007 INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE & TECHNOLOGY, PROCEEDINGS, 2007, : 817 - 820
  • [5] Towards Fully User Transparent Task and Data Parallel Image Processing
    Lemeire, Jan
    Zhao, Yan
    Schelkens, Peter
    De Backer, Steve
    Cornelissen, Frans
    Torfs, Bert
    [J]. 2009 PROCEEDINGS OF 6TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS (ISPA 2009), 2009, : 626 - +
  • [6] A data and task parallel image processing environment for distributed memory systems
    Nicolescu, C
    Jonker, P
    [J]. INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS, PROCEEDINGS, 2001, : 39 - 44
  • [7] PARALLEL ASYNCHRONOUS SYSTEMS AND IMAGE-PROCESSING ALGORITHMS
    COON, DD
    PERERA, AGU
    [J]. VISUAL INFORMATION PROCESSING FOR TELEVISION AND TELEROBOTICS, 1989, 3053 : 191 - 201
  • [8] Hardware Implementation of Skeletonization Algorithm for Parallel Asynchronous Image Processing
    Lopich, Alexey
    Dudek, Piotr
    [J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2009, 56 (01): : 91 - 103
  • [9] Hardware Implementation of Skeletonization Algorithm for Parallel Asynchronous Image Processing
    Alexey Lopich
    Piotr Dudek
    [J]. Journal of Signal Processing Systems, 2009, 56 : 91 - 103
  • [10] A Parallel Language for Embedded Real Time Image Processing
    Duan Zongtao
    Zhang Yanni
    Duan Zongyuan
    [J]. 2009 INTERNATIONAL FORUM ON INFORMATION TECHNOLOGY AND APPLICATIONS, VOL 1, PROCEEDINGS, 2009, : 708 - +