Automatic Parallelization of Array-oriented Programs for a Multi-core Machine

Authors
Wai-Mee Ching
Da Zheng
Affiliations
[1] Zhejiang Normal University, Department of Computer Science
[2] Johns Hopkins University, Department of Computer Science
Keywords
Parallel Machine; Parse Tree; Outer Product; Primitive Function; Good Speedup
Abstract
We present work on the automatic parallelization of array-oriented programs for multi-core machines. A parallelizing APL-to-C compiler translates source programs written in standard APL into parallelized C code, i.e., C mixed with OpenMP directives. We describe techniques, such as virtual operations and data partitioning, used to exploit the parallelism structured around array primitives. We present runtime performance data for several examples on a 4-core machine, showing the speedup of the resulting parallelized code for different numbers of threads and different problem sizes.
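To make "C mixed with OpenMP directives" concrete, here is a minimal hand-written sketch of how three array primitives (element-wise addition, plus-reduction, and outer product) might look once parallelized. It is not the output of the compiler described in the abstract: the function names `vec_add`, `vec_sum`, and `outer_product`, the row-major layout, and the static schedule are all assumptions made for illustration.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical illustration only; not the generated code of the
   APL-to-C compiler described in the abstract. */

/* Element-wise addition, analogous to the APL expression A + B.
   Iterations are independent, so a static schedule hands each thread
   one contiguous chunk of the index range (a simple data partition). */
void vec_add(const double *a, const double *b, double *z, size_t n)
{
    assert(a != NULL && b != NULL && z != NULL);
    #pragma omp parallel for schedule(static)
    for (long i = 0; i < (long)n; i++) {
        z[i] = a[i] + b[i];
    }
}

/* Plus-reduction, analogous to the APL expression +/A.
   Each thread accumulates a private partial sum; OpenMP combines
   the partial sums when the loop ends. */
double vec_sum(const double *a, size_t n)
{
    double sum = 0.0;
    assert(a != NULL);
    #pragma omp parallel for reduction(+:sum) schedule(static)
    for (long i = 0; i < (long)n; i++) {
        sum += a[i];
    }
    return sum;
}

/* Outer product with multiplication: z is an m-by-n row-major matrix
   with z[i*n + j] = a[i] * b[j]. Rows are partitioned across threads. */
void outer_product(const double *a, size_t m,
                   const double *b, size_t n, double *z)
{
    assert(a != NULL && b != NULL && z != NULL);
    #pragma omp parallel for schedule(static)
    for (long i = 0; i < (long)m; i++) {
        for (size_t j = 0; j < n; j++) {
            z[(size_t)i * n + j] = a[i] * b[j];
        }
    }
}
```

With GCC or Clang, the directives take effect when the file is compiled with `-fopenmp`; without that flag the pragmas are ignored and the same code runs sequentially, which makes it easy to compare timings for different thread counts via the `OMP_NUM_THREADS` environment variable.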
Pages: 514-531
Page count: 17
Related papers
50 items in total
  • [41] Performance optimisation of sequential programs on multi-core processors
    Tristram, Waide
    Bradshaw, Karen
    PROCEEDINGS OF THE SOUTH AFRICAN INSTITUTE FOR COMPUTER SCIENTISTS AND INFORMATION TECHNOLOGISTS CONFERENCE, 2012, : 119 - 128
  • [42] Proposal and Validation of an Adaptable Array for Multi-Core Processors
    Silva Junior, Francisco Carlos
    Silva, Ivan Saraiva
    PROCEEDINGS OF THE 2016 XLII LATIN AMERICAN COMPUTING CONFERENCE (CLEI), 2016,
  • [43] Suffix Array Performance Analysis for Multi-Core Platforms
    Gil-Costa, Veronica
    Ochoa, Cesar
    Printista, A. Marcela
    COMPUTACION Y SISTEMAS, 2013, 17 (03): : 391 - 399
  • [44] Parallelization Strategies and Performance Analysis of Media Mining Applications on Multi-Core Processors
    Li, Wenlong
    Tong, Xiaofeng
    Wang, Tao
    Zhang, Yimin
    Chen, Yen-Kuang
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2009, 57 (02): : 213 - 228
  • [45] Parallelization Strategies of the Canny Edge Detector for Multi-core CPUs and Many-core GPUs
    Ben Cheikh, Taieb Lamine
    Beltrame, Giovanni
    Nicolescu, Gabriela
    Cheriet, Farida
    Tahar, Sofiene
    2012 IEEE 10TH INTERNATIONAL NEW CIRCUITS AND SYSTEMS CONFERENCE (NEWCAS), 2012, : 49 - 52
  • [46] Parallelization and Performance Analysis of Video Feature Extractions on Multi-Core Based Systems
    Zhang, Qi
    Chen, Yurong
    Li, Jianguo
    Zhang, Yimin
    Xu, Yinlong
    2007 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS (ICPP), 2007, : 1 - +
  • [47] Hybrid Vectorization and Parallelization for Matrix-Matrix Multiplication on Multi-core Platform
    Keatkaew, Thanit
    Woradit, Kampol
    Champrasert, Paskom
    2022 37TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC 2022), 2022, : 286 - 289
  • [48] KPN-based parallelization of Wu–Manber algorithm on multi-core machines
    Monther Aldwairi
    Abdulmughni Y. Hamzah
    Moath Jarrah
    Cluster Computing, 2020, 23 : 725 - 734
  • [49] An Automatic Task Partition Method for Multi-core System
    Jing, Minge
    Huang, Yujie
    Fan, Yibo
    Xue, Xiaoyong
    Zeng, Xiaoyang
    Yu, Zhiyi
    2018 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2018,
  • [50] Parallelization Strategies for Fast Factorized Backprojection SAR on Embedded Multi-Core Architectures
    Wielage, M.
    Cholewa, F.
    Riggers, C.
    Pirsch, P.
    Blume, H.
    2017 IEEE INTERNATIONAL CONFERENCE ON MICROWAVES, ANTENNAS, COMMUNICATIONS AND ELECTRONIC SYSTEMS (COMCAS), 2017, : 234 - 239