OpenMDSP: Extending OpenMP to Program Multi-Core DSPs

Cited by: 0
Authors
Jiang-Zhou He
Wen-Guang Chen
Guang-Ri Chen
Wei-Min Zheng
Zhi-Zhong Tang
Han-Dong Ye
Affiliations
[1] Tsinghua University, Department of Computer Science and Technology
[2] Huawei Technologies Co. Ltd.
Keywords
OpenMP; multi-core digital signal processor; data parallelism; Long Term Evolution
DOI
Not available
Abstract
Multi-core digital signal processors (DSPs) are widely used in wireless telecommunication, core network transcoding, industrial control, and audio/video processing, among other fields. Compared with general-purpose multi-processors, multi-core DSPs normally have a more complex memory hierarchy, including on-chip core-local memory and non-cache-coherent shared memory. As a result, efficient multi-core DSP applications are very difficult to write. The current approach to programming multi-core DSPs relies on proprietary vendor software development kits (SDKs), which provide only low-level, non-portable primitives. While these SDKs are acceptable for writing coarse-grained task-level parallel code, writing fine-grained data-parallel code with them is tedious and error-prone. We believe that a high-level, portable parallel programming model for multi-core DSPs is desirable. In this paper, we propose OpenMDSP, an extension of OpenMP designed for multi-core DSPs. The goal of OpenMDSP is to fill the gap between the OpenMP memory model and the memory hierarchy of multi-core DSPs. We propose three classes of directives in OpenMDSP: 1) data placement directives, which let programmers conveniently control the placement of global variables; 2) distributed array directives, which divide a whole array into sections and promote the sections into core-local memory to improve performance; and 3) stream access directives, which promote big arrays into core-local memory section by section during parallel loop processing while hiding the latency of data movement by using the direct memory access (DMA) of a DSP. We implement the compiler and runtime system for OpenMDSP on the Freescale MSC8156. The benchmarking results show that seven of nine benchmarks achieve a speedup of more than a factor of 5 when using six threads.
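The stream access idea in the abstract (moving a big array into core-local memory section by section while the next section is already being fetched) can be illustrated with a plain C double-buffering sketch. This is not OpenMDSP syntax and not the authors' runtime: the abstract does not give the directive spelling, so none is shown. The names `SECTION_LEN`, `fetch_section`, and `stream_sum_squares` are hypothetical, and `memcpy` is only a stand-in for a DMA transfer. The code shows the work one core might do on its share of a loop.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical section size: elements that fit in one core-local buffer. */
#define SECTION_LEN 64

/* Stand-in for a DMA transfer from shared memory into core-local memory.
 * Assumption: a real DSP would start the transfer asynchronously and return
 * immediately; memcpy is synchronous, so this sketch shows only the buffer
 * rotation, not real overlap of transfer and computation. */
static void fetch_section(int *local, const int *shared, size_t count)
{
    memcpy(local, shared, count * sizeof *local);
}

/* Sum of squares over a large array, streamed through two local buffers. */
long stream_sum_squares(const int *big, size_t n)
{
    int buf[2][SECTION_LEN];   /* the two "core-local" buffers */
    size_t sections = (n + SECTION_LEN - 1) / SECTION_LEN;
    long total = 0;
    int cur = 0;               /* index of the buffer being computed on */

    assert(big != NULL || n == 0);
    if (sections == 0)
        return 0;

    /* Prime the pipeline with the first section. */
    fetch_section(buf[cur], big, n < SECTION_LEN ? n : SECTION_LEN);

    for (size_t s = 0; s < sections; s++) {
        size_t start = s * SECTION_LEN;
        size_t count = (n - start < SECTION_LEN) ? n - start : SECTION_LEN;

        /* Request the next section into the idle buffer before computing. */
        if (s + 1 < sections) {
            size_t nstart = start + SECTION_LEN;
            size_t ncount = (n - nstart < SECTION_LEN) ? n - nstart : SECTION_LEN;
            fetch_section(buf[1 - cur], big + nstart, ncount);
        }

        /* Compute on the section that is already local. */
        for (size_t i = 0; i < count; i++)
            total += (long)buf[cur][i] * buf[cur][i];

        /* With a real asynchronous transfer, the core would wait here for
         * the pending transfer to finish before swapping buffers. */
        cur = 1 - cur;
    }
    return total;
}
```

Calling `stream_sum_squares(data, n)` returns the same value as a direct loop over `data`; the only difference is that every element is read from one of the two small buffers rather than from the large array. Two buffers are the minimum needed so that one can be filled while the other is in use.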
Pages: 316 - 331
Number of pages: 15
Related papers
  • [1] OpenMDSP: Extending OpenMP to Program Multi-Core DSPs
    何江舟
    陈文光
    陈光日
    郑纬民
    汤志忠
    叶寒栋
    [J]. Journal of Computer Science & Technology, 2014, 29 (02) : 316 - 331
  • [2] OpenMDSP: Extending OpenMP to Program Multi-Core DSPs
    He, Jiang-Zhou
    Chen, Wen-Guang
    Chen, Guang-Ri
    Zheng, Wei-Min
    Tang, Zhi-Zhong
    Ye, Han-Dong
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2014, 29 (02) : 316 - 331
  • [3] Extending OpenMP to Survive the Heterogeneous Multi-Core Era
    Eduard Ayguadé
    Rosa M. Badia
    Pieter Bellens
    Daniel Cabrera
    Alejandro Duran
    Roger Ferrer
    Marc Gonzàlez
    Francisco Igual
    Daniel Jiménez-González
    Jesús Labarta
    Luis Martinell
    Xavier Martorell
    Rafael Mayo
    Josep M. Pérez
    Judit Planas
    Enrique S. Quintana-Ortí
    [J]. International Journal of Parallel Programming, 2010, 38 : 440 - 459
  • [4] Extending OpenMP to Survive the Heterogeneous Multi-Core Era
    Ayguade, Eduard
    Badia, Rosa M.
    Bellens, Pieter
    Cabrera, Daniel
    Duran, Alejandro
    Ferrer, Roger
    Gonzalez, Marc
    Igual, Francisco
    Jimenez-Gonzalez, Daniel
    Labarta, Jesus
    Martinell, Luis
    Martorell, Xavier
    Mayo, Rafael
    Perez, Josep M.
    Planas, Judit
    Quintana-Orti, Enrique S.
    [J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2010, 38 (5-6) : 440 - 459
  • [5] Extending Synchronization Constructs in OpenMP to Exploit Pipeline Parallelism on Heterogeneous Multi-core
    Li, Shigang
    Yao, Shucai
    He, Haohu
    Sun, Lili
    Chen, Yi
    Peng, Yunfeng
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, PT II, 2011, 7017 : 54 - 63
  • [6] Optimizing Pointwise Convolutions on Multi-core DSPs
    Wang, Yang
    Wang, Qinglin
    Pei, Xiangdong
    Mei, Songzhu
    Liu, Jie
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2023, PT VII, 2024, 14493 : 209 - 223
  • [7] The Future of OpenMP in the Multi-Core Era
    Chapman, Barbara
    Mey, Dieter An
    [J]. PARALLEL COMPUTING: ARCHITECTURES, ALGORITHMS AND APPLICATIONS, 2008, 15 : 571 - +
  • [8] High performance dilated convolutions on multi-core DSPs
    Yang Wang
    Qinglin Wang
    Xiangdong Pei
    Songzhu Mei
    Rongchun Li
    Jie Liu
    [J]. CCF Transactions on High Performance Computing, 2024, 6 : 78 - 93
  • [9] High performance dilated convolutions on multi-core DSPs
    Wang, Yang
    Wang, Qinglin
    Pei, Xiangdong
    Mei, Songzhu
    Li, Rongchun
    Liu, Jie
    [J]. CCF TRANSACTIONS ON HIGH PERFORMANCE COMPUTING, 2024, 6 (01) : 78 - 93
  • [10] thSORT: an efficient parallel sorting algorithm on multi-core DSPs
    Yang, Mouzhi
    Zhang, Peng
    Fang, Jianbin
    Liu, Weifeng
    Huang, Chun
    [J]. CCF TRANSACTIONS ON HIGH PERFORMANCE COMPUTING, 2024, 6 (05) : 503 - 518