Global optimal partitioning of parallel loops for minimal data movement in limited memory embedded systems

Cited by: 0
Authors
Lin, J [1]
Lin, XL [1]
Affiliations
[1] Univ Minnesota, Dept Comp Sci & Engn, Minneapolis, MN 55455 USA
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Embedded systems often have limited memory, yet many applications that run on them are memory-intensive. Reducing the overhead of data movement between global memory and distributed local memory in such a system is critical to the performance of these applications. In this paper, we propose a unified theoretical framework for automatically partitioning parallel loops to optimize data movement on such systems. We first introduce the notion of data movement and build a simple but accurate data movement model to estimate the data movement overhead for the footprint. We then present an algorithm that derives an optimal loop partitioning to minimize the amount of data movement across the loop nests. We have implemented the framework in a parallel compiler on VE16, a commercial limited-memory embedded system, and the experimental results demonstrate the efficiency of the proposed method.
Pages: 3 - 9
Page count: 7
Related papers
50 in total
  • [41] An adaptive dynamic scheduling technique for parallel loops on shared memory multiprocessor systems
    Arafa, H
    Ammar, HH
    Osman, A
    PARALLEL AND DISTRIBUTED COMPUTING SYSTEMS, 2000, : 237 - 242
  • [42] Automatic partitioning technique for flash memory on Linux-based embedded systems
    Lim, Yunjae
    Nam, Young Jin
    Yoo, Geel-Sang
    Seo, Dae-Wha
    UBIQUITOUS INTELLIGENCE AND COMPUTING, PROCEEDINGS, 2007, 4611 : 93 - +
  • [43] Energy-Aware Scratch-Pad Memory Partitioning for Embedded Systems
    Balasa, Florin
    Abuaesh, Noha
    Gingu, Cristian V.
    Luican, Ilie I.
    Nasui, Doru V.
    PROCEEDINGS OF THE FIFTEENTH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2014), 2015, : 653 - +
  • [45] Global tiling for communication minimal parallelization on distributed memory systems
    Liu, Lei
    Chen, Li
    Wu, Cheng Yong
    Feng, Xiao-bing
    EURO-PAR 2008 PARALLEL PROCESSING, PROCEEDINGS, 2008, 5168 : 382 - 391
  • [46] Communication cost estimation and global data partitioning for distributed memory machines
    Prakash, SR
    Srikant, YN
    FOURTH INTERNATIONAL CONFERENCE ON HIGH-PERFORMANCE COMPUTING, PROCEEDINGS, 1997, : 480 - 485
  • [47] Optimizing Data Layout for Racetrack Memory in Embedded Systems
    Hui, Peng
    Sha, Edwin H. -M.
    Zhuge, Qingfeng
    Xu, Rui
    Wang, Han
    2023 28TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC, 2023, : 110 - 115
  • [48] Comparison of data partitioning schema of parallel pairwise alignment on shared memory system
    Akbar, Auriza Rahmad
    Sukoco, Heru
    Kusuma, Wisnu Ananta
    Telkomnika (Telecommunication Computing Electronics and Control), 2015, 13 (02) : 694 - 702
  • [49] Towards Optimal Matrix Partitioning for Data Parallel Computing on a Hybrid Heterogeneous Server
    Malik, Tania
    Lastovetsky, Alexey
    IEEE Access, 2021, 9 : 17229 - 17244
  • [50] Disturbance Aware Memory Partitioning for Parallel Data Access in STT-RAM
    Yin, Shouyi
    Xie, Zhicong
    Wei, Shaojun
    PROCEEDINGS OF THE 2017 54TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2017,