Improving Application Performance by Efficiently Utilizing Heterogeneous Many-core Platforms

被引:1
|
作者
Shen, Jie [1 ]
Varbanescu, Ana Lucia [2 ]
Sips, Henk [1 ]
机构
[1] Delft Univ Technol, Parallel & Distributed Syst Grp, NL-2600 AA Delft, Netherlands
[2] Univ Amsterdam, Inst Informat, NL-1012 WX Amsterdam, Netherlands
关键词
Heterogeneous platforms; Workload partitioning; Hardware configuration; Multi-core CPUs; GPUs; Accelerators; OPENCL;
D O I
10.1109/CCGrid.2015.44
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Heterogeneous platforms integrating different types of processing units (such as multi-core CPUs and GPUs) are in high demand in high performance computing. Existing studies have shown that using heterogeneous platforms can improve application performance and hardware utilization. However, systematic methods to design, implement, and map applications to efficiently use heterogeneous computing resources are only very few. The goal of my PhD research is therefore to study such heterogeneous systems and propose systematic methods to allow many (classes of) applications to efficiently use them. After 3.5 years of PhD study, my contributions are (1) a thorough evaluation of a suitable programming model for heterogeneous computing; (2) a workload partitioning framework to accelerate parallel applications on heterogeneous platforms; (3) a modeling-based prediction method to determine the optimal workload partitioning; (4) a systematic approach to decide the best mapping between the application and the platform by choosing the best performing hardware configuration (Only-CPU, Only-GPU, or CPU+GPU with the workload partitioning). In the near future, I plan to apply my approach to large-scale applications and platforms to expand its usability and applicability.
引用
收藏
页码:709 / 712
页数:4
相关论文
共 50 条
  • [31] Workshop on software and hardware challenges of many-core platforms (SHCMP)
    Cohen, Albert
    Tian, Xinmin
    Chen, Wenguang
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2011, 6760 LNCS
  • [32] A Framework to Schedule Parametric Dataflow Applications on Many-Core Platforms
    Bebelis, Vagelis
    Fradet, Pascal
    Girault, Alain
    ACM SIGPLAN NOTICES, 2014, 49 (05) : 125 - 134
  • [33] A RTRM proposal for multi/many-core platforms and reconfigurable applications
    Bellasi, Patrick
    Massari, Giuseppe
    Fornaciari, William
    2012 7TH INTERNATIONAL WORKSHOP ON RECONFIGURABLE AND COMMUNICATION-CENTRIC SYSTEMS-ON-CHIP (RECOSOC), 2012,
  • [34] Optimizing Streaming Parallelism on Heterogeneous Many-Core Architectures
    Zhang, Peng
    Fang, Jianbin
    Yang, Canqun
    Huang, Chun
    Tang, Tao
    Wang, Zheng
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (08) : 1878 - 1896
  • [35] Parallelizing Compilation Framework for Heterogeneous Many-core Processors
    Li Y.-B.
    Zhao R.-C.
    Han L.
    Zhao J.
    Xu J.-L.
    Li Y.-Y.
    Ruan Jian Xue Bao/Journal of Software, 2019, 30 (04): : 981 - 1001
  • [36] Power-Aware Performance Adaptation of Concurrent Applications in Heterogeneous Many-Core Systems
    Aalsaud, Ali
    Shafik, Rishad
    Rafiev, Ashur
    Xia, Fei
    Yang, Sheng
    Yakovlev, Alex
    ISLPED '16: PROCEEDINGS OF THE 2016 INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, 2016, : 368 - 373
  • [37] Application Suitability Assessment for Many-Core Targets
    Newburn, Chris J.
    Sukha, Jim
    Sharapov, Ilya
    Nguyen, Anthony D.
    Miao, Chyi-Chang
    HIGH PERFORMANCE COMPUTING, ISC HIGH PERFORMANCE 2016 INTERNATIONAL WORKSHOPS, 2016, 9945 : 319 - 338
  • [38] Performance Evaluation of OpenFOAM on Many-Core Architectures
    Brzobohaty, Tomas
    Riha, Lubomir
    Karasek, Tomas
    Kozubek, Tomas
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE OF NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2014 (ICNAAM-2014), 2015, 1648
  • [39] Performance of a Hardware Scheduler for Many-Core Architecture
    Avron, Itai
    Ginosar, Ran
    2012 IEEE 14TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2012 IEEE 9TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (HPCC-ICESS), 2012, : 151 - 160
  • [40] A HLS-based toolflow to design next-generation heterogeneous many-core platforms with shared memory
    Burgio, Paolo
    Marongiu, Andrea
    Coussy, Philippe
    Benini, Luca
    2014 12TH IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (EUC 2014), 2014, : 130 - 137