Full-System Simulation of Mobile CPU/GPU Platforms

被引:5
|
作者
Kaszyk, Kuba [1 ]
Wagstaff, Harry [1 ]
Spink, Tom [1 ]
Franke, Bjorn [1 ]
O'Boyle, Mike [1 ]
Bodint, Bruno [2 ]
Uhrenholt, Henrik [3 ]
机构
[1] Univ Edinburgh, Sch Informat, Edinburgh EH8 9AB, Midlothian, Scotland
[2] Natl Univ Singapore, Yale NUS Coll, Sch Comp, Singapore, Singapore
[3] Arm Sweden, Lund, Sweden
基金
英国工程与自然科学研究理事会;
关键词
Computer simulation;
D O I
10.1109/ISPASS.2019.00015
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Graphics Processing Units (GPUs) critically rely on a complex system software stack comprising kernel- and user-space drivers and Just-in-time (JIT) compilers. Yet, existing GPU simulators typically abstract away details of the software stack and GPU instruction set. Partly, this is because GPU vendors rarely release sufficient information about their latest GPU products. However, this is also due to the lack of an integrated CPU/GPU simulation framework, which is complete and powerful enough to drive the complex GPU software environment. This has led to a situation where research on GPU architectures and compilers is largely based on outdated or greatly simplified architectures and software stacks, undermining the validity of the generated results. In this paper we develop a full-system system simulation environment for a mobile platform, which enables users to run a complete and unmodified software stack for a state-of-the-art mobile Arm CPU and Mali-G71 GPU powered device. We validate our simulator against a hardware implementation and Arm's stand-alone GPU simulator, achieving 100% architectural accuracy across all available toolchains. We demonstrate the capability of our GPU simulation framework by optimizing an advanced Computer Vision application using simulated statistics unavailable with other simulation approaches or physical GPU implementations. We demonstrate that performance optimizations for desktop GPUs trigger bottlenecks on mobile GPUs, and show the importance of efficient memory use.
引用
收藏
页码:68 / 78
页数:11
相关论文
共 50 条
  • [21] Cycle-Accurate Full System Simulation for CPU plus GPU plus HBM Computing Platform
    Paik, Yoonah
    Han, Miseon
    Choi, Kyu Hyun
    Kim, Minseong
    Kim, Seon Wook
    2018 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2018, : 342 - 343
  • [22] A Loosely-Coupled Full-System Multicore Simulation Framework
    Zhang, Weihua
    Wang, Haojun
    Lu, Yunping
    Chen, Haibo
    Zhao, Wenyun
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, 27 (06) : 1566 - 1578
  • [23] System-Level Power & Energy Estimation Methodology and Optimization Techniques for CPU-GPU based Mobile Platforms
    Kumar Rethinagiri, Santhosh
    Palomar, Oscar
    Arias Moreno, Javier
    Yalcin, Gulay
    Unsal, Osman
    Cristal, Adrian
    2014 IEEE 12TH SYMPOSIUM ON EMBEDDED SYSTEMS FOR REAL-TIME MULTIMEDIA (ESTIMEDIA), 2014, : 118 - 127
  • [24] A Efficient Algorithm for Molecular Dynamics Simulation on Hybrid CPU-GPU Computing Platforms
    Li, Dapu
    Ai, Wei
    Ye, Yu
    Liang, Jie
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 1357 - 1363
  • [25] A Control-Theoretic Approach for Energy Efficient CPU-GPU Subsystem in Mobile Platforms
    Kadjo, David
    Ayoub, Raid
    Kishinevsky, Michael
    Gratz, Paul V.
    2015 52ND ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2015,
  • [26] Memory Hierarchy Characterization of NoSQL Applications through Full-System Simulation
    Colaso, Adrian
    Prieto, Pablo
    Angel Herrero, Jose
    Abad, Pablo
    Menezo, Lucia G.
    Puente, Valentin
    Angel Gregorio, Jose
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2018, 29 (05) : 1161 - 1173
  • [27] FAST modularization framework for wind turbine simulation: full-system linearization
    Jonkman, J. M.
    Jonkman, B. J.
    SCIENCE OF MAKING TORQUE FROM WIND (TORQUE 2016), 2016, 753
  • [28] Accelerating full-system simulation through characterizing and predicting operating system performance
    Kim, Seongbeom
    Liu, Fang
    Solihin, Yan
    Iyer, Ravi
    Zhao, Li
    Cohen, William
    ISPASS 2007: IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE, 2007, : 1 - +
  • [29] Comparison of CPU and GPU Platforms in Problems of Wave Diagnostics
    A. V. Goncharsky
    S. Y. Romanov
    S. Y. Seryozhnikov
    Lobachevskii Journal of Mathematics, 2021, 42 : 1504 - 1513
  • [30] Comparison of CPU and GPU Platforms in Problems of Wave Diagnostics
    Goncharsky, A., V
    Romanov, S. Y.
    Seryozhnikov, S. Y.
    LOBACHEVSKII JOURNAL OF MATHEMATICS, 2021, 42 (07) : 1504 - 1513