Application Suitability Assessment for Many-Core Targets

Authors
Newburn, Chris J. [1]
Sukha, Jim [1]
Sharapov, Ilya [1]
Nguyen, Anthony D. [1]
Miao, Chyi-Chang [1]
Affiliation
[1] Intel Corp, Hudson, MA 01749 USA
Keywords
Many-core; Performance; Characterization; Code modernization
DOI
10.1007/978-3-319-46079-6_23
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Many-core hardware platforms offer a tremendous opportunity for scaling up performance, but not all codes that run on these platforms have been modernized sufficiently to fully utilize the hardware. Assessing whether a code will effectively utilize a given platform can be challenging, particularly for new or potential future platforms where native execution on real hardware is not possible. In this case, one typically relies on architecture simulators and other workload characterization tools, which are often not user-friendly for developers who want a quick initial assessment of an application's suitability for a many-core architecture. To help address this challenge, we present QMSprof, a tool and a set of analyses for an initial assessment of the suitability of a set of applications for a simulated extremely parallel many-core target. QMSprof automates the process of running a suite of workload binaries through the Intel(R) Software Development Emulator (SDE) and the Sniper multi-core simulator and extracting high-level summary statistics. The tool generates comparative plots summarizing key metrics across the workload suite, including the mix of vector and non-vector instructions, scalability with increasing thread count, memory bandwidth utilization, and statistics on cache misses and working set size. These summary metrics are designed to aid performance tuners in selecting promising codes for a many-core target and in pinpointing opportunities for additional tuning. To illustrate the utility of our tool, we also describe some sample results from characterizing applications on a hypothetical many-core architecture.
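As a rough illustration of two of the summary metrics named in the abstract (instruction mix and scalability with thread count), the following minimal Python sketch derives a vector-instruction fraction and a parallel efficiency from per-workload counts. It is not QMSprof code and does not reflect its input or output formats: the `WorkloadSample` type, its field names, and both helper functions are hypothetical, and any numbers passed in are assumed to come from some external profiling run.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class WorkloadSample:
    """Hypothetical per-workload record; not a QMSprof data structure."""
    name: str
    vector_instructions: int            # dynamic count of vector instructions
    total_instructions: int             # dynamic count of all instructions
    runtime_by_threads: Dict[int, float]  # thread count -> runtime in seconds


def vector_fraction(sample: WorkloadSample) -> float:
    """Share of dynamic instructions that are vector instructions."""
    if sample.total_instructions == 0:
        return 0.0
    return sample.vector_instructions / sample.total_instructions


def parallel_efficiency(sample: WorkloadSample) -> Dict[int, float]:
    """Speedup relative to the smallest thread count, divided by the
    relative increase in threads (1.0 means ideal scaling)."""
    base_threads = min(sample.runtime_by_threads)
    base_time = sample.runtime_by_threads[base_threads]
    efficiency = {}
    for threads, runtime in sorted(sample.runtime_by_threads.items()):
        speedup = base_time / runtime
        efficiency[threads] = speedup / (threads / base_threads)
    return efficiency
```

A workload with a low vector fraction or an efficiency that drops quickly as threads are added would, under these assumptions, be a candidate for further tuning before targeting a many-core platform.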
Pages: 319-338
Page count: 20