Scalable Power Control for Many-Core Architectures Running Multi-threaded Applications

被引:0
|
作者
Ma, Kai [1 ]
Li, Xue [1 ]
Chen, Ming [1 ]
Wang, Xiaorui [1 ]
机构
[1] Univ Tennessee, Dept Elect Engn & Comp Sci, Knoxville, TN 37996 USA
关键词
Chip multiprocessor; many-core architecture; power control; power capping; thread criticality; scalability; control theory;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Optimizing the performance of a multi-core microprocessor within a power budget has recently received a lot of attention. However, most existing solutions are centralized and cannot scale well with the rapidly increasing level of core integration. While a few recent studies propose power control algorithms for many-core architectures, those solutions assume that the workload of every core is independent and therefore cannot effectively allocate power based on thread criticality to accelerate multi-threaded parallel applications, which are expected to be the primary workloads of many-core architectures. This paper presents a scalable power control solution for many-core microprocessors that is specifically designed to handle realistic workloads, i.e., a mixed group of single-threaded and multi-threaded applications. Our solution features a three-layer design. First, we adopt control theory to precisely control the power of the entire chip to its chip-level budget by adjusting the aggregated frequency of all the cores on the chip. Second, we dynamically group cores running the same applications and then partition the chip-level aggregated frequency quota among different groups for optimized overall microprocessor performance. Finally, we partition the group-level frequency quota among the cores in each group based on the measured thread criticality for shorter application completion time. As a result, our solution can optimize the microprocessor performance while precisely limiting the chip-level power consumption below the desired budget. Empirical results on a 12-core hardware testbed show that our control solution can provide precise power control, as well as 17% and 11% better application performance than two state-of-the-art solutions, on average, for mixed PARSEC and SPEC benchmarks. Furthermore, our extensive simulation results for 32, 64, and 128 cores, as well as overhead analysis for up to 4,096 cores, demonstrate that our solution is highly scalable to many-core architectures.
引用
收藏
页码:449 / 460
页数:12
相关论文
共 50 条
  • [31] RTL Test Generation on Multi-Core and Many-Core Architectures
    Varadarajan, Aravind Krishnan
    Hsiao, Michael S.
    2019 32ND INTERNATIONAL CONFERENCE ON VLSI DESIGN AND 2019 18TH INTERNATIONAL CONFERENCE ON EMBEDDED SYSTEMS (VLSID), 2019, : 100 - 105
  • [32] Parallel HEVC Decoding on Multi- and Many-core Architectures A Power and Performance Analysis
    Chi, Chi Ching
    Alvarez-Mesa, Mauricio
    Lucas, Jan
    Juurlink, Ben
    Schierl, Thomas
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2013, 71 (03): : 247 - 260
  • [33] Designing Applications for Heterogeneous Many-Core Architectures with the FlexTiles Platform
    Janssen, Benedikt
    Schwiegelshohn, Fynn
    Koedam, Martijn
    Duhem, Francois
    Masing, Leonard
    Werner, Stephan
    Huriaux, Christophe
    Courtay, Antoine
    Wheatley, Emilie
    Goossens, Kees
    Lemonnier, Fabrice
    Millet, Philippe
    Becker, Juergen
    Sentieys, Olivier
    Huebner, Michael
    PROCEEDINGS INTERNATIONAL CONFERENCE ON EMBEDDED COMPUTER SYSTEMS - ARCHITECTURES, MODELING AND SIMULATION (SAMOS XV), 2015, : 254 - 261
  • [34] Methodologies for the WCET Analysis of Parallel Applications on Many-core Architectures
    Nelis, Vincent
    Yomsi, Patrick Meumeu
    Pinho, Luis Miguel
    2015 EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN (DSD), 2015, : 748 - 755
  • [35] On the Acceleration of Wavefront Applications using Distributed Many-Core Architectures
    Pennycook, S. J.
    Hammond, S. D.
    Mudalige, G. R.
    Wright, S. A.
    Jarvis, S. A.
    COMPUTER JOURNAL, 2012, 55 (02): : 138 - 153
  • [36] Emerging Applications for Multi/Many-Core Processors
    Lee, Victor W.
    Chen, Yen-Kuang
    Debuy, Pradeep
    2011 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2011, : 1524 - 1527
  • [37] Efficient String Sorting on Multi- and Many-Core Architectures
    Drozd, Aleksandr
    Pericas, Miquel
    Matsuoka, Satoshi
    2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 637 - 644
  • [38] TokenSmart: Distributed, Scalable Power Management in the Many-core Era
    Shah, Parth
    Shenoy, Ranjal Gautham
    Srinivasan, Vaidyanathan
    Bose, Pradip
    Buyuktosunoglu, Alper
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2022, 20 (01)
  • [39] TokenSmart: Distributed, Scalable Power Management in the Many-Core Era
    Shah, Parth
    Shenoy, Ranjal Gautham
    Srinivasan, Vaidyanathan
    Bose, Pradip
    Buyuktosunoglu, Alper
    IEEE COMPUTER ARCHITECTURE LETTERS, 2021, 20 (01) : 42 - 45
  • [40] A polyphase filter for many-core architectures
    Adamek, K.
    Novotny, J.
    Armour, W.
    ASTRONOMY AND COMPUTING, 2016, 16 : 1 - 16