G-Scalar: Cost-Effective Generalized Scalar Execution Architecture for Power-Efficient GPUs

被引:10
|
作者
Liu, Zhenhong [1 ]
Gilani, Syed [2 ]
Annavaram, Murali [3 ]
Kim, Nam Sung [1 ]
机构
[1] Univ Illinois, Champaign, IL 61820 USA
[2] AMD, Sunnyvale, CA USA
[3] Univ Southern Calif, Los Angeles, CA USA
关键词
D O I
10.1109/HPCA.2017.51
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The GPU has provide higher throughput by integrating more execution resources into a single chip without unduly compromising power efficiency. With the power wall challenge, however, increasing the throughput will require significant improvement in power efficiency. To accomplish this goal, we propose G-Scalar, a cost-effective generalized scalar execution architecture for GPUs in this paper. G-Scalar offers two key advantages over prior architectures supporting scalar execution for only non-divergent arithmetic/logic instructions. First, G-Scalar is more power-efficient as it can also support scalar execution of divergent and special-function instructions, the fraction of which in contemporary GPU applications has notably increased. Second, G-Scalar is less expensive as it can share most of its hardware resources with register value compression, of which adoption has been strongly promoted to reduce high power consumption of accessing the large register file. Compared with the baseline and previous scalar architectures, G-Scalar improves power efficiency by 24% and 15%, respectively, at a negligible cost.
引用
收藏
页码:601 / 612
页数:12
相关论文
共 48 条
  • [21] Power Efficient and Cost-Effective Solutions for Optical OFDM Systems Using Direct Detection
    Svaluto Moreolo, Michela
    [J]. 2010 12TH INTERNATIONAL CONFERENCE ON TRANSPARENT OPTICAL NETWORKS (ICTON), 2011,
  • [22] Control Design for Efficient and Cost-Effective Distributed Fuel Cell Power Electronics System
    Mazumder, Sudip K.
    [J]. IECON 2008: 34TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, VOLS 1-5, PROCEEDINGS, 2008, : 678 - 685
  • [23] Cost-effective low-power architecture of vestigial sideband W-CDMA system
    Kwon, S
    Kwak, J
    Roh, D
    Kim, D
    Lee, M
    [J]. PROCEEDINGS OF THE SECOND IEEE ASIA PACIFIC CONFERENCE ON ASICS, 2000, : 243 - 246
  • [24] Spectrally Efficient Fronthaul Architectures for a Cost-Effective 5G C-RAN
    Mello, D. A. A.
    Barreto, A. N.
    Barbosa, F. A.
    Osorio, C.
    Fiorani, M.
    Monti, P.
    [J]. 2016 18TH INTERNATIONAL CONFERENCE ON TRANSPARENT OPTICAL NETWORKS (ICTON), 2016,
  • [25] A Cost-effective and Energy-efficient Architecture for Die-stacked DRAM/NVM Memory Systems
    Guo, Yuhua
    Xiao, Weijun
    Liu, Qing
    He, Xubin
    [J]. 2018 IEEE 37TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2018,
  • [26] Constructing amidoxime-modified porous adsorbents with open architecture for cost-effective and efficient uranium extraction
    Li, Zhangnan
    Meng, Qinghao
    Yang, Yajie
    Zou, Xiaoqin
    Yuan, Ye
    Zhu, Guangshan
    [J]. CHEMICAL SCIENCE, 2020, 11 (18) : 4747 - 4752
  • [27] A Simple and Cost-Effective EPON-Based 4G Mobile Backhaul RAN Architecture
    Zaidi, S. R.
    Hussain, S.
    Hossain, A. S. M.
    Ellinas, G.
    Dorsinville, R.
    Ali, M. A.
    [J]. 2012 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2012,
  • [28] Area-Effective and Power-Efficient Fixed-Width Booth Multipliers Using Generalized Probabilistic Estimation Bias
    Chen, Yuan-Ho
    Li, Chung-Yi
    Chang, Tsin-Yuan
    [J]. IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2011, 1 (03) : 277 - 288
  • [29] Towards energy-efficient and cost-effective DC nanaogrid: A novel pseudo hierarchical architecture incorporating V2G technology for both autonomous coordination and regulated power dispatching
    Yu, Hang
    Shang, Yitong
    Niu, Songyan
    Cheng, Chong
    Shao, Ziyun
    Jian, Linni
    [J]. APPLIED ENERGY, 2022, 313
  • [30] Efficient Fast Transform Processor with Cost-Effective Hardware Sharing Architecture for Multi-Standard Video Encoding
    Chang, Chia-Wei
    Hsu, Shun-Ji
    Fan, Chih-Peng
    [J]. 2012 5TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), 2012, : 14 - 18