A Novel High Performance and Energy Efficient NUCA Architecture for STT-MRAM LLCs With Thermal Consideration

被引:6
|
作者
Wu, Bi [1 ,2 ]
Dai, Pengcheng [1 ,2 ]
Cheng, Yuanqing [2 ]
Wang, Ying [3 ]
Yang, Jianlei [1 ,4 ]
Wang, Zhaohao [1 ,2 ]
Liu, Dijun [5 ]
Zhao, Weisheng [1 ,2 ]
机构
[1] Beihang Univ, Fert Beijing Inst, BDBC, Beijing 100191, Peoples R China
[2] Beihang Univ, Sch Microelect, Beijing 100191, Peoples R China
[3] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China
[4] Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
[5] China Acad Informat & Commun Technol, Beijing 100191, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
System-on-chip; Computer architecture; Magnetic tunneling; Transistors; Switches; Thermal sensors; Organizations; Cache; data migration; low power; spin transfer torque magnetic memory (STT-MRAM); thermal gradient; MAGNETIC TUNNEL-JUNCTION; MEMORY; CACHE; MODEL;
D O I
10.1109/TCAD.2019.2897707
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As the speed gap of the modern processor and the off-chip main memory enlarges, on-chip cache capacity increases to sustain the performance scaling. As a result, the cache power occupies a large portion of the total power budget. Spin transfer torque magnetic memory (STT-MRAM) is proposed as a promising solution for the low power cache design due to its high integration density and ultralow leakage power. Nevertheless, the high write power and latency of STT-MRAM become new barriers for the commercialization of this emerging technology. In this paper, we investigate the thermal effect on the access performance of STT-MRAM, and observe that the temperature can affect the write delay and energy significantly. Then, we explore the nonuniform cache access (NUCA) design of the chip-multiprocessors with STT-MRAM-based last level cache (LLC). A thermal aware data migration policy, called "Thermosiphon," which takes advantage of the thermal property of STT-MRAM, is proposed to reduce the LLC write energy. This policy splits the LLC into different regions dynamically based on the thermal distribution monitored by thermal sensors available on-chip, and adaptively migrates write intensive data among different thermal regions considering the thermal gradient. Compared to the conventional NUCA design, our proposed design can save 41.2% write energy at most and 13.01% on average with negligible hardware overhead.
引用
收藏
页码:803 / 815
页数:13
相关论文
共 50 条
  • [41] Energy-Efficient STT-MRAM based Digital PIM supporting Vertical Computations Using Sense Amplifier
    Kim, Yeseul
    Park, Jongsun
    2022 19TH INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC), 2022, : 9 - 10
  • [42] Fully Functional Perpendicular STT-MRAM Macro Embedded in 40 nm Logic for Energy-efficient IOT Applications
    Lu, Yu
    Zhong, Tom
    Hsu, W.
    Kim, S.
    Lu, X.
    Kan, J. J.
    Park, C.
    Chen, W. C.
    Li, X.
    Zhu, X.
    Wang, P.
    Gottwald, M.
    Fatehi, J.
    Seward, L.
    Kim, J. P.
    Yu, N.
    Jan, G.
    Haq, J.
    Le, S.
    Wang, Y. J.
    Thomas, L.
    Zhu, J.
    Liu, H.
    Lee, Y. J.
    Tong, R. Y.
    Pi, K.
    Shen, D.
    He, R.
    Teng, Z.
    Lam, V.
    Annapragada, R.
    Torng, T.
    Wang, Po-Kang
    Kang, S. H.
    2015 IEEE INTERNATIONAL ELECTRON DEVICES MEETING (IEDM), 2015,
  • [43] TAM: A Computing in Memory based on Tandem Array within STT-MRAM for Energy-Efficient Analog MAC Operation
    Wang, Jinkai
    Gu, Zhengkun
    Wang, Hongyu
    Hao, Zuolei
    Zhang, Bojun
    Zhao, Weisheng
    Zhang, Yue
    2023 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2023,
  • [44] A Three-Terminal Dual-Pillar STT-MRAM for High-Performance Robust Memory Applications
    Mojumder, Niladri Narayan
    Gupta, Sumeet Kumar
    Choday, Sri Harsha
    Nikonov, Dmitri E.
    Roy, Kaushik
    IEEE TRANSACTIONS ON ELECTRON DEVICES, 2011, 58 (05) : 1508 - 1516
  • [45] Area and Energy Efficient Short-Circuit-Logic-Based STT-MRAM Crossbar Array for Binary Neural Networks
    Wang, Chao
    Wang, Zhaohao
    Zhang, Zhongkui
    Zhang, Youguang
    Zhao, Weisheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (03) : 1386 - 1390
  • [46] Low-power cache memory with state-of-the-art STT-MRAM for high-performance processors
    Takeda, Susumu
    Noguchi, Hiroki
    Nomura, Kumiko
    Fujita, Shinobu
    Miwa, Shinobu
    Arima, Eishi
    Nakada, Takashi
    Nakamura, Hiroshi
    2015 INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC), 2015, : 153 - 154
  • [47] Enablement of STT-MRAM as last level cache for the high performance computing domain at the 5nm node
    Sakhare, S.
    Perumkunnil, M.
    Bao, T. Huynh
    Rao, S.
    Kim, W.
    Crotti, D.
    Yasin, F.
    Couet, S.
    Swerts, J.
    Kundu, S.
    Yakimets, D.
    Baert, R.
    Oh, H. R.
    Spessot, A.
    Mocuta, A.
    Kar, G. Sankar
    Furnemont, A.
    2018 IEEE INTERNATIONAL ELECTRON DEVICES MEETING (IEDM), 2018,
  • [48] Top-pinned STT-MRAM devices with high thermal stability hybrid free layers for high density memory applications
    Liu, E.
    Swerts, J.
    Vaysset, A.
    Wu, Y.
    Couet, S.
    Mertens, S.
    Rao, S.
    Kim, W.
    Van Elshocht, S.
    De Boeck, J.
    Kar, G. S.
    2018 IEEE INTERNATIONAL MAGNETIC CONFERENCE (INTERMAG), 2018,
  • [49] STAIR: High Reliable STT-MRAM Aware Multi-Level I/O Cache Architecture by Adaptive ECC Allocation
    Hadizadeh, Mostafa
    Cheshmikhani, Elham
    Asadi, Hossein
    PROCEEDINGS OF THE 2020 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2020), 2020, : 1484 - 1489
  • [50] Normally-OFF STT-MRAM Cache with Zero-Byte Compression for Energy Efficient Last-Level Caches
    Oboril, Fabian
    Hameed, Fazal
    Bishnoi, Rajendra
    Ahari, Ali
    Naeimi, Helia
    Tahoori, Mehdi
    ISLPED '16: PROCEEDINGS OF THE 2016 INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, 2016, : 236 - 241