Reducing Last Level Cache Pollution in NUMA Multicore Systems for Improving Cache Performance

被引:0
|
作者
An, Deukhyeon [1 ]
Kim, Jeehong [1 ]
Han, JungHyun [2 ]
Eom, Young Ik [1 ]
机构
[1] Sungkyunkwan Univ, Coll Informat & Commun Eng, 2066 Seobu Ro, Suwon 440746, Gyeong Gi Do, South Korea
[2] Korea Univ, Coll Informat & Commun, Seoul 136701, South Korea
基金
新加坡国家研究基金会;
关键词
Cache Pollution; Cache Performance; Last Level Cache; NUMA Scheduling; Task Characteristics; I/O Intensive Task;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Non-uniform memory architecture (NUMA) system has numerous nodes with shared last level cache (LLC). Their shared LLC has brought many benefits in the cache utilization. However, LLC can be seriously polluted by tasks that cause huge I/O traffic for a long time since inclusive cache architecture of LLC replaces valid cache line by back-invalidate. Many research on the page coloring, partitioning, and pollute buffer mechanism handled this cache pollution. But, there are no scheduling approaches considering I/O-intensive tasks in NUMA systems. To address the above problem, OS scheduling that reduces cache pollution is highly needed in NUMA systems. In this paper, we propose a software-based mechanism that reduces shared LLC miss in NUMA systems. Our mechanism includes I/O traffic measurement and devil conscious scheduling. The experimental results show that LLC miss rate can be reduced up to 37.6%, and our approach improves execution time to 1.48%.
引用
收藏
页码:272 / 282
页数:11
相关论文
共 50 条
  • [1] On Improving Efficiency and Utilization of Last Level Cache in Multicore Systems
    Zahid, Yumna
    Khurshid, Hina
    Memon, Zulfiqar A.
    [J]. INFORMATION TECHNOLOGY AND CONTROL, 2018, 47 (03): : 588 - 608
  • [2] SLAM: High Performance and Energy Efficient Hybrid Last Level Cache Architecture for Multicore Embedded Systems
    Bhosale, Swapnil
    Pasricha, Sudeep
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2019,
  • [3] Managing Shared Last-Level Cache in a Heterogeneous Multicore Processor
    Mekkat, Vineeth
    Holey, Anup
    Yew, Pen-Chung
    Zhai, Antonia
    [J]. 2013 22ND INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT), 2013, : 225 - 234
  • [4] Reducing cache pollution of prefetching in a small data cache
    Reungsang, P
    Park, SK
    Jeong, SW
    Roh, HL
    Lee, G
    [J]. 2001 INTERNATIONAL CONFERENCE ON COMPUTER DESIGN, ICCD 2001, PROCEEDINGS, 2001, : 530 - 533
  • [5] Combining Process-based Cache Partitioning and Pollute Region Isolation to Improve Shared Last Level Cache Utilization on Multicore Systems
    Huang, Tao
    Wang, Jing
    Guan, Xuetao
    Zhong, Qi
    Wang, Keyi
    [J]. 2013 12TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2013), 2013, : 1153 - 1160
  • [6] Memory Management in NUMA Multicore Systems: Trapped between Cache Contention and Interconnect Overhead
    Majo, Zoltan
    Gross, Thomas R.
    [J]. ACM SIGPLAN NOTICES, 2011, 46 (11) : 11 - 32
  • [7] Partitioning the Last Level Cache to Improve Performance
    Yu, Baozhong
    Qu, Zening
    Ma, Jianliang
    Chen, Tianzhou
    [J]. INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2012, 15 (11A): : 4525 - 4532
  • [8] Explicit Non-reusable Page Cache Management to Minimize Last Level Cache Pollution
    Kim, Jongwon
    Jeong, Jinkyu
    Kim, Hwanju
    Lee, Joonwon
    [J]. 2011 6TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCES AND CONVERGENCE INFORMATION TECHNOLOGY (ICCIT), 2012, : 567 - 570
  • [9] An Effective Measurement Technique of Level-2 Cache Performance for Multicore Embedded Systems
    Mridh, Muhammad F.
    Asaduzzaman, Abu
    Saha, Aloke K.
    [J]. 2013 INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV), 2013,
  • [10] Lightweight Dynamic Partitioning For Last Level Cache Of Multicore Processor On Real System
    Zhang, Ludan
    Liu, Yi
    Wang, Rui
    Qian, Depei
    [J]. 2012 13TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS, AND TECHNOLOGIES (PDCAT 2012), 2012, : 33 - 38