Quantifying the Performance Impact of Large Pages on In-Memory Big-Data Workloads

被引:0
|
作者
Park, Jinsu [1 ]
Han, Myeonggyun [1 ]
Baek, Woongki [1 ]
机构
[1] UNIST, Sch ECE, Ulsan, South Korea
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In-memory big-data processing is rapidly emerging as a promising solution for large-scale data analytics with high-performance and/or real-time requirements. In-memory big-data workloads are often hosted on servers that consist of a few multi-core CPUs and large physical memory, exhibiting the non-uniform memory access (NUMA) characteristics. While large pages are commonly known as an effective technique to reduce the performance overheads of virtual memory and widely supported across the modern hardware and system software stacks, relatively little work has been done to investigate their performance impact on in-memory big-data workloads hosted on NUMA systems. To bridge this gap, this work quantifies the performance impact of large pages on in-memory big-data workloads running on a large-scale NUMA system. Our experimental results show that large pages provide no or little performance gains over the 4KB pages when the in-memory big-data workloads process sufficiently large datasets. In addition, our experimental results show that large pages achieve higher performance gains as the dataset size of the in-memory big-data workloads decreases and the NUMA system scale increases. We also discuss the possible performance optimizations for large pages and estimate the potential performance improvements.
引用
收藏
页码:209 / 218
页数:10
相关论文
共 50 条
  • [21] High performance solutions for big-data GWAS
    Peise, Elmar
    Fabregat-Traver, Diego
    Bientinesi, Paolo
    [J]. PARALLEL COMPUTING, 2015, 42 : 75 - 87
  • [22] Optimizing Performance and Computing Resource Management of in-memory Big Data Analytics with Disaggregated Persistent Memory
    Chen, Shouwei
    Wang, Wensheng
    Wu, Xueyang
    Fan, Zhen
    Huang, Kunwu
    Zhuang, Peiyu
    Li, Yue
    Rodero, Ivan
    Parashar, Manish
    Weng, Dennis
    [J]. 2019 19TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2019, : 21 - 30
  • [23] Using In-Memory Analytics to Quickly Crunch Big Data
    Garber, Lee
    [J]. COMPUTER, 2012, 45 (10) : 16 - 18
  • [24] Impact of Wireless Technology on Future of Big-data Industry
    Ahmad, Majid
    Rashid, Tahir
    Mishra, Durgesh Kumar
    [J]. 2014 CONFERENCE ON IT IN BUSINESS, INDUSTRY AND GOVERNMENT (CSIBIG), 2014,
  • [25] Work in Progress - In-Memory Analysis for Healthcare Big Data
    Mian, Muaz
    Teredesai, Ankur
    Hazel, David
    Pokuri, Sreenivasulu
    Uppala, Krishna
    [J]. 2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 778 - +
  • [26] Impact of the Memory Controller on the Performance of Parallel Workloads
    Gomez Requena, Crispin
    [J]. EURO-PAR 2013: PARALLEL PROCESSING WORKSHOPS, 2014, 8374 : 423 - 432
  • [27] Memristor: The Enabler of Computation-in-Memory Architecture for Big-Data
    Hamdioui, Said
    Taouil, Mottaqiallah
    Hoang Anh Du Nguyen
    Haron, Adib
    Xie, Lei
    Bertels, Koen
    [J]. 2015 INTERNATIONAL CONFERENCE ON MEMRISTIVE SYSTEMS (MEMRISYS), 2015,
  • [28] Towards Automatic Memory Tuning for In-Memory Big Data Analytics in Clusters
    Koliopoulos, Aris-Kyriakos
    Yiapanis, Paraskevas
    Tekiner, Firat
    Nenadic, Goran
    Keane, John
    [J]. 2016 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2016, 2016, : 353 - 356
  • [29] Impacts of Big-Data Technologies in Enhancing CRM Performance
    Taleb, Nasser
    Salami, Mohammad
    Ali, Liaqat
    [J]. 2020 THE 6TH IEEE INTERNATIONAL CONFERENCE ON INFORMATION MANAGEMENT (ICIM 2020), 2020, : 257 - 263
  • [30] LocationSpark: A Distributed In-Memory Data Management System for Big Spatial Data
    Tang, Mingjie
    Yu, Yongyang
    Malluhi, Qutaibah M.
    Ouzzani, Mourad
    Aref, Walid G.
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 9 (13): : 1565 - 1568