Quantifying the Performance Impact of Large Pages on In-Memory Big-Data Workloads

被引:0
|
作者
Park, Jinsu [1 ]
Han, Myeonggyun [1 ]
Baek, Woongki [1 ]
机构
[1] UNIST, Sch ECE, Ulsan, South Korea
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In-memory big-data processing is rapidly emerging as a promising solution for large-scale data analytics with high-performance and/or real-time requirements. In-memory big-data workloads are often hosted on servers that consist of a few multi-core CPUs and large physical memory, exhibiting the non-uniform memory access (NUMA) characteristics. While large pages are commonly known as an effective technique to reduce the performance overheads of virtual memory and widely supported across the modern hardware and system software stacks, relatively little work has been done to investigate their performance impact on in-memory big-data workloads hosted on NUMA systems. To bridge this gap, this work quantifies the performance impact of large pages on in-memory big-data workloads running on a large-scale NUMA system. Our experimental results show that large pages provide no or little performance gains over the 4KB pages when the in-memory big-data workloads process sufficiently large datasets. In addition, our experimental results show that large pages achieve higher performance gains as the dataset size of the in-memory big-data workloads decreases and the NUMA system scale increases. We also discuss the possible performance optimizations for large pages and estimate the potential performance improvements.
引用
收藏
页码:209 / 218
页数:10
相关论文
共 50 条
  • [41] Main memory controller with multiple media technologies for big data workloads
    Miguel A. Avargues
    Manel Lurbe
    Salvador Petit
    Maria E. Gomez
    Rui Yang
    Xiaoping Zhu
    Guanhao Wang
    Julio Sahuquillo
    [J]. Journal of Big Data, 10
  • [42] Performance Characterization and Acceleration of Big Data Workloads on OpenPOWER System
    Lu, Xiaoyi
    Shi, Haiyang
    Shankar, Dipti
    Panda, Dhabaleswar K.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 213 - 222
  • [43] Dynamic Data Migration in Hybrid Main Memories for In-Memory Big Data Storage
    Mai, Hai Thanh
    Park, Kyoung Hyun
    Lee, Hun Soon
    Kim, Chang Soo
    Lee, Miyoung
    Hur, Sung Jin
    [J]. ETRI JOURNAL, 2014, 36 (06) : 988 - 998
  • [44] SparkNN: A distributed in-memory data partitioning for KNN queries on big spatial data
    Al Aghbari Z.
    Ismail T.
    Kamel I.
    [J]. Data Science Journal, 2020, 19 (01) : 1 - 14
  • [45] Big-data Oriented Commuting Distribution Model and Application in Large Cities
    Liu Y.
    Zhao P.
    Lv D.
    [J]. Journal of Geo-Information Science, 2021, 23 (07): : 1185 - 1195
  • [46] Quantifying the Impact of Big Data's Variety
    Whetsel, Robert C.
    Qu, Yanzhen
    [J]. PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 2299 - 2303
  • [47] Large Pages on Steroids: Small Ideas to Accelerate Big Memory Applications
    Jung, Daejin
    Li, Sheng
    Ahn, Jung Ho
    [J]. IEEE COMPUTER ARCHITECTURE LETTERS, 2016, 15 (02) : 101 - 104
  • [48] PERFORMANCE-EFFICIENT RECOMMENDATION AND PREDICTION SERVICE FOR BIG DATA FRAMEWORKS FOCUSING ON DATA COMPRESSION AND IN-MEMORY DATA STORAGE INDICATORS
    Astsatryan, Hrachya
    Lalayan, Arthur
    Kocharyan, Aram
    Hagimont, Daniel
    [J]. SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2021, 22 (04): : 401 - 412
  • [49] Evaluation of SMP Shared Memory Machines for Use With In-Memory and OpenMP Big Data Applications
    Younge, Andrew J.
    Reidy, Christopher
    Henschel, Robert
    Fox, Geoffrey C.
    [J]. 2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2016, : 1597 - 1606
  • [50] Performance Enhancement of Distributed K-Means Clustering for Big Data Analytics Through In-memory Computation
    Ketu, Shwet
    Agarwal, Sonali
    [J]. 2015 EIGHTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2015, : 318 - 324