In-Memory Big Data Management and Processing: A Survey

被引:229
|
作者
Zhang, Hao [1 ]
Chen, Gang [2 ]
Ooi, Beng Chin [1 ]
Tan, Kian-Lee [1 ]
Zhang, Meihui [3 ]
机构
[1] Natl Univ Singapore, Sch Comp, Singapore 117417, Singapore
[2] Zhejiang Univ, Coll Comp Sci, Hangzhou 310027, Zhejiang, Peoples R China
[3] Singapore Univ Technol & Design, Informat Syst Technol & Design Pillar, Singapore 487372, Singapore
关键词
Primary memory; DRAM; relational databases; distributed databases; query processing; PHASE-CHANGE MEMORY; HIGH-PERFORMANCE; SCALABLE SYSTEM; MULTI-CORE; COLD DATA; B+-TREES; SAP HANA; MAIN; TRANSACTION; JOINS;
D O I
10.1109/TKDE.2015.2427795
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Growing main memory capacity has fueled the development of in-memory big data management and processing. By eliminating disk I/O bottleneck, it is now possible to support interactive data analytics. However, in-memory systems are much more sensitive to other sources of overhead that do not matter in traditional I/O-bounded disk-based systems. Some issues such as fault-tolerance and consistency are also more challenging to handle in in-memory environment. We are witnessing a revolution in the design of database systems that exploits main memory as its data storage layer. Many of these researches have focused along several dimensions: modern CPU and memory hierarchy utilization, time/space efficiency, parallelism, and concurrency control. In this survey, we aim to provide a thorough review of a wide range of in-memory data management and processing proposals and systems, including both data storage systems and data processing frameworks. We also give a comprehensive presentation of important technology in memory management, and some key factors that need to be considered in order to achieve efficient in-memory data management and processing.
引用
收藏
页码:1920 / 1948
页数:29
相关论文
共 50 条
  • [1] Fast and Efficient In-Memory Big Data Processing
    Malik, Babur Hayat
    Maryam, Maliha
    Khalid, Myda
    Khlaid, Javaria
    Rehman, Naj Am Ur
    Sajjad, Syeda Iqra
    Islam, Tanveer
    Butt, Umair Ahmed
    Raza, Ali
    Nasr, M. Saad
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (05) : 517 - 524
  • [2] Timo: In-Memory Temporal Query Processing for Big Temporal Data
    Zheng, Xiao
    Liu, Hou-kai
    Wei, Lin-na
    Wu, Xuan-gou
    Zhang, Zhen
    [J]. 2019 SEVENTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2019, : 121 - 126
  • [3] Survey of In-memory Big Data Analytics and Latest Research Opportunities
    Gangarde, Rupali
    Pawar, Ambika
    Dani, Ajay
    [J]. 2016 FOURTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2016, : 197 - 201
  • [4] Timo: In-memory temporal query processing for big temporal data
    Zheng, Xiao
    Liu, Houkai
    Wang, Xiujun
    Wu, Xuangou
    Yu, Feng
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (13):
  • [5] In-Memory Performance for Big Data
    Graefe, Goetz
    Volos, Haris
    Kimura, Hideaki
    Kuno, Harumi
    Tucek, Joseph
    Lillibridge, Mark
    Veitch, Alistair
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 8 (01): : 37 - 48
  • [6] LocationSpark: A Distributed In-Memory Data Management System for Big Spatial Data
    Tang, Mingjie
    Yu, Yongyang
    Malluhi, Qutaibah M.
    Ouzzani, Mourad
    Aref, Walid G.
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 9 (13): : 1565 - 1568
  • [7] MemepiC: Towards a Unified In-Memory Big Data Management System
    Cai, Qingchao
    Zhang, Hao
    Guo, Wentian
    Chen, Gang
    Ooi, Beng Chin
    Tan, Kian-Lee
    Wong, Weng-Fai
    [J]. IEEE TRANSACTIONS ON BIG DATA, 2019, 5 (01) : 4 - 17
  • [8] Massively Parallel Big Data Classification on a Programmable Processing In-Memory Architecture
    Kim, Yeseong
    Imani, Mohsen
    Gupta, Saransh
    Zhou, Minxuan
    Rosing, Tajana S.
    [J]. 2021 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN (ICCAD), 2021,
  • [9] DigitalPIM: Digital-based Processing In-Memory for Big Data Acceleration
    Imani, Mohsen
    Gupta, Saransh
    Kim, Yeseong
    Zhou, Minxuan
    Rosing, Tajana
    [J]. GLSVLSI '19 - PROCEEDINGS OF THE 2019 ON GREAT LAKES SYMPOSIUM ON VLSI, 2019, : 429 - 434
  • [10] Data Prefetching and Eviction Mechanisms of In-Memory Storage Systems Based on Scheduling for Big Data Processing
    Chen, Chien-Hung
    Hsia, Ting-Yuan
    Huang, Yennun
    Kuo, Sy-Yen
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, 30 (08) : 1738 - 1752