A Performance Study on Large-Scale Data Analytics Using Disk-Based and In-Memory Database Systems

被引:0
|
作者
Chao, Pingfu [1 ]
He, Dan [1 ]
Sadiq, Shazia [1 ]
Zheng, Kai [2 ]
Zhou, Xiaofang [1 ]
机构
[1] Univ Queensland, Sch Informat Technol & Elect Engn, St Lucia, Qld, Australia
[2] Soochow Univ, Adv Data Analyt Lab, Suzhou, Peoples R China
关键词
Data Warehousing; Performance Evaluation; Relational Database; In-memory Database;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
With the significant increase in memory size, in-memory database systems are becoming the dominant way of dealing with large scale data analytics as compared to the traditional disk-based systems such as data warehouses. Due to the significant differences in both physical and logical designs, these two systems show totally different characteristics on massive data analytic workload. In order to address the difference and technical reasons behind, we contrast the performance between disk-based data warehousing and in-memory database systems by comparing two state-of-the-art commercial systems using a large-scale real transportation dataset. This independent performance study reveals several interesting insights. Experimental evaluation shows that the in-memory system can achieve competitive performance on most data analytics queries with less model maintenance cost and more flexibility, but it is not capable in other cases. We summarise the results of our study and provide guidelines on how to select an appropriate system for a given data analytics task.
引用
收藏
页码:247 / 254
页数:8
相关论文
共 50 条
  • [41] A method for using legacy data for metamodel-based design of large-scale systems
    Srivastava, A
    Hacker, K
    Lewis, K
    Simpson, TW
    [J]. STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2004, 28 (2-3) : 146 - 155
  • [42] A method for using legacy data for metamodel-based design of large-scale systems
    A. Srivastava
    K. Hacker
    K. Lewis
    T.W. Simpson
    [J]. Structural and Multidisciplinary Optimization, 2004, 28 : 146 - 155
  • [43] Optimizing Data Aggregation by Leveraging the Deep Memory Hierarchy on Large-scale Systems
    Tessier, Francois
    Gressier, Paul
    Vishwanath, Venkatram
    [J]. INTERNATIONAL CONFERENCE ON SUPERCOMPUTING (ICS 2018), 2018, : 229 - 239
  • [44] OPAL: High performance platform for large-scale privacy-preserving location data analytics
    Oehmichen, Axel
    Jain, Shubham
    Gadotti, Andrea
    de Montjoye, Yves-Alexandre
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 1332 - 1342
  • [45] DOE Global Energy Storage Database - A Platform for Large Scale Data Analytics and System Performance Metrics
    Hernandez, Jacquelynne
    Gyuk, Imre
    Christensen, Cedric
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON POWER SYSTEM TECHNOLOGY (POWERCON), 2016,
  • [46] In-Memory Distributed Mosaicking for Large-Scale Remote Sensing Applications with Geo-Gridded Data Staging on Alluxio
    Ma, Yan
    Song, Jie
    Zhang, Zhixin
    [J]. REMOTE SENSING, 2022, 14 (23)
  • [47] Large-scale Predictive Analytics in Vertica: Fast Data Transfer, Distributed Model Creation, and In-database Prediction
    Prasad, Shreya
    Fard, Arash
    Gupta, Vishrut
    Martinez, Jorge
    LeFevre, Jeff
    Xu, Vincent
    Hsu, Meichun
    Roy, Indrajit
    [J]. SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, : 1657 - 1668
  • [48] Large-scale data analysis on aviation accident database using different data mining techniques
    Christopher, A. B. Arockia
    Vivekanandam, V. Shunmughavel
    Anderson, A. B. Antony
    Markkandeyan, S.
    Sivakumar, V.
    [J]. AERONAUTICAL JOURNAL, 2016, 120 (1234): : 1849 - 1866
  • [49] An Interactive Web-Based System Using Cloud for Large-Scale Visual Analytics
    Kaseb, Ahmed S.
    Berry, Everett
    Rozolis, Erik
    McNulty, Kyle
    Bontrager, Seth
    Koh, Youngsol
    Lu, Yung-Hsiang
    Delp, Edward J.
    [J]. IMAGING AND MULTIMEDIA ANALYTICS IN A WEB AND MOBILE WORLD 2015, 2015, 9408
  • [50] A Latent Factor Analysis of Working Memory Measures Using Large-Scale Data
    Waris, Otto
    Soveri, Anna
    Ahti, Miikka
    Hoffing, Russell C.
    Ventus, Daniel
    Jaeggi, Susanne M.
    Seitz, Aaron R.
    Laine, Matti
    [J]. FRONTIERS IN PSYCHOLOGY, 2017, 8