Towards Analytics-as-a-Service Using an In-Memory Column Database

被引:0
|
作者
Schaffner, Jan [1 ]
Eckart, Benjamin [1 ]
Schwarz, Christian [1 ]
Brunnert, Jan [1 ]
Jacobs, Dean [2 ]
Zeier, Alexander [1 ]
机构
[1] Univ Potsdam, Hasso Plattner Inst, August Bebel Str 88, D-14482 Potsdam, Germany
[2] SAP AG, D-69190 Walldorf, Germany
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For traditional data warehouses, mostly large and expensive server and storage systems are used. For small- and medium size companies, it is often too expensive to implement and run such systems. Given this situation, the SaaS model comes in handy, since these companies might opt to run their OLAP as a service. The challenge is then for the analytics service provider to minimize TCO by consolidating as many tenants onto as few servers as possible, a technique often referred to as multi-tenancy. In this article, we report on three different results on our research around building a cluster of multi-tenant main memory column databases for analytics as a service. For this purpose we ported SAP's in-memory column database TREX to run in the Amazon cloud. We evaluated the relation between data size of a tenant and number of queries per second and created a formula which allows us to estimate how many tenants with different sizes and request rates can be put on one instance for our main memory database. We discuss findings on cost/performance tradeoffs between reliably storing the data of a tenant on a single node using a highly-available network attached storage, such as Amazon EBS, vs. replication of tenant data to a secondary node where the data resides on less resilient storage. We also describe a mechanism to provide support for historical queries across older snapshots of tenant data which is lazy-loaded from Amazon's S3 near-line archiving storage and cached on the local VM disks.
引用
收藏
页码:257 / +
页数:3
相关论文
共 50 条
  • [1] Analytics on Historical Data Using a Clustered Insert-Only In-Memory Column Database
    Schaffner, Jan
    Krueger, Jens
    Mueller, Stephan
    Hofmann, Paul
    Zeier, Alexander
    [J]. 2009 IEEE 16TH INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT, VOLS 1 AND 2, PROCEEDINGS, 2009, : 704 - +
  • [2] Towards Cloud-based Analytics-as-a-Service (CLAaaS) for Big Data Analytics in the Cloud
    Zulkernine, Farhana
    Martin, Patrick
    Zou, Ying
    Bauer, Michael
    Gwadry-Sridhar, Femida
    Aboulnaga, Ashraf
    [J]. 2013 IEEE INTERNATIONAL CONGRESS ON BIG DATA, 2013, : 62 - 69
  • [3] Towards Automatic Memory Tuning for In-Memory Big Data Analytics in Clusters
    Koliopoulos, Aris-Kyriakos
    Yiapanis, Paraskevas
    Tekiner, Firat
    Nenadic, Goran
    Keane, John
    [J]. 2016 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2016, 2016, : 353 - 356
  • [4] Oracle Database In-Memory: A Dual Format In-Memory Database
    Lahiri, Tirthankar
    Chavan, Shasank
    Colgan, Maria
    Das, Dinesh
    Ganesh, Amit
    Gleeson, Mike
    Hase, Sanket
    Holloway, Allison
    Kamp, Jesse
    Lee, Teck-Hua
    Loaiza, Juan
    Macnaughton, Neil
    Marwah, Vineet
    Mukherjee, Niloy
    Mullick, Atrayee
    Muthulingam, Sujatha
    Raja, Vivekanandhan
    Roth, Marty
    Soylemez, Ekrem
    Zait, Mohamed
    [J]. 2015 IEEE 31ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2015, : 1253 - 1258
  • [5] Understanding Bulk-Bitwise Processing In-Memory Through Database Analytics
    Perach, Ben
    Ronen, Ronny
    Kimelfeld, Benny
    Kvatinsky, Shahar
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2024, 12 (01) : 7 - 22
  • [6] Towards Integrating the Detection of Genetic Variants into an In-Memory Database
    Faehnrich, Cindy
    Schapranow, Matthieu-P.
    Plattner, Hasso
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2014,
  • [7] Using Storage Class Memory Efficiently for an In-memory Database
    Gottesman, Yonatan
    Nider, Joel
    Kat, Ronen
    Weinsberg, Yaron
    Factor, Michael
    [J]. PROCEEDINGS OF THE 9TH ACM INTERNATIONAL SYSTEMS AND STORAGE CONFERENCE (SYSTOR'16), 2016,
  • [8] From Analytics-as-a-Service to Analytics-as-a-Consumer-Service: Exploring a New Direction in Business Intelligence and Analytics Research
    Marjanovic, Olivera
    [J]. 2015 48TH HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES (HICSS), 2015, : 4742 - 4751
  • [9] Closing the Gap Between Experts and Novices Using Analytics-as-a-Service: An Experimental Study
    Jasmien Lismont
    Tine Van Calster
    María Óskarsdóttir
    Seppe vanden Broucke
    Bart Baesens
    Wilfried Lemahieu
    Jan Vanthienen
    [J]. Business & Information Systems Engineering, 2019, 61 : 679 - 693
  • [10] Analytics-as-a-Service (AaaS) Tool for Unstructured Data Mining
    Lomotey, Richard K.
    Deters, Ralph
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON CLOUD ENGINEERING (IC2E), 2014, : 319 - 324