Block as a Value for SQL over NoSQL

被引:3
|
作者
Cao, Yang [1 ]
Fan, Wenfei [1 ,2 ,3 ]
Yuan, Tengfei [1 ]
机构
[1] Univ Edinburgh, Edinburgh, Midlothian, Scotland
[2] Beihang Univ, Beijing, Peoples R China
[3] Shenzhen Univ, SICS, Shenzhen, Guangdong, Peoples R China
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2019年 / 12卷 / 10期
基金
英国工程与自然科学研究理事会;
关键词
D O I
10.14778/3339490.3339498
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents Zidian, a middleware for key-value (KV) stores to speed up SQL query evaluation over NoSQL. As opposed to common practice that takes a tuple id or primary key as key and the entire tuple as value, Zidian proposes a block-as-a-value model BaaV. BaaV represents a relation as keyed blocks (k, B), where k is a key of a block (a set) B of partial tuples. We extend relational algebra to BaaV. We show that under BaaV, Zidian substantially reduces data access and communication cost. We provide characterizations (sufficient and necessary conditions) for (a) result-preserving queries, i.e., queries covered by available BaaV stores, (b) scan-free queries, i. e. , queries that can be evaluated without scanning any table, and (c) bounded queries, i.e., queries that can be answered by accessing a bounded amount of data. We show that in parallel processing, Zidian guarantees (a) no scans for scan-free queries, (b) bounded communication cost for bounded queries; and (c) parallel scalability, i.e., speed up when adding processors. Moreover, Zidian can be plugged into existing SQL-over-NoSQL systems and retains horizontal scalability. Using benchmark and real-life data, we empirically verify that Zidian improves existing SQL-over-NoSQL systems by 2 orders of magnitude on average.
引用
收藏
页码:1153 / 1166
页数:14
相关论文
共 50 条
  • [21] Migrating from SQL to NOSQL Database: Practices and Analysis
    Yassine, Fatima
    Awad, Mamoun Adel
    PROCEEDINGS OF THE 2018 13TH INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION TECHNOLOGY (IIT), 2018, : 58 - 62
  • [22] Performance evaluation of Twitter datasets on SQL and NoSQL DBMS
    Leung, Franklin
    Zhou, Bing
    WEB INTELLIGENCE, 2016, 14 (04) : 275 - 286
  • [23] Closing the functional and Performance Gap between SQL and NoSQL
    Liu, Zhen Hua
    Hammerschmidt, Beda
    McMahon, Doug
    Lu, Ying
    Chang, Hui Joe
    SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 227 - 238
  • [24] A Comparison of NoSQL and SQL Databases over the Hadoop and Spark Cloud Platforms using Machine Learning Algorithms
    Lee, Chao-Hsien
    Shih, Zhe-Wei
    2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN (ICCE-TW), 2018,
  • [25] JackHare: a framework for SQL to NoSQL translation using MapReduce
    Chung, Wu-Chun
    Lin, Hung-Pin
    Chen, Shih-Chang
    Jiang, Mon-Fong
    Chung, Yeh-Ching
    AUTOMATED SOFTWARE ENGINEERING, 2014, 21 (04) : 489 - 508
  • [26] Associative Array Model of SQL, NoSQL, and NewSQL Databases
    Kepner, Jeremy
    Gadepally, Vijay
    Hutchison, Dylan
    Jananthan, Hayden
    Mattson, Timothy
    Samsi, Siddharth
    Reuther, Albert
    2016 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2016,
  • [27] SQL Databases v. NoSQL Databases comment
    Ernst, Johannes
    COMMUNICATIONS OF THE ACM, 2010, 53 (04) : 11 - 11
  • [28] JackHare: a framework for SQL to NoSQL translation using MapReduce
    Wu-Chun Chung
    Hung-Pin Lin
    Shih-Chang Chen
    Mon-Fong Jiang
    Yeh-Ching Chung
    Automated Software Engineering, 2014, 21 : 489 - 508
  • [29] SeCloudDB: A Unified API for Secure SQL and NoSQL Cloud Databases
    Alomari, Ebtesam
    Noaman, Amin
    PROCEEDINGS OF 2019 3RD INTERNATIONAL CONFERENCE ON CLOUD AND BIG DATA COMPUTING (ICCBDC 2019), 2019, : 38 - 42
  • [30] Uniform data access platform for SQL and NoSQL database systems
    Vathy-Fogarassy, Agnes
    Hugyak, Tamas
    INFORMATION SYSTEMS, 2017, 69 : 93 - 105