Column-oriented Database Systems

被引:91
|
作者
Abadi, Daniel J. [1 ]
Boncz, Peter A. [2 ]
Harizopoulos, Stavros [3 ]
机构
[1] Yale Univ, New Haven, CT 06520 USA
[2] CWl, Amsterdam, Netherlands
[3] HP Labs, Palo Alto, CA USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2009年 / 2卷 / 02期
关键词
D O I
10.14778/1687553.1687625
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Column-oriented database systems (column-stores) have attracted a lot of attention in the past few years. Column-stores, in a nutshell, store each database table column separately, with attribute values belonging to the same column stored contiguously, compressed, and densely packed, as opposed to traditional database systems that store entire records (rows) one after the other. Reading a subset of a table's columns becomes faster, at the potential expense of excessive disk-head seeking from column to column for scattered reads or updates. After several dozens of research papers and at least a dozen of new column-store start-ups, several questions remain. Are these a new breed of systems or simply old wine in new bottles? How easily can a major row-based system achieve column-store performance? Are column-stores the answer to effortlessly support large-scale data-intensive applications? What are the new, exciting system research problems to tackle? What are the new applications that can be potentially enabled by column-stores? In this tutorial, we present an overview of column-oriented database system technology and address these and other related questions.
引用
收藏
页码:1664 / 1665
页数:2
相关论文
共 50 条
  • [31] Data Integrity Verification in Column-Oriented NoSQL Databases
    Weintraub, Grisha
    Gudes, Ehud
    [J]. DATA AND APPLICATIONS SECURITY AND PRIVACY XXXII, DBSEC 2018, 2018, 10980 : 165 - 181
  • [32] Column-Oriented Datalog Materialization for Large Knowledge Graphs
    Urbani, Jacopo
    Jacobs, Ceriel
    Kroetzsch, Markus
    [J]. THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 258 - 264
  • [33] Column-oriented query execution engine for OLAP based on triplet
    Zhu, Yue-An
    Zhang, Yan-Song
    Zhou, Xuan
    Wang, Shan
    [J]. Ruan Jian Xue Bao/Journal of Software, 2014, 25 (04): : 753 - 767
  • [34] VParC: A Compression Scheme for Numeric Data in Column-Oriented Databases
    Yan, Ke
    Zhu, Hong
    Lu, Kevin
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2016, 13 (01) : 1 - 11
  • [35] A column-oriented optimization approach for the generation of correlated random vectors
    Jorge A. Sefair
    Oscar Guaje
    Andrés L. Medaglia
    [J]. OR Spectrum, 2021, 43 : 777 - 808
  • [36] Efficient column-oriented processing for mutual subspace skyline queries
    Jiang, Tao
    Zhang, Bin
    Lin, Dan
    Gao, Yunjun
    LI, Qing
    [J]. SOFT COMPUTING, 2020, 24 (20) : 15427 - 15445
  • [37] Logical Schema for Data Warehouse on Column-Oriented NoSQL Databases
    Boussahoua, Mohamed
    Boussaid, Omar
    Bentayeb, Fadila
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2017, PT II, 2017, 10439 : 247 - 256
  • [38] A column-oriented optimization approach for the generation of correlated random vectors
    Sefair, Jorge A.
    Guaje, Oscar
    Medaglia, Andres L.
    [J]. OR SPECTRUM, 2021, 43 (03) : 777 - 808
  • [39] Impact of Data Compression on the Performance of Column-oriented Data Stores
    Mladenova, Tsvetelina
    Kalmukov, Yordan
    Marinov, Milko
    Valova, Irena
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (07) : 416 - 421
  • [40] ColumnSGD: A Column-oriented Framework for Distributed Stochastic Gradient Descent
    Zhang, Zhipeng
    Wu, Wentao
    Jiang, Jiawei
    Yu, Lele
    Cui, Bin
    Zhang, Ce
    [J]. 2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 1513 - 1524