Optimization of Column-oriented Storage Compression Strategy Based on Hbase

被引:0
|
作者
Sun, Jingchao [1 ]
Lu, Tianliang [1 ]
机构
[1] Peoples Publ Secur Univ China, Sch Informat Technol & Network Secur, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Column-oriented storage; Data compression; HBase; Selection method of compression strategy;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In order to solve the problem of high learning cost and low compression efficiency caused by large data dispersion, this paper presents a sorted-based compression strategy selection method for HBase. Firstly, a method to sort the data in each column is designed according to the characteristics of HBase to strengthen the data compaction. Secondly, according to the characteristics of the data, a column-based compression strategy is proposed to recommend the compression scheme. Experiments on TPC-DS standard dataset show its competitive performance as compared with the other state-of-the-art methods.
引用
收藏
页码:24 / 28
页数:5
相关论文
共 50 条
  • [31] Startable: Multidimensional Modelling for Column-Oriented NoSQL
    Ferreira, Leandro Mendes
    Alves-Souza, Solange Nice
    da Silva, Luciana Maria
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON DATA SCIENCE, TECHNOLOGY AND APPLICATIONS (DATA), 2022, : 21 - 30
  • [32] Storing and Indexing RDF Data in a Column-Oriented DBMS
    Wang, Xin
    Wang, Shuyi
    Du, Pufeng
    Feng, Zhiyong
    2010 2ND INTERNATIONAL WORKSHOP ON DATABASE TECHNOLOGY AND APPLICATIONS PROCEEDINGS (DBTA), 2010,
  • [33] A Cost-Aware Strategy for Merging Differential Stores in Column-Oriented In-Memory DBMS
    Huebner, Florian
    Boese, Joos-Hendrik
    Krueger, Jens
    Tosun, Cafer
    Zeier, Alexander
    Plattner, Hasso
    ENABLING REAL-TIME BUSINESS INTELLIGENCE, BIRTE 2011, 2012, 126 : 38 - 52
  • [34] Implementation of Multidimensional Databases in Column-Oriented NoSQL Systems
    Chevalier, Max
    El Malki, Mohammed
    Kopliku, Arlind
    Teste, Olivier
    Tournier, Ronan
    ADVANCES IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2015, 2015, 9282 : 79 - 91
  • [35] Fast, secure encryption for indexing in a column-oriented DBMS
    Ge, Tingjian
    Zdonik, Stan
    2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2007, : 651 - +
  • [36] Tool for materializing OWL ontologies in a column-oriented database
    Reyes-Alvarez, Liudmila
    del Mar Roldan-Garcia, Maria
    Aldana-Montes, Jose F.
    SOFTWARE-PRACTICE & EXPERIENCE, 2019, 49 (01): : 100 - 119
  • [37] Data Integrity Verification in Column-Oriented NoSQL Databases
    Weintraub, Grisha
    Gudes, Ehud
    DATA AND APPLICATIONS SECURITY AND PRIVACY XXXII, DBSEC 2018, 2018, 10980 : 165 - 181
  • [38] DataCommandr: Column-oriented Data Integration, Transformation and Analysis
    Savinov, Alexandr
    IOTBD: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTERNET OF THINGS AND BIG DATA, 2016, : 339 - 347
  • [39] Column-Oriented Datalog Materialization for Large Knowledge Graphs
    Urbani, Jacopo
    Jacobs, Ceriel
    Kroetzsch, Markus
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 258 - 264
  • [40] Logical Schema for Data Warehouse on Column-Oriented NoSQL Databases
    Boussahoua, Mohamed
    Boussaid, Omar
    Bentayeb, Fadila
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2017, PT II, 2017, 10439 : 247 - 256