Optimization of Column-oriented Storage Compression Strategy Based on Hbase

Cited by: 0
Authors
Sun, Jingchao [1 ]
Lu, Tianliang [1 ]
Affiliation
[1] Peoples Publ Secur Univ China, Sch Informat Technol & Network Secur, Beijing, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Column-oriented storage; Data compression; HBase; Selection method of compression strategy;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
To address the high learning cost and low compression efficiency caused by highly dispersed data, this paper presents a sort-based method for selecting compression strategies in HBase. First, a method for sorting the data in each column is designed around the characteristics of HBase to make the data more compact. Second, a column-based strategy recommends a compression scheme according to the characteristics of the data. Experiments on the standard TPC-DS dataset show competitive performance compared with other state-of-the-art methods.
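The abstract gives no algorithmic detail, so the following is only a minimal sketch of the general idea: sorting a column groups equal values together, which can change which encoding suits it. The function names, the thresholds, and the three candidate encodings are all hypothetical choices made for illustration. They are not the paper's method and not HBase API calls.

```python
from itertools import groupby


def run_length_encode(values):
    """Collapse consecutive equal values into (value, count) pairs."""
    return [(value, len(list(group))) for value, group in groupby(values)]


def recommend_encoding(values):
    """Toy heuristic (an assumption, not the paper's rule).

    Prefers run-length encoding when runs are long, dictionary encoding
    when few distinct values exist, and a general-purpose codec otherwise.
    """
    n = len(values)
    if n == 0:
        return "none"
    runs = len(run_length_encode(values))
    distinct = len(set(values))
    if runs <= n // 2:        # on average, each run covers 2+ values
        return "run-length"
    if distinct <= n // 2:    # many repeats, but scattered
        return "dictionary"
    return "general-purpose"


# Hypothetical column: repeated values that are scattered before sorting.
column = ["b", "a", "c", "a", "b", "a", "c", "a"]
sorted_column = sorted(column)

print(recommend_encoding(column))         # recommendation for the scattered column
print(recommend_encoding(sorted_column))  # recommendation after sorting
print(run_length_encode(sorted_column))   # runs found in the sorted column
```

In this toy example the unsorted column has eight runs of length one, so run-length encoding gains nothing, while the sorted column collapses into three runs. A real selector would also have to weigh the cost of sorting and of decoding.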
Pages: 24-28
Number of pages: 5