Optimization of Column-oriented Storage Compression Strategy Based on Hbase

Cited by: 0
Authors
Sun, Jingchao [1]
Lu, Tianliang [1]
Affiliations
[1] People's Public Security University of China, School of Information Technology and Network Security, Beijing, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
Column-oriented storage; Data compression; HBase; Selection method of compression strategy
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
To address the high learning cost and low compression efficiency caused by highly dispersed data, this paper presents a sort-based compression strategy selection method for HBase. First, a method for sorting the data in each column is designed around the characteristics of HBase to make the data more compact. Second, a column-based strategy uses the characteristics of the data to recommend a compression scheme. Experiments on the TPC-DS standard dataset show performance competitive with other state-of-the-art methods.
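The two ideas in the abstract, sorting a column before compression and choosing a compression scheme per column, can be illustrated with a minimal sketch. This is not the authors' method: the codecs are Python standard-library stand-ins rather than HBase codecs, the synthetic column and the helper names (`encode_column`, `compressed_sizes`, `pick_codec`) are hypothetical, and the selection rule (try every codec, keep the smallest output) is an assumption made only for illustration. The sketch also ignores how a sorted column would be re-aligned with its rows in a real store.

```python
import bz2
import lzma
import random
import zlib

# Stand-in codecs from the Python standard library.
# Assumption: these are not the HBase codecs evaluated in the paper.
CODECS = {
    "zlib": zlib.compress,
    "bz2": bz2.compress,
    "lzma": lzma.compress,
}


def encode_column(values):
    """Serialize a column of strings as newline-separated UTF-8 bytes."""
    return "\n".join(values).encode("utf-8")


def compressed_sizes(values):
    """Return {codec name: compressed size in bytes} for one column."""
    raw = encode_column(values)
    return {name: len(fn(raw)) for name, fn in CODECS.items()}


def pick_codec(values):
    """Hypothetical selection rule: choose the codec with the smallest output."""
    sizes = compressed_sizes(values)
    return min(sizes, key=sizes.get)


# Synthetic low-cardinality column (50 distinct values) in random order.
rng = random.Random(0)
column = [f"city_{rng.randrange(50):02d}" for _ in range(20_000)]

unsorted_sizes = compressed_sizes(column)
sorted_sizes = compressed_sizes(sorted(column))
best = pick_codec(sorted(column))

for name in CODECS:
    print(f"{name}: unsorted={unsorted_sizes[name]} B, sorted={sorted_sizes[name]} B")
print("selected codec for the sorted column:", best)
```

Sorting groups equal values into long runs, which general-purpose codecs encode very compactly, so each codec's sorted size should come out well below its unsorted size on this synthetic column. Exhaustively trying every codec is the simplest possible selection rule; a recommender based on data characteristics, as the abstract describes, would avoid compressing each column several times.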
Pages: 24-28
Page count: 5