Optimization of Column-oriented Storage Compression Strategy Based on HBase

Cited by: 0
Authors
Sun, Jingchao [1]
Lu, Tianliang [1]
Affiliations
[1] Peoples Publ Secur Univ China, Sch Informat Technol & Network Secur, Beijing, Peoples R China
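Funding
National Natural Science Foundation of China
Keywords
Column-oriented storage; Data compression; HBase; Selection method of compression strategy
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract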
To address the high learning cost and low compression efficiency caused by highly dispersed data, this paper presents a sort-based method for selecting compression strategies in HBase. First, a method for sorting the data in each column is designed around the characteristics of HBase to make the data more compact. Second, a column-based strategy recommends a compression scheme according to the characteristics of each column's data. Experiments on the TPC-DS standard dataset show that the method performs competitively with other state-of-the-art methods.
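The two ideas in the abstract (sorting a column before compression, then choosing a compression scheme per column) can be illustrated with a minimal sketch. This is not the authors' method: the synthetic column, the helper names `compressed_size` and `recommend_codec`, and the use of Python standard-library codecs as stand-ins for HBase compression options are all assumptions made for illustration. The sketch also ignores how row associations would be preserved after sorting.

```python
import bz2
import lzma
import random
import zlib

# Stand-in codecs from the Python standard library (an assumption;
# the codecs evaluated in the paper are not listed in the abstract).
CODECS = {"zlib": zlib.compress, "bz2": bz2.compress, "lzma": lzma.compress}


def compressed_size(values, codec):
    """Serialize a column as newline-separated text and return its compressed size in bytes."""
    payload = "\n".join(values).encode("utf-8")
    return len(codec(payload))


def recommend_codec(values):
    """Sort the column, try every stand-in codec, and return (name, size) of the smallest result."""
    ordered = sorted(values)
    sizes = {name: compressed_size(ordered, fn) for name, fn in CODECS.items()}
    best = min(sizes, key=sizes.get)
    return best, sizes[best]


# Hypothetical low-cardinality column with values in random order.
random.seed(0)
column = [f"city_{random.randint(0, 50):03d}" for _ in range(20000)]

unsorted_size = compressed_size(column, zlib.compress)
sorted_size = compressed_size(sorted(column), zlib.compress)
best_name, best_size = recommend_codec(column)

print("zlib, original order:", unsorted_size, "bytes")
print("zlib, sorted column: ", sorted_size, "bytes")
print("recommended codec:   ", best_name, best_size, "bytes")
```

Sorting groups equal values into long runs, which dictionary-based and run-oriented compressors exploit well, so the sorted column should compress to far fewer bytes than the same values in random order. Exhaustively trying each codec, as done here, is only practical for a small candidate set.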
Pages: 24-28
Page count: 5