Improving State-of-the-Art Compression Techniques for Log Management Tools

被引:4
|
作者
Yao, Kundi [1 ]
Sayagh, Mohammed [2 ]
Shang, Weiyi [3 ]
Hassan, Ahmed E. [1 ]
机构
[1] Queens Univ, Software Anal & Intelligence Lab SAIL, Kingston, ON K7L 3N6, Canada
[2] ETS Quebec Univ, Software & IT Engn Dept, Montreal, PQ H3C 1K3, Canada
[3] Concordia Univ, Dept Comp Sci & Software Engn, Montreal, PQ, Canada
关键词
Tools; Indexes; IP networks; Software systems; Runtime; Monitoring; Message systems; Software log compression; software logging; log management tools;
D O I
10.1109/TSE.2021.3069958
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Log data records important runtime information about the running of a software system for different purposes including performance assurance, capacity planning, and anomaly detection. Log management tools such as ELK Stack and Splunk are widely adopted to manage and leverage log data in order to assist DevOps in real-time log analytics and decision making. To enable fast queries and to save storage space, such tools split log data into small blocks (e.g., 16KB), then index and compress each block separately. Previous log compression studies focus on improving the compression of either large-sized log files or log streams, without considering improving the compression of small log blocks (the actual compression need by modern log management tools). The evaluation of four state-of-the-art compression approaches (e.g., Logzip, a variation of Logzip by pre-extracting log templates named Logzip-E, LogArchive and Cowic) indicates that these approaches do not perform well on small log blocks. In fact, the compressed blocks that are preprocessed using Logzip, Logzip-E, LogArchive or Cowic are even larger (on median 1.3 times, 1.5 times, 0.2 times or 6.6 times) than the compressed blocks without any preprocessing. Hence, we propose an approach named LogBlock to preprocess small log blocks before compressing them with a general compressor such as gzip, deflate and lz4, which are widely adopted by log management tools. LogBlock reduces the repetitiveness of logs by preprocessing the log headers and rearranging the log content leading to an improved compression ratio for a log file. Our evaluation on 16 log files shows that, for 16KB to 128KB block sizes, the compressed blocks by LogBlock are on median 5 to 21 percent smaller than the same compressed blocks without preprocessing (outperforming the state-of-the-art compression approaches). LogBlock achieves both a higher compression ratio (a median of 1.7 to 8.4 times, 1.9 to 10.0 times, 1.3 to 1.9 times and 6.2 to 11.4 times) and a faster compression speed (a median of 30.8 to 49.7 times, 42.6 to 53.8 times, 4.5 to 6.0 times and 2.5 to 4.0 times) than Logzip, Logzip-E, LogArchive and Cowic. LogBlock can help improve the storage efficiency of log management tools.
引用
收藏
页码:2748 / 2760
页数:13
相关论文
共 50 条
  • [1] Lossless Image Compression Techniques: A State-of-the-Art Survey
    Rahman, Md Atiqur
    Hamada, Mohamed
    [J]. SYMMETRY-BASEL, 2019, 11 (10):
  • [2] Studies of exotic nuclei: state-of-the-art experimental tools and techniques
    Paschalis, Stefatios
    [J]. FAIRNESS 2014: FAIR NEXT GENERATION SCIENTISTS 2014, 2015, 599
  • [3] State-of-the-Art Imaging Techniques in Metastatic Spinal Cord Compression
    Kuah, Tricia
    Vellayappan, Balamurugan A.
    Makmur, Andrew
    Nair, Shalini
    Song, Junda
    Tan, Jiong Hao
    Kumar, Naresh
    Quek, Swee Tian
    Hallinan, James Thomas Patrick Decourcy
    [J]. CANCERS, 2022, 14 (13)
  • [4] The Impact of State-of-the-Art Techniques for Lossless Still Image Compression
    Rahman, Md Atiqur
    Hamada, Mohamed
    Shin, Jungpil
    [J]. ELECTRONICS, 2021, 10 (03) : 1 - 40
  • [5] EVOLUTION AND STATE-OF-THE-ART OF THE LOG BARGE
    MCALLISTER, M
    [J]. MARINE TECHNOLOGY AND SNAME NEWS, 1995, 32 (02): : 132 - 139
  • [6] BIM and Digital Tools for State-of-the-Art Construction Cost Management
    Sepasgozar, Samad M. E.
    Costin, Aaron M.
    Karimi, Reyhaneh
    Shirowzhan, Sara
    Abbasian, Ezatollah
    Li, Jinyun
    [J]. BUILDINGS, 2022, 12 (04)
  • [7] State-of-the-art Tools and Techniques for Quantitative Modeling and Analysis of Embedded Systems
    Bozga, Marius
    David, Alexandre
    Hartmanns, Arnd
    Hermanns, Holger
    Larsen, Kim G.
    Legay, Axel
    Tretmans, Jan
    [J]. DESIGN, AUTOMATION & TEST IN EUROPE (DATE 2012), 2012, : 370 - 375
  • [8] UNIVERSAL NOISELESS AND NOISY DATA-COMPRESSION TECHNIQUES - THE STATE-OF-THE-ART
    COX, BV
    LINDSAY, RA
    [J]. COMPUTER COMMUNICATIONS, 1987, 10 (04) : 179 - 189
  • [9] TOLERANCING TECHNIQUES - THE STATE-OF-THE-ART
    ZHANG, HC
    HUQ, ME
    [J]. INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 1992, 30 (09) : 2111 - 2135
  • [10] Cryptography and state-of-the-art techniques
    Ahmed, Mohiuddin
    Sazzad, T.M. Shahriar
    Mollah, Md. Elias
    [J]. International Journal of Computer Science Issues, 2012, 9 (2 2-3): : 583 - 586