Universal Lossless Compression of Graphical Data

被引:8
|
作者
Delgosha, Payam [1 ]
Anantharam, Venkat [1 ]
机构
[1] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
基金
美国国家科学基金会;
关键词
Convergence; Stochastic processes; Entropy; Social networking (online); Data models; Systems biology; Data compression; Graph compression; sparse graphs; local weak convergence; data compression; universal lossless compression;
D O I
10.1109/TIT.2020.2991384
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Graphical data is comprised of a graph with marks on its edges and vertices. The mark indicates the value of some attribute associated to the respective edge or vertex. Examples of such data arise in social networks, molecular and systems biology, and web graphs, as well as in several other application areas. Our goal is to design schemes that can efficiently compress such graphical data without making assumptions about its stochastic properties. Namely, we wish to develop a universal compression algorithm for graphical data sources. To formalize this goal, we employ the framework of local weak convergence, also called the objective method, which provides a technique to think of a marked graph as a kind of stationary stochastic processes, stationary with respect to movement between vertices of the graph. In recent work, we have generalized a notion of entropy for unmarked graphs in this framework, due to Bordenave and Caputo, to the case of marked graphs. We use this notion to evaluate the efficiency of a compression scheme. The lossless compression scheme we propose in this paper is then proved to be universally optimal in a precise technical sense. It is also capable of performing local data queries in the compressed form.
引用
收藏
页码:6962 / 6976
页数:15
相关论文
共 50 条
  • [1] Universal Lossless Compression of Graphical Data
    Delgosha, Payam
    Anantharam, Venkat
    [J]. 2017 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2017, : 1578 - 1582
  • [2] Universal lossless data embedding without explicit compression
    Yokoo, H
    [J]. PROCEEDINGS OF THE IEEE ITSOC INFORMATION THEORY WORKSHOP 2005 ON CODING AND COMPLEXITY, 2005, : 268 - 272
  • [3] Universal data compression with side information at the decoder by using traditional universal lossless compression algorithms
    Yang, En-hui
    He, Da-ke
    [J]. 2007 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY PROCEEDINGS, VOLS 1-7, 2007, : 431 - +
  • [4] Universal Approximation of Multi-copy States and Universal Quantum Lossless Data Compression
    Hayashi, Masahito
    [J]. COMMUNICATIONS IN MATHEMATICAL PHYSICS, 2010, 293 (01) : 171 - 183
  • [5] Universal Approximation of Multi-copy States and Universal Quantum Lossless Data Compression
    Masahito Hayashi
    [J]. Communications in Mathematical Physics, 2010, 293 : 171 - 183
  • [6] Boosting the performance of LZW compression through block sorting for universal lossless data compression
    Saha, Sajib Kumar
    Rahaman, Md. Masudur
    [J]. PROCEEDINGS OF 10TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT 2007), 2007, : 330 - +
  • [7] Universal Lossless Compression of Erased Symbols
    Yu, Jiming
    Verdu, Sergio
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2008, 54 (12) : 5563 - 5574
  • [8] An algorithm for universal lossless compression with side information
    Cai, Haixiao
    Kulkarni, Sanjeev R.
    Verdu, Sergio
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2006, 52 (09) : 4008 - 4016
  • [9] Universal Lossless Compression-based Denoising
    Su, Han-I
    Weissman, Tsachy
    [J]. 2010 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, 2010, : 1648 - 1652
  • [10] On Lossless Universal Compression of Distributed Identical Sources
    Beirami, Ahmad
    Fekri, Faramarz
    [J]. 2012 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY PROCEEDINGS (ISIT), 2012, : 561 - 565