On Optimal Data Compression in Multiterminal Statistical Inference

被引:11
|
作者
Amari, Shun-ichi [1 ]
机构
[1] RIKEN Brain Sci Inst, Wako, Saitama 3510198, Japan
关键词
Data compression; Fisher information; linear-threshold encoding; multiterminal source; multiterminal statistical inference; INFORMATION;
D O I
10.1109/TIT.2011.2162270
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The multiterminal theory of statistical inference deals with the problem of estimating or testing the correlation of letters generated from two (or many) correlated information sources under the restriction of a certain transmission rate for each source. A typical example is two binary sources with joint probability p(x, y) where the correlation of x and y is to be tested or estimated. Given n iid observations x(n) = x(1) ... x(n) and y(n) = y(1) ... y(n), only k = rn (0 < r < 1) bits each can be transmitted to a common destination. What is the optimal data compression for statistical inference? A simple idea is to send the first k letters of x(n) and y(n). A simpler problem is the helper case where the optimal data compression of x(n) is searched for under the condition that all of y(n) are transmitted. It is a long standing problem to determine if there is a better data compression scheme than this simple scheme of sending first k letters. The present paper searches for the optimal data compression under the framework of linear-threshold encoding and shows that there is a better data compression scheme depending on the value of correlation. To this end, we evaluate the Fisher information in the class of linear-threshold compression schemes. It is also proved that the simple scheme is optimal when x and y are independent or their correlation is not too large.
引用
收藏
页码:5577 / 5587
页数:11
相关论文
共 50 条
  • [31] Massive optimal data compression and density estimation for scalable, likelihood-free inference in cosmology
    Alsing, Justin
    Wandelt, Benjamin
    Feeney, Stephen
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2018, 477 (03) : 2874 - 2885
  • [32] Optimal data compression algorithm
    Sadeh, I
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 1996, 32 (05) : 57 - 72
  • [33] Human trimodal perception follows optimal statistical inference
    Wozny, David R.
    Beierholm, Ulrik R.
    Shams, Ladan
    JOURNAL OF VISION, 2008, 8 (03):
  • [34] STABILITY AND STATISTICAL INFERENCE FOR SEMIDISCRETE OPTIMAL TRANSPORT MAPS
    Sadhu, Ritwik
    Goldfeld, Ziv
    Kato, Kengo
    ANNALS OF APPLIED PROBABILITY, 2024, 34 (06): : 5694 - 5736
  • [35] Statistical inference of protein structural alignments using information and compression
    Collier, James H.
    Allison, Lloyd
    Lesk, Arthur M.
    Stuckey, Peter J.
    de la Banda, Maria Garcia
    Konagurthu, Arun S.
    BIOINFORMATICS, 2017, 33 (07) : 1005 - 1013
  • [36] Statistical Mechanics of Optimal Convex Inference in High Dimensions
    Advani, Madhu
    Ganguli, Surya
    PHYSICAL REVIEW X, 2016, 6 (03):
  • [37] Statistical mechanics of data compression theorem
    Murayama, T
    ISIT: 2002 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, PROCEEDINGS, 2002, : 254 - 254
  • [38] Statistical mechanics of the data compression theorem
    Murayama, T
    JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 2002, 35 (08): : L95 - L100
  • [39] INVARIANCE PROPERTIES AND STATISTICAL INFERENCE FOR CIRCULAR DATA
    Mastrantonio, Gianluca
    Lasinio, Giovanna Jona
    Maruotti, Antonello
    Calise, Gianfranco
    STATISTICA SINICA, 2019, 29 (01) : 67 - 80
  • [40] Statistical inference based on Lindley record data
    A. Asgharzadeh
    A. Fallah
    M. Z. Raqab
    R. Valiollahi
    Statistical Papers, 2018, 59 : 759 - 779