Sketching information divergences

Cited by: 10
Authors
Guha, Sudipto [2]
Indyk, Piotr [3]
McGregor, Andrew [1]
Affiliations
[1] Univ Calif San Diego, Informat Theory & Applicat Ctr, San Diego, CA 92109 USA
[2] Univ Penn, Dept Comp & Informat Sci, Philadelphia, PA 19104 USA
[3] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
Keywords
Information divergences; Data stream model; Sketches; Communication complexity; Approximation algorithms;
DOI
10.1007/s10994-008-5054-x
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
When comparing discrete probability distributions, natural measures of similarity are not ℓ_p distances but rather are information divergences such as Kullback-Leibler and Hellinger. This paper considers some of the issues related to constructing small-space sketches of distributions in the data-stream model, a concept related to dimensionality reduction, such that these measures can be approximated from the sketches. Related problems for ℓ_p distances are reasonably well understood via a series of results by Johnson and Lindenstrauss (Contemp. Math. 26:189-206, 1984), Alon et al. (J. Comput. Syst. Sci. 58(1):137-147, 1999), Indyk (IEEE Symposium on Foundations of Computer Science, pp. 202-208, 2000), and Brinkman and Charikar (IEEE Symposium on Foundations of Computer Science, pp. 514-523, 2003). In contrast, almost no analogous results are known to date about constructing sketches for the information divergences used in statistics and learning theory. Our main result is an impossibility result that shows that no small-space sketches exist for the multiplicative approximation of any commonly used f-divergences and Bregman divergences with the notable exceptions of ℓ_1 and ℓ_2 where small-space sketches exist. We then present data-stream algorithms for the additive approximation of a wide range of information divergences. Throughout, our emphasis is on providing general characterizations.
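For readers unfamiliar with the divergences named in the abstract, the following is a minimal Python sketch of the standard f-divergence form D_f(p, q) = sum_i q_i f(p_i / q_i). It is an illustration only, not the paper's sketching or streaming algorithm. The function names, the example distributions, and the normalization of the Hellinger term (no factor of 1/2) are assumptions made here; the paper's own conventions may differ.

```python
import math

def f_divergence(p, q, f):
    """Exact f-divergence: sum_i q_i * f(p_i / q_i).

    Assumes p and q have equal length and every q_i > 0
    (illustrative helper, not taken from the paper).
    """
    return sum(qi * f(pi / qi) for pi, qi in zip(p, q))

def kl_generator(u):
    # f(u) = u log u yields the Kullback-Leibler divergence; f(0) is taken as 0.
    return u * math.log(u) if u > 0 else 0.0

def hellinger_generator(u):
    # f(u) = (sqrt(u) - 1)^2 yields the squared Hellinger distance
    # (some texts include an extra factor of 1/2).
    return (math.sqrt(u) - 1.0) ** 2

def l1_generator(u):
    # f(u) = |u - 1| yields the l1 distance, the f-divergence exception
    # mentioned in the abstract.
    return abs(u - 1.0)

# Hypothetical example distributions over three symbols.
p = [0.5, 0.3, 0.2]
q = [0.4, 0.4, 0.2]

kl = f_divergence(p, q, kl_generator)
hellinger_sq = f_divergence(p, q, hellinger_generator)
l1 = f_divergence(p, q, l1_generator)
print(kl, hellinger_sq, l1)
```

This computes each quantity exactly from the full distributions. The abstract's question is whether comparable estimates can be obtained from small-space sketches instead.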
Pages: 5-19
Number of pages: 15