Computationally Efficient Estimation of Squared-Loss Mutual Information with Multiplicative Kernel Models

被引:9
|
作者
Sakai, Tomoya [1 ]
Sugiyama, Masashi [1 ]
机构
[1] Tokyo Inst Technol, Dept Comp Sci, Tokyo 1528552, Japan
来源
关键词
squared-loss mutual information; least-squares mutual information; density ratio estimation; multiplicative kernel models; independence test;
D O I
10.1587/transinf.E97.D.968
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Squared-loss mutual information (SMI) is a robust measure of the statistical dependence between random variables. The sample-based SMI approxirnator called least-squares mutual information (LSMI) was demonstrated to be useful in performing various machine learning tasks such as dimension reduction, clustering, and causal inference. The original LSMI approximates the pointwise mutual information by using the kernel model, which is a linear combination of kernel basis functions located on paired data samples. Although LSMI was proved to achieve the optimal approximation accuracy asymptotically, its approximation capability is limited when the sample size is small due to an insufficient number of kernel basis functions. Increasing the number of kernel basis functions can mitigate this weakness, but a naive implementation of this idea significantly increases the computation costs. In this article, we show that the computational complexity of LSMI with the multiplicative kernel model, which locates kernel basis functions on unpaired data samples and thus the number of kernel basis functions is the sample size squared, is the same as that for the plain kernel model. We experimentally demonstrate that LSMI with the multiplicative kernel model is more accurate than that with plain kernel models in small sample cases, with only mild increase in computation time.
引用
收藏
页码:968 / 971
页数:4
相关论文
共 50 条
  • [21] Computationally efficient learning of multivariate t mixture models with missing information
    Lin, Tsung-I
    Ho, Hsiu J.
    Shen, Pao S.
    COMPUTATIONAL STATISTICS, 2009, 24 (03) : 375 - 392
  • [22] Computationally efficient learning of multivariate t mixture models with missing information
    Tsung-I Lin
    Hsiu J. Ho
    Pao S. Shen
    Computational Statistics, 2009, 24 : 375 - 392
  • [23] Computationally efficient parameter estimation for high-dimensional ocean biogeochemical models
    Kern, Skyler
    McGuinn, Mary E.
    Smith, Katherine M.
    Pinardi, Nadia
    Niemeyer, Kyle E.
    Lovenduski, Nicole S.
    Hamlington, Peter E.
    GEOSCIENTIFIC MODEL DEVELOPMENT, 2024, 17 (02) : 621 - 649
  • [24] Semi-supervised Feature Selection by Mutual Information Based on Kernel Density Estimation
    Xu, Siqi
    Dai, Jianhua
    Shi, Hong
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 818 - 823
  • [25] ESTIMATION OF THE BLURRING KERNEL IN EXPERIMENTAL HR-PQCT IMAGES BASED ON MUTUAL INFORMATION
    Li, Y.
    Sixou, B.
    Peyrin, F.
    2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 2086 - 2090
  • [26] Robust estimation and empirical likelihood inference with exponential squared loss for panel data models
    Li, Shaomin
    Wang, Kangning
    Ren, Yanyan
    ECONOMICS LETTERS, 2018, 164 : 19 - 23
  • [27] Estimation of Statistical Translation Models Based on Mutual Information for Ad Hoc Information Retrieval
    Karimzadehgan, Maryam
    Zhai, ChengXiang
    SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, 2010, : 323 - 330
  • [28] Bayesian Experimental Design for Implicit Models by Mutual Information Neural Estimation
    Kleinegesse, Steven
    Gutmann, Michael U.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [29] A new histogram-based estimation technique of entropy and mutual information using mean squared error minimization
    Hacine-Gharbi, A.
    Deriche, M.
    Ravier, P.
    Harba, R.
    Mohamadi, T.
    COMPUTERS & ELECTRICAL ENGINEERING, 2013, 39 (03) : 918 - 933
  • [30] Mutual information independence model using kernel density estimation for segmenting and labeling sequential data
    Zhou, GD
    Yang, LP
    Su, J
    Ji, DH
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2005, 3406 : 155 - 166