Computationally Efficient Exact Calculation of Kernel Density Derivatives

被引:1
|
作者
Shaker, Matineh [1 ]
Myhre, Jonas Nordhaug [2 ]
Erdogmus, Deniz [1 ]
机构
[1] Northeastern Univ, Dept Elect & Comp Engn, Dana Res Ctr 409, Boston, MA 02115 USA
[2] Univ Tromso, Dept Phys & Technol, N-9037 Tromso, Norway
关键词
Kernel density estimate; Kernel density derivative estimate; Multivariate; High dimensional; Computational complexity; Efficient algorithm; Principal surfaces;
D O I
10.1007/s11265-014-0904-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Machine learning research related to the derivatives of the kernel density estimator has received limited attention compared to the density estimator itself. This is despite of the general consensus that most of the important features of a data distribution, such as modes, curvature or even cluster structure, are characterized by its derivatives. In this paper we present a computationally efficient algorithm to calculate kernel density estimates and their derivatives for linearly separable kernels, with significant savings especially for high dimensional data and higher order derivatives. It significantly reduces the number of operations (multiplications and derivative evaluations) to calculate the estimates, while keeping results exact (i.e. no approximations are involved). The main idea is that the calculation of multivariate separable kernels and their derivatives, such as the gradient vector and the Hessian matrix involves significant number of redundant operations that can be eliminated using the chain rule. A tree-based algorithm that calculates exact kernel density estimate and derivatives in the most efficient fashion is presented with the particular focus being on optimizing kernel evaluations for individual data pairs. In contrast, most approaches in the literature resort to approximations of functions or downsampling. Overall computational savings of the presented method could be further increased by incorporating such approximations, which aim to reduce the number of pairs of data considered. The theoretical computational complexity of the tree-based and direct methods that perform all multiplications are compared. In experimental results, calculating separable kernels and their derivatives is considered, as well as a measure that evaluates how close a point is to the principal curve of a density, which employs first and second derivatives. These results indicate considerable improvement in computational complexity, hence time over the direct approach.
引用
收藏
页码:321 / 332
页数:12
相关论文
共 50 条
  • [41] EXACT CALCULATION OF THE ENERGY DENSITY OF COSMOLOGICAL GRAVITATIONAL-WAVES
    MENDES, LE
    HENRIQUES, AB
    MOORHOUSE, RG
    PHYSICAL REVIEW D, 1995, 52 (04): : 2083 - 2088
  • [42] Efficient On-Line Nonparametric Kernel Density Estimation
    C. G. Lambert
    S. E. Harrington
    C. R. Harvey
    A. Glodjo
    Algorithmica, 1999, 25 : 37 - 57
  • [43] Efficient on-line nonparametric kernel density estimation
    Lambert, CG
    Harrington, SE
    Harvey, CR
    Glodjo, A
    ALGORITHMICA, 1999, 25 (01) : 37 - 57
  • [44] Calculation of the aeolian sediment flux-density profile based on estimation of the kernel density
    Li, Meng
    Dong, Zhibao
    Zhang, Zhengcai
    AEOLIAN RESEARCH, 2015, 16 : 49 - 54
  • [45] Exact and computationally efficient likelihood-based estimation for discretely observed diffusion processes (with discussion)
    Beskos, Alexandros
    Papaspiliopoulos, Omiros
    Roberts, Gareth O.
    Fearnhead, Paul
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2006, 68 : 333 - 361
  • [46] Root n bandwidth selectors for kernel estimation of density derivatives
    Wu, TJ
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1997, 92 (438) : 536 - 547
  • [47] Information bound for bandwidth selection in kernel estimation of density derivatives
    Wu, TJ
    Lin, Y
    STATISTICA SINICA, 2000, 10 (02) : 457 - 473
  • [48] On the asymptotic normality of multistage integrated density derivatives kernel estimators
    Tenreiro, C
    STATISTICS & PROBABILITY LETTERS, 2003, 64 (03) : 311 - 322
  • [49] A note on application of kernel derivatives in density estimation with the univariate case
    Siloko, I. U.
    Ikpotokin, O.
    Oyegue, F. O.
    Ishiekwene, C. C.
    Afere, B. A. E.
    JOURNAL OF STATISTICS & MANAGEMENT SYSTEMS, 2019, 22 (03): : 415 - 423
  • [50] Exact exchange kernel for time-dependent density-functional theory
    Gorling, A
    INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY, 1998, 69 (03) : 265 - 277