Distributed Adaptive Sampling for Kernel Matrix Approximation

被引:0
|
作者
Calandriello, Daniele [1 ]
Lazaric, Alessandro [1 ]
Valko, Michal [1 ]
机构
[1] INRIA Lille Nord Europe, SequeL Team, Villeneuve Dascq, France
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most kernel-based methods, such as kernel regression, kernel PCA, ICA, or k-means clustering, do not scale to large datasets, because constructing and storing the kernel matrix K-n requires at least O(n(2)) time and space for n samples. Recent works [1, 9] show that sampling points with replacement according to their ridge leverage scores (RLS) generates small dictionaries of relevant points with strong spectral approximation guarantees for K-n. The drawback of RLS-based methods is that computing exact RLS requires constructing and storing the whole kernel matrix. In this paper, we introduce SQUEAK, a new algorithm for kernel approximation based on RLS sampling that sequentially processes the dataset, storing a dictionary which creates accurate kernel matrix approximations with a number of points that only depends on the effective dimension d(eff) (gamma) of the dataset. Moreover since all the RLS estimations are efficiently performed using only the small dictionary, SQUEAK never constructs the whole matrix Kn, runs in linear time (O) over tilde (nd(eff) (gamma)(3)) w.r.t. n, and requires only a single pass over the dataset. We also propose a parallel and distributed version of SQUEAK achieving similar accuracy in as little as (O) over tilde (log(n)d(eff)(gamma)(3)) time.
引用
收藏
页码:1421 / 1429
页数:9
相关论文
共 50 条
  • [21] Fast Kernel Smoothing by a Low-Rank Approximation of the Kernel Toeplitz Matrix
    Guang Deng
    Jonathan H. Manton
    Song Wang
    Journal of Mathematical Imaging and Vision, 2018, 60 : 1181 - 1195
  • [22] Blended kernel approximation in the H-matrix techniques
    Hackbusch, W
    Khoromskij, BN
    NUMERICAL LINEAR ALGEBRA WITH APPLICATIONS, 2002, 9 (04) : 281 - 304
  • [23] OPTIMIZING ADAPTIVE IMPORTANCE SAMPLING BY STOCHASTIC APPROXIMATION
    Kawai, Reiichiro
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2018, 40 (04): : A2774 - A2800
  • [24] Matrix valued adaptive cross approximation
    Rjasanow, S.
    Weggler, L.
    MATHEMATICAL METHODS IN THE APPLIED SCIENCES, 2017, 40 (07) : 2522 - 2531
  • [25] Kernel-based adaptive approximation of functions with discontinuities
    Lenarduzzi, Licia
    Schaback, Robert
    APPLIED MATHEMATICS AND COMPUTATION, 2017, 307 : 113 - 123
  • [26] Adaptive Kernel Function Using Line Transect Sampling
    Albadareen, Baker
    Ismail, Noriszura
    2017 UKM FST POSTGRADUATE COLLOQUIUM, 2018, 1940
  • [27] Adaptive Kernel Graph Nonnegative Matrix Factorization
    Li, Rui-Yu
    Guo, Yu
    Zhang, Bin
    INFORMATION, 2023, 14 (04)
  • [28] RANDOM SAMPLING FOR DISTRIBUTED CODED MATRIX MULTIPLICATION
    Chang, Wei-Ting
    Tandon, Ravi
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 8187 - 8191
  • [29] Adaptive Private Distributed Matrix Multiplication
    Bitar, Rawad
    Xhemrishi, Marvin
    Wachter-Zeh, Antonia
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2022, 68 (04) : 2653 - 2673
  • [30] Sampling based approximation of linear functionals in reproducing kernel Hilbert spaces
    Gabriele Santin
    Toni Karvonen
    Bernard Haasdonk
    BIT Numerical Mathematics, 2022, 62 : 279 - 310