Distributed Adaptive Sampling for Kernel Matrix Approximation

被引:0
|
作者
Calandriello, Daniele [1 ]
Lazaric, Alessandro [1 ]
Valko, Michal [1 ]
机构
[1] INRIA Lille Nord Europe, SequeL Team, Villeneuve Dascq, France
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most kernel-based methods, such as kernel regression, kernel PCA, ICA, or k-means clustering, do not scale to large datasets, because constructing and storing the kernel matrix K-n requires at least O(n(2)) time and space for n samples. Recent works [1, 9] show that sampling points with replacement according to their ridge leverage scores (RLS) generates small dictionaries of relevant points with strong spectral approximation guarantees for K-n. The drawback of RLS-based methods is that computing exact RLS requires constructing and storing the whole kernel matrix. In this paper, we introduce SQUEAK, a new algorithm for kernel approximation based on RLS sampling that sequentially processes the dataset, storing a dictionary which creates accurate kernel matrix approximations with a number of points that only depends on the effective dimension d(eff) (gamma) of the dataset. Moreover since all the RLS estimations are efficiently performed using only the small dictionary, SQUEAK never constructs the whole matrix Kn, runs in linear time (O) over tilde (nd(eff) (gamma)(3)) w.r.t. n, and requires only a single pass over the dataset. We also propose a parallel and distributed version of SQUEAK achieving similar accuracy in as little as (O) over tilde (log(n)d(eff)(gamma)(3)) time.
引用
收藏
页码:1421 / 1429
页数:9
相关论文
共 50 条
  • [31] Sampling based approximation of linear functionals in reproducing kernel Hilbert spaces
    Santin, Gabriele
    Karvonen, Toni
    Haasdonk, Bernard
    BIT NUMERICAL MATHEMATICS, 2022, 62 (01) : 279 - 310
  • [32] Matrix Approximation and Projective Clustering via Volume Sampling
    Deshpande, Amit
    Rademacher, Luis
    Vempala, Santosh
    Wang, Grant
    PROCEEDINGS OF THE SEVENTHEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2006, : 1117 - 1126
  • [33] Matrix Approximation and Projective Clustering via Volume Sampling
    Deshpande, Amit
    Rademacher, Luis
    Vempala, Santosh
    Wang, Grant
    Theory Comput., (225-247):
  • [34] Global approximation of complex model based on adaptive sampling
    Yin X.-L.
    Qian C.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2022, 56 (09): : 1815 - 1823
  • [35] ACCELERATION ON ADAPTIVE IMPORTANCE SAMPLING WITH SAMPLE AVERAGE APPROXIMATION
    Kawai, Reiichiro
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2017, 39 (04): : A1586 - A1615
  • [36] Matern Kernel Adaptive Filtering With Nystrom Approximation for Indoor Localization
    Dong, Wenhao
    Li, Xifeng
    Bi, Dongjie
    Xie, Yongle
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [37] Accelerating Adaptive Online Learning by Matrix Approximation
    Wan, Yuanyu
    Zhang, Lijun
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2018, PT II, 2018, 10938 : 403 - 415
  • [38] Accelerating adaptive online learning by matrix approximation
    Yuanyu Wan
    Lijun Zhang
    International Journal of Data Science and Analytics, 2020, 9 : 389 - 400
  • [39] Adaptive Ensemble Probabilistic Matrix Approximation for Recommendation
    Li, Xingxing
    Jing, Liping
    Liu, Huafeng
    PATTERN RECOGNITION AND COMPUTER VISION, PT III, 2018, 11258 : 328 - 339
  • [40] Accelerating adaptive online learning by matrix approximation
    Wan, Yuanyu
    Zhang, Lijun
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2020, 9 (04) : 389 - 400