Distributed Adaptive Sampling for Kernel Matrix Approximation

Authors
Calandriello, Daniele [1]
Lazaric, Alessandro [1]
Valko, Michal [1]
Affiliations
[1] INRIA Lille Nord Europe, SequeL Team, Villeneuve d'Ascq, France
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Most kernel-based methods, such as kernel regression, kernel PCA, ICA, or k-means clustering, do not scale to large datasets, because constructing and storing the kernel matrix $K_n$ requires at least $O(n^2)$ time and space for $n$ samples. Recent works [1, 9] show that sampling points with replacement according to their ridge leverage scores (RLS) generates small dictionaries of relevant points with strong spectral approximation guarantees for $K_n$. The drawback of RLS-based methods is that computing exact RLS requires constructing and storing the whole kernel matrix. In this paper, we introduce SQUEAK, a new algorithm for kernel approximation based on RLS sampling that sequentially processes the dataset, storing a dictionary which creates accurate kernel matrix approximations with a number of points that only depends on the effective dimension $d_{\text{eff}}(\gamma)$ of the dataset. Moreover, since all the RLS estimations are efficiently performed using only the small dictionary, SQUEAK never constructs the whole matrix $K_n$, runs in linear time $\widetilde{O}(n\, d_{\text{eff}}(\gamma)^3)$ w.r.t. $n$, and requires only a single pass over the dataset. We also propose a parallel and distributed version of SQUEAK achieving similar accuracy in as little as $\widetilde{O}(\log(n)\, d_{\text{eff}}(\gamma)^3)$ time.
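To make the quantities in the abstract concrete, the following is a minimal sketch of exact RLS sampling, the baseline whose cost the abstract describes. It is not SQUEAK: it builds the full kernel matrix, which is exactly what SQUEAK is said to avoid. It assumes the standard definition of the ridge leverage score of point i as the i-th diagonal entry of K_n (K_n + gamma I)^{-1}, with the effective dimension being the sum of these scores. The Gaussian kernel, its bandwidth, the function names, and the sampling budget are illustrative assumptions that do not come from the record above.

```python
import numpy as np


def rbf_kernel(X, bandwidth=1.0):
    """Gaussian kernel matrix (assumed kernel; the abstract does not fix one)."""
    sq_norms = np.sum(X**2, axis=1)
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * (X @ X.T)
    # Clamp tiny negative distances caused by floating-point round-off.
    return np.exp(-np.maximum(sq_dists, 0.0) / (2.0 * bandwidth**2))


def exact_ridge_leverage_scores(K, gamma):
    """Exact RLS: diagonal of K (K + gamma I)^{-1}.

    Needs the full n x n matrix, so it costs at least O(n^2) space.
    """
    n = K.shape[0]
    # (K + gamma I)^{-1} K has the same diagonal as K (K + gamma I)^{-1},
    # because K and (K + gamma I)^{-1} commute.
    M = np.linalg.solve(K + gamma * np.eye(n), K)
    return np.diag(M).copy()


def sample_dictionary(scores, budget, rng):
    """Draw `budget` indices with replacement, proportionally to the scores."""
    scores = np.clip(scores, 0.0, None)
    probs = scores / scores.sum()
    return rng.choice(len(scores), size=budget, replace=True, p=probs)


# Illustrative usage on synthetic data (all values are assumptions).
rng = np.random.default_rng(0)
X = rng.standard_normal((200, 3))
K = rbf_kernel(X, bandwidth=1.0)
tau = exact_ridge_leverage_scores(K, gamma=1.0)
d_eff = tau.sum()  # effective dimension d_eff(gamma)
dictionary = sample_dictionary(tau, budget=40, rng=rng)
```

In this sketch the budget is a free parameter; the abstract states only that the dictionary size SQUEAK needs depends on the effective dimension, without giving the constants.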
Pages: 1421-1429
Number of pages: 9