Distributed Adaptive Sampling for Kernel Matrix Approximation

Cited by: 0

Authors:

Calandriello, Daniele [1]

Lazaric, Alessandro [1]

Valko, Michal [1]

Affiliations:

[1] INRIA Lille Nord Europe, SequeL Team, Villeneuve d'Ascq, France

Source:

ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 54 | 2017 / Volume 54

Keywords:

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Most kernel-based methods, such as kernel regression, kernel PCA, ICA, or k-means clustering, do not scale to large datasets, because constructing and storing the kernel matrix $K_n$ requires at least $O(n^2)$ time and space for $n$ samples. Recent works [1, 9] show that sampling points with replacement according to their ridge leverage scores (RLS) generates small dictionaries of relevant points with strong spectral approximation guarantees for $K_n$. The drawback of RLS-based methods is that computing exact RLS requires constructing and storing the whole kernel matrix. In this paper, we introduce SQUEAK, a new algorithm for kernel approximation based on RLS sampling that sequentially processes the dataset, storing a dictionary which creates accurate kernel matrix approximations with a number of points that only depends on the effective dimension $d_{\text{eff}}(\gamma)$ of the dataset. Moreover, since all the RLS estimations are efficiently performed using only the small dictionary, SQUEAK never constructs the whole matrix $K_n$, runs in linear time $\widetilde{O}(n \, d_{\text{eff}}(\gamma)^3)$ w.r.t. $n$, and requires only a single pass over the dataset. We also propose a parallel and distributed version of SQUEAK achieving similar accuracy in as little as $\widetilde{O}(\log(n) \, d_{\text{eff}}(\gamma)^3)$ time.
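To make the quantities in the abstract concrete, the sketch below computes exact ridge leverage scores, the effective dimension, and a sampled dictionary on a small synthetic dataset. It is not SQUEAK: it builds the full kernel matrix, which is exactly the quadratic-cost step SQUEAK is designed to avoid. It assumes the standard definitions $\tau_i(\gamma) = [K_n (K_n + \gamma I)^{-1}]_{ii}$ and $d_{\text{eff}}(\gamma) = \sum_i \tau_i(\gamma)$, a Gaussian kernel, and a plain (unweighted, lightly regularized) Nyström reconstruction. All function names, the `jitter` parameter, and the data are hypothetical and chosen only for illustration.

```python
import numpy as np


def gaussian_kernel(X, Y, sigma=1.0):
    """Gaussian (RBF) kernel matrix between the rows of X and the rows of Y."""
    sq_dists = (
        np.sum(X**2, axis=1)[:, None]
        + np.sum(Y**2, axis=1)[None, :]
        - 2.0 * X @ Y.T
    )
    # Clip tiny negative values caused by floating-point cancellation.
    return np.exp(-np.maximum(sq_dists, 0.0) / (2.0 * sigma**2))


def exact_ridge_leverage_scores(K, gamma):
    """Exact RLS tau_i = [K (K + gamma I)^{-1}]_ii; needs the full n x n matrix."""
    n = K.shape[0]
    # K and (K + gamma I)^{-1} commute, so solving (K + gamma I) Z = K
    # gives Z = K (K + gamma I)^{-1}.
    Z = np.linalg.solve(K + gamma * np.eye(n), K)
    return np.diag(Z).copy()


def sample_dictionary(tau, budget, rng):
    """Draw `budget` indices with replacement, proportionally to the RLS."""
    probs = tau / tau.sum()
    return rng.choice(len(tau), size=budget, replace=True, p=probs)


def nystrom_approximation(K, idx, jitter=1e-8):
    """Plain Nystrom reconstruction from the unique sampled columns.

    Assumption: a small `jitter` is added to the dictionary block for
    numerical stability; this is a simplification, not the paper's estimator.
    """
    idx = np.unique(idx)
    C = K[:, idx]
    W = K[np.ix_(idx, idx)]
    return C @ np.linalg.solve(W + jitter * np.eye(len(idx)), C.T)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(200, 3))          # synthetic data, illustration only
    K = gaussian_kernel(X, X, sigma=2.0)

    gamma = 1.0
    tau = exact_ridge_leverage_scores(K, gamma)
    d_eff = tau.sum()                       # effective dimension d_eff(gamma)

    idx = sample_dictionary(tau, budget=40, rng=rng)
    K_hat = nystrom_approximation(K, idx)
    rel_err = np.linalg.norm(K - K_hat) / np.linalg.norm(K)

    print(f"n = {K.shape[0]}, d_eff(gamma) = {d_eff:.2f}")
    print(f"dictionary size = {len(np.unique(idx))}, relative error = {rel_err:.3e}")
```

The point of the sketch is the dependency it exposes: `exact_ridge_leverage_scores` needs all of `K`, so its cost grows at least quadratically in `n`. An estimator that works only from a small dictionary, as the abstract describes, removes that requirement.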

Pages: 1421-1429

Page count: 9
