Multi-Objective Weighted Sampling

被引:7
|
作者
Cohen, Edith [1 ]
机构
[1] Google Res, Mountain View, CA 94043 USA
关键词
ASYMPTOTIC THEORY;
D O I
10.1109/HotWeb.2015.8
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Key value data sets of the form {(x, w(x))} where w(x) > 0 are prevalent. Common queries over such data are segment f-statistics Q(f, H) = Sigma(x is an element of H) f(w(x)), specified for a segment H of the keys and a function f. Different choices of f correspond to count, sum, moments, capping, and threshold statistics. When the data set is large, we can compute a smaller sample from which we can quickly estimate statistics. A weighted sample of keys taken with respect to f(w(x)) provides estimates with statistically guaranteed quality for f-statistics. Such a sample S-(f) can be used to estimate g-statistics for g not equal f, but quality degrades with the disparity between g and f. In this paper we address applications that require quality estimates for a set F of different functions. A naive solution is to compute and work with a different sample S-(f) for each f is an element of F. Instead, this can be achieved more effectively and seamlessly using a single multi-objective sample S-(F) of a much smaller size. We review multi-objective sampling schemes and place them in our context of estimating f-statistics. We show that a multi-objective sample for F provides quality estimates for any f that is a positive linear combination of functions from F. We then establish a surprising and powerful result when the target set M is all monotone non-decreasing functions, noting that M includes most natural statistics. We provide efficient multi-objective sampling algorithms for M and show that a sample size of k ln n (where n is the number of active keys) provides the same estimation quality, for any f is an element of M, as a dedicated weighted sample of size k for f.
引用
收藏
页码:13 / 18
页数:6
相关论文
共 50 条
  • [21] The weighted sum method for multi-objective optimization: new insights
    R. Timothy Marler
    Jasbir S. Arora
    [J]. Structural and Multidisciplinary Optimization, 2010, 41 : 853 - 862
  • [22] Weighted Linear Fractional Programming for Possibilistic Multi-objective Problem
    Arbaiy, Nureize
    [J]. COMPUTATIONAL INTELLIGENCE IN INFORMATION SYSTEMS, 2015, 331 : 85 - 94
  • [23] Multi-objective optimisation for optimum allocation in multivariate stratified sampling
    Diaz-Garcia, Jose A.
    Ulloa Cortez, Liliana
    [J]. SURVEY METHODOLOGY, 2008, 34 (02) : 215 - 222
  • [24] An Investigation on Sampling Technique for Multi-objective Restricted Boltzmann Machine
    Shim, Vui Ann
    Tan, Kay Chen
    Chia, Jun Yong
    [J]. 2010 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2010,
  • [25] Effects of torsional sampling bias on a multi-objective genetic algorithm
    Clark, Robert D.
    Dorfman, Roman J.
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2010, 239
  • [26] A Latin hypercube sampling based multi-objective evolutionary algorithm
    Zheng, Jin-Hua
    Luo, Biao
    [J]. Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2009, 22 (02): : 223 - 233
  • [27] Elite Accumulative Sampling Strategies for Noisy Multi-objective Optimisation
    Fieldsend, Jonathan E.
    [J]. EVOLUTIONARY MULTI-CRITERION OPTIMIZATION, PT II, 2015, 9019 : 172 - 186
  • [28] Multi-Objective Optimization of Sampling Algorithms Pipeline for Unbalanced Problems
    Miranda, Pericles B. C.
    Mello, Rafael Ferreira
    Nascimento, Andre C. A.
    Si, Tapas
    [J]. 2022 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2022,
  • [29] MORRF* : Sampling-Based Multi-Objective Motion Planning
    Yi, Daqing
    Goodrich, Michael A.
    Seppi, Kevin D.
    [J]. PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 1733 - 1739
  • [30] Multi-Objective Configuration Sampling for Performance Ranking in Configurable Systems
    Gu, Yongfeng
    Chen, Yuntianyi
    Jia, Xiangyang
    Xuan, Jifeng
    [J]. 2019 26TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC), 2019, : 150 - 157