Cost-based analyses of random neighbor and derived sampling methods

被引:3
|
作者
Novick, Yitzchak [1 ,2 ]
Bar-Noy, Amotz [3 ,4 ]
机构
[1] CUNY, Grad Ctr, Comp Sci Dept, New York, NY 10016 USA
[2] Touro Univ, Comp Sci Dept, New York, NY 10001 USA
[3] CUNY, Brooklyn Coll, Comp Sci Dept, Brooklyn, NY 11210 USA
[4] CUNY, Grad Ctr, Brooklyn, NY 11210 USA
关键词
Fair cost comparison; Random neighbor sampling; High-degree vertex sampling; FRIENDS;
D O I
10.1007/s41109-022-00475-x
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Random neighbor sampling, or RN, is a method for sampling vertices with a mean degree greater than that of the graph. Instead of naively sampling a vertex from a graph and retaining it ('random vertex' or RV), a neighbor of the vertex is selected instead. While considerable research has analyzed various aspects of RN, the extra cost of sampling a second vertex is typically not addressed. This paper explores RN sampling from the perspective of cost. We break down the cost of sampling into two distinct costs, that of sampling a vertex and that of sampling a neighbor of an already sampled vertex, and we also include the cost of actually selecting a vertex/neighbor and retaining it for use rather than discarding it. With these three costs as our cost-model, we explore RN and compare it to RV in a more fair manner than comparisons that have been made in previous research. As we delve into costs, a number of variants to RN are introduced. These variants improve on the cost-effectiveness of RN in regard to particular costs and priorities. Our full cost-benefit analysis highlights strengths and weaknesses of the methods. We particularly focus on how our methods perform for sampling high-degree and low-degree vertices, which further enriches the understanding of the methods and how they can be practically applied. We also suggest 'two-phase' methods that specifically seek to cover both high-degree and low-degree vertices in separate sampling phases.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Cost-based analyses of random neighbor and derived sampling methods
    Yitzchak Novick
    Amotz Bar-Noy
    [J]. Applied Network Science, 7
  • [2] COST-BASED ACCEPTANCE SAMPLING
    CASE, KE
    BENNETT, GK
    SCHMIDT, JW
    [J]. INDUSTRIAL ENGINEERING, 1972, 4 (11): : 26 - &
  • [3] Cost-Based Sampling of Individual Instances
    Klement, William
    Flach, Peter
    Japkowicz, Nathalie
    Matwin, Stan
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, 5549 : 86 - +
  • [4] Beyond Nyquist sampling: a cost-based approach
    Ozcelikkale, Ayca
    Ozaktas, Haldun M.
    [J]. JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2013, 30 (04) : 645 - 655
  • [5] Beyond Nyquist sampling: A cost-based approach
    Özçelikkale, Ayça
    Ozaktas, Haldun M.
    [J]. Journal of the Optical Society of America A: Optics and Image Science, and Vision, 2013, 30 (04): : 645 - 655
  • [6] A Fair-Cost Analysis of the Random Neighbor Sampling Method
    Novick, Yitzchak
    Bar-Noy, Amotz
    [J]. COMPLEX NETWORKS & THEIR APPLICATIONS X, VOL 1, 2022, 1015 : 3 - 15
  • [7] A cost-based heuristic for statistically determining sampling frequency in a wafer fab
    Chien, CF
    Hsu, SC
    Peng, S
    Wu, CH
    [J]. 2000 SEMICONDUCTOR MANUFACTURING TECHNOLOGY WORKSHOP, 2000, : 217 - 229
  • [8] Counting, enumerating, and sampling of execution plans in a cost-based query optimizer
    Waas, F
    Galindo-Legaria, C
    [J]. SIGMOD RECORD, 2000, 29 (02) : 499 - 509
  • [9] CRRP: Cost-Based Replacement with Random Placement for En-Route Caching
    Wang, Sen
    Bi, Jun
    Wu, Jianping
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (07): : 1914 - 1917
  • [10] Elements of cost-based tolerancing
    Youngworth, RN
    Stone, BD
    [J]. OPTICAL REVIEW, 2001, 8 (04) : 276 - 280