A Scalable Parallel Algorithm for Balanced Sampling

被引:0
|
作者
Lee, Alexander [1 ]
Walzer-Goldfeld, Stefan [1 ]
Zablah, Shukry [2 ]
Riondato, Matteo [1 ]
机构
[1] Amherst Coll, Dept Comp Sci, Box 2232, Amherst, MA 01002 USA
[2] Pallet Labs Inc, Boulder, CO USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel parallel algorithm for drawing balanced samples from large populations. When auxiliary variables about the population units are known, balanced sampling improves the quality of the estimations obtained from the sample. Available algorithms, e.g., the cube method, are inherently sequential, and do not scale to large populations. Our parallel algorithm is based on a variant of the cube method for stratified populations. It has the same sample quality as sequential algorithms, and almost ideal parallel speedup.
引用
收藏
页码:12991 / 12992
页数:2
相关论文
共 50 条
  • [1] A BALANCED SCALABLE PARALLEL PROCESSOR
    PILPEL, S
    [J]. VLSI SYSTEMS DESIGN, 1987, 8 (03): : 80 - &
  • [2] A fast algorithm for balanced sampling
    Chauvet, G
    Tillé, Y
    [J]. COMPUTATIONAL STATISTICS, 2006, 21 (01) : 53 - 62
  • [3] A fast algorithm for balanced sampling
    Guillaume Chauvet
    Yves Tillé
    [J]. Computational Statistics, 2006, 21 : 53 - 62
  • [4] A scalable parallel deduplication algorithm
    Santos, Walter
    Teixeira, Thiago
    Machado, Carla
    Meira, Wagner, Jr.
    Da Silva, Altigran S.
    Ferreira, Renato
    Guedes, Dorgival
    [J]. 19TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING, PROCEEDINGS, 2007, : 79 - +
  • [5] Balanced parallel triangle enumeration with an adaptive algorithm
    Farouzi, Abir
    Zhou, Xiantian
    Bellatreche, Ladjel
    Malki, Mimoun
    Ordonez, Carlos
    [J]. DISTRIBUTED AND PARALLEL DATABASES, 2024, 42 (01) : 103 - 141
  • [6] Balanced parallel triangle enumeration with an adaptive algorithm
    Abir Farouzi
    Xiantian Zhou
    Ladjel Bellatreche
    Mimoun Malki
    Carlos Ordonez
    [J]. Distributed and Parallel Databases, 2024, 42 : 103 - 141
  • [7] A scalable parallel algorithm for the extraction of active contours
    Wakatani, A
    [J]. INTERNATIONAL CONFERENCE ON PARALLEL COMPUTING IN ELECTRICAL ENGINEERING - PARELEC 2000, PROCEEDINGS, 2000, : 94 - 98
  • [8] A scalable parallel algorithm for building web directories
    Seshadri, Karthick
    Maruthappan, Aswin
    Sundar Raman, Mukunthapriya
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (09):
  • [9] A scalable parallel algorithm for incomplete factor preconditioning
    Hysom, D
    Pothen, A
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2001, 22 (06): : 2194 - 2215
  • [10] A scalable parallel HITS algorithm for page ranking
    Bennett, Matthew
    Stone, Julie
    Zhang, Chaoyang
    [J]. FIRST INTERNATIONAL MULTI-SYMPOSIUMS ON COMPUTER AND COMPUTATIONAL SCIENCES (IMSCCS 2006), PROCEEDINGS, VOL 1, 2006, : 437 - +