Communication-Efficient Computation on Distributed Noisy Datasets

Cited by: 5
Authors
Zhang, Qin [1]
Affiliations
[1] Indiana Univ, Bloomington, IN 47405 USA
Funding
National Science Foundation (USA);
Keywords
ALGORITHMS; MODEL;
DOI
10.1145/2755573.2755575
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Subject classification code
0812 ;
Abstract
This paper makes a first attempt to answer the following general question: Given a set of machines connected by a point-to-point communication network, each holding a noisy dataset, how can we perform communication-efficient statistical estimations on the union of these datasets? Here 'noisy' means that a real-world entity may appear in different forms in different datasets, but those variants should be considered the same universe element when performing statistical estimations. We give a first set of communication-efficient solutions for statistical estimations on distributed noisy datasets, including algorithms for distinct elements, L0-sampling, heavy hitters, frequency moments and empirical entropy.
Pages: 313 - 322
Number of pages: 10
Related papers
50 in total
  • [1] Communication-Efficient Distributed Skyline Computation
    Zhang, Haoyu
    Zhang, Qin
    [J]. CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 437 - 446
  • [2] Communication-efficient Conformal Prediction for Distributed Datasets
    Riquelme-Granada, Nery
    Luo, Zhiyuan
    Khuong An Nguyen
    [J]. CONFORMAL AND PROBABILISTIC PREDICTION WITH APPLICATIONS, VOL 179, 2022, 179
  • [3] Communication-Efficient Secure Distributed Estimation With Noisy Measurement Against FDI Attack
    Zhang, Zhanxi
    Jia, Lijuan
    Peng, Senran
    Yang, Zi-Jiang
    Tao, Ran
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1214 - 1218
  • [4] Communication-Efficient Collaborative Learning of Geo-Distributed JointCloud from Heterogeneous Datasets
    Li, Xiaoli
    Liu, Nan
    Chen, Chuan
    Zheng, Zibin
    Li, Huizhong
    Yan, Qiang
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON JOINT CLOUD COMPUTING (JCC 2020), 2020, : 22 - 29
  • [5] FAST AND COMMUNICATION-EFFICIENT DISTRIBUTED PCA
    Gang, Arpita
    Raja, Haroon
    Bajwa, Waheed U.
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7450 - 7454
  • [6] Communication-Efficient Distributed Eigenspace Estimation
    Charisopoulos, Vasileios
    Benson, Austin R.
    Damle, Anil
    [J]. SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2021, 3 (04) : 1067 - 1092
  • [7] Communication-efficient distributed oblivious transfer
    Beimel, Amos
    Chee, Yeow Meng
    Wang, Huaxiong
    Zhang, Liang Feng
    [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2012, 78 (04) : 1142 - 1157
  • [8] Communication-Efficient Distributed Learning: An Overview
    Cao, Xuanyu
    Basar, Tamer
    Diggavi, Suhas
    Eldar, Yonina C.
    Letaief, Khaled B.
    Poor, H. Vincent
    Zhang, Junshan
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2023, 41 (04) : 851 - 873
  • [9] Communication-Efficient Distributed Statistical Inference
    Jordan, Michael I.
    Lee, Jason D.
    Yang, Yun
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2019, 114 (526) : 668 - 681
  • [10] Communication-efficient Distributed SGD with Sketching
    Ivkin, Nikita
    Rothchild, Daniel
    Ullah, Enayat
    Braverman, Vladimir
    Stoica, Ion
    Arora, Raman
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32