Data Summarization and Distributed Computation

被引:0
|
作者
Cormode, Graham [1 ]
机构
[1] Univ Warwick, Comp Sci, Data Management Privacy & Big Data Anal, Warwick, England
基金
欧洲研究理事会;
关键词
D O I
10.1145/3212734.3212795
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The notion of summarization is to provide a compact representation of data which approximately captures its essential characteristics. If such summaries can be created, they can lead to efficient distributed algorithms which exchange summaries in order to compute a desired function. In this talk, I'll describe recent efforts in this direction for problems inspired by machine learning: building graphical models over evolving, distributed training examples, and solving robust regression problems over large, distributed data sets.
引用
收藏
页码:167 / 168
页数:2
相关论文
共 50 条
  • [1] Summarization for Geographically Distributed Data Streams
    Ciampi, Anna
    Appice, Annalisa
    Malerba, Donato
    [J]. KNOWLEDGE-BASED AND INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT III, 2010, 6278 : 339 - 348
  • [2] Verifiable Local Computation on Distributed Data
    Zhang, Liang Feng
    Safavi-Naini, Reihaneh
    Liu, Xiao Wei
    [J]. SCC'14: PROCEEDINGS OF THE 2ND INTERNATIONAL WORKSHOP ON SECURITY IN CLOUD COMPUTING, 2014, : 3 - 10
  • [3] Incremental and accurate computation of machine learning models with smart data summarization
    Al-Amin, Sikder Tahsin
    Ordonez, Carlos
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2022, 59 (01) : 149 - 172
  • [4] Incremental and accurate computation of machine learning models with smart data summarization
    Sikder Tahsin Al-Amin
    Carlos Ordonez
    [J]. Journal of Intelligent Information Systems, 2022, 59 : 149 - 172
  • [5] Differentially Private Distributed Data Summarization under Covariate Shift
    Sarpatwar, Kanthi K.
    Shanmugam, Karthikeyan
    Ganapavarapu, Venkata Sitaramagiridharganesh
    Jagmohan, Ashish
    Vaculin, Roman
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [6] Fuzzy logic and the internet: Lingustic summarization of distributed sets of data
    Kacprzyk, Janusz
    [J]. COMPUTATIONAL INTELLIGENCE: THEORY AND APPLICATIONS, PROCEEDINGS, 2001, 2206 : 40 - 42
  • [7] Fast Distributed Submodular Cover: Public-Private Data Summarization
    Mirzasoleiman, Baharan
    Zadimoghaddam, Morteza
    Karbasi, Amin
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [8] Ryoan: A Distributed Sandbox for Untrusted Computation on Secret Data
    Hunt, Tyler
    Zhu, Zhiting
    Xu, Yuanzhong
    Peter, Simon
    Witchel, Emmett
    [J]. ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2018, 35 (04):
  • [9] Probabilistic Skyline Computation on Vertically Distributed Uncertain Data
    Zhang, Kaiqi
    Wang, Jinbao
    Wang, Muxian
    Han, Xixian
    [J]. 2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019), 2019, : 154 - 163
  • [10] Ryoan: A Distributed Sandbox for Untrusted Computation on Secret Data
    Hunt, Tyler
    Zhu, Zhiting
    Xu, Yuanzhong
    Peter, Simon
    Witchel, Emmett
    [J]. PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, 2016, : 533 - 549