Information Bottleneck Methods for Distributed Learning

被引:0
|
作者
Farajiparvar, Parinaz [1 ]
Beirami, Ahmad [2 ]
Nokleby, Matthew [1 ]
机构
[1] Wayne State Univ, Dept Elect & Comp Engn, Detroit, MI 48202 USA
[2] MIT, Elect Res Lab, Cambridge, MA 02139 USA
关键词
Machine Learning; Rate-distortion function; Information Bottleneck; Distributed Learning; Streaming Data;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We study a distributed learning problem in which Alice sends a compressed distillation of a set of training data to Bob, who uses the distilled version to best solve an associated learning problem. We formalize this as a rate-distortion problem in which the training set is the source and Bob's cross-entropy loss is the distortion measure. We consider this problem for unsupervised learning for batch and sequential data. In the batch data, this problem is equivalent to the information bottleneck (IB), and we show that reduced-complexity versions of standard IB methods solve the associated rate-distortion problem. For the streaming data, we present a new algorithm, which may be of independent interest, that solves the rate-distortion problem for Gaussian sources. Furthermore, to improve the results of the iterative algorithm for sequential data we introduce a two-pass version of this algorithm. Finally, we show the dependency of the rate on the number of samples k required for Gaussian sources to ensure cross-entropy loss that scales optimally with the growth of the training set.
引用
收藏
页码:24 / 31
页数:8
相关论文
共 50 条
  • [41] Deep representation learning for domain generalization with information bottleneck principle
    Zhang, Jiao
    Zhang, Xu-Yao
    Wang, Chuang
    Liu, Cheng-Lin
    [J]. PATTERN RECOGNITION, 2023, 143
  • [42] Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck
    Igl, Maximilian
    Ciosek, Kamil
    Li, Yingzhen
    Tschiatschek, Sebastian
    Zhang, Cheng
    Devlin, Sam
    Hofmann, Katja
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [43] Multi-View Information-Bottleneck Representation Learning
    Wan, Zhibin
    Zhang, Changqing
    Zhu, Pengfei
    Hu, Qinghua
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 10085 - 10092
  • [44] Reduced-Complexity Optimization of Distributed Quantization Using the Information Bottleneck Principle
    Steiner, Steffen
    Kuehn, Volker
    Stark, Maximilian
    Bauch, Gerhard
    [J]. IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2021, 2 : 1267 - 1278
  • [45] Multi-Source Distributed Data Compression Based on Information Bottleneck Principle
    Hassanpour, Shayan
    Danaee, Alireza
    Wuebben, Dirk
    Dekorsy, Armin
    [J]. IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2024, 5 : 4171 - 4185
  • [46] Learning Foreground Information Bottleneck for few-shot semantic segmentation
    Hu, Yutao
    Huang, Xin
    Luo, Xiaoyan
    Han, Jungong
    Cao, Xianbin
    Zhang, Jun
    [J]. PATTERN RECOGNITION, 2024, 146
  • [47] ToxIBTL: prediction of peptide toxicity based on information bottleneck and transfer learning
    Wei, Lesong
    Ye, Xiucai
    Sakurai, Tetsuya
    Mu, Zengchao
    Wei, Leyi
    [J]. BIOINFORMATICS, 2022, 38 (06) : 1514 - 1524
  • [48] Information bottleneck and selective noise supervision for zero-shot learning
    Zhou, Lei
    Liu, Yang
    Zhang, Pengcheng
    Bai, Xiao
    Gu, Lin
    Zhou, Jun
    Yao, Yazhou
    Harada, Tatsuya
    Zheng, Jin
    Hancock, Edwin
    [J]. MACHINE LEARNING, 2023, 112 (07) : 2239 - 2261
  • [49] GOAL-ORIENTED COMMUNICATION FOR EDGE LEARNING BASED ON THE INFORMATION BOTTLENECK
    Pezone, Francesco
    Barbarossa, Sergio
    Di Lorenzo, Paolo
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8832 - 8836
  • [50] An inherent bottleneck in distributed counting
    Wattenhofer, R
    Widmayer, P
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1998, 49 (01) : 135 - 145