Stochastic Thermodynamics of Learning Parametric Probabilistic Models

被引:0
|
作者
Parsi, Shervin S. [1 ,2 ]
机构
[1] CUNY, Grad Ctr, Phys Program, New York, NY 10016 USA
[2] CUNY, Grad Ctr, Initiat Theoret Sci, New York, NY 10016 USA
基金
美国国家卫生研究院;
关键词
parameritic generative models; machine learning; thermodynamics of information; entropy production; information theory;
D O I
10.3390/e26020112
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
We have formulated a family of machine learning problems as the time evolution of parametric probabilistic models (PPMs), inherently rendering a thermodynamic process. Our primary motivation is to leverage the rich toolbox of thermodynamics of information to assess the information-theoretic content of learning a probabilistic model. We first introduce two information-theoretic metrics, memorized information (M-info) and learned information (L-info), which trace the flow of information during the learning process of PPMs. Then, we demonstrate that the accumulation of L-info during the learning process is associated with entropy production, and the parameters serve as a heat reservoir in this process, capturing learned information in the form of M-info.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Stochastic Online Learning with Probabilistic Graph Feedback
    Li, Shuai
    Chen, Wei
    Wen, Zheng
    Leung, Kwong-Sak
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 4675 - 4682
  • [22] STOCHASTIC IMITATION OF INSTRUMENTAL REFLEX AT PROBABILISTIC LEARNING
    SALTYKOV, AB
    SMIRNOV, IV
    STARSHOV, VP
    ZHURNAL VYSSHEI NERVNOI DEYATELNOSTI IMENI I P PAVLOVA, 1989, 39 (05) : 974 - 981
  • [23] Learning to Schedule in Diffusion Probabilistic Models
    Wang, Yunke
    Wang, Xiyu
    Anh-Dung Dinh
    Du, Bo
    Xu, Chang
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 2478 - 2488
  • [24] Probabilistic Models for Supervised Dictionary Learning
    Lian, Xiao-Chen
    Li, Zhiwei
    Wang, Changhu
    Lu, Bao-Liang
    Zhan, Lei
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 2305 - 2312
  • [25] Robust Learning of Tractable Probabilistic Models
    Peddi, Rohith
    Rahman, Tahrima
    Gogate, Vibhav
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 1572 - 1581
  • [26] Learning probabilistic models of link structure
    Getoor, L
    Friedman, N
    Koller, D
    Taskar, B
    JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 679 - 707
  • [27] GENERAL PROBABILISTIC LEARNING MODELS.
    Uppuluri, V.R.R.
    Piziak, R.
    International Journal on Policy and Information, 1984, 8 (01): : 71 - 83
  • [28] Probabilistic models for stochastic elliptic partial differential equations
    Grigoriu, Mircea
    JOURNAL OF COMPUTATIONAL PHYSICS, 2010, 229 (22) : 8406 - 8429
  • [29] A STOCHASTIC MINIMUM DISTANCE TEST FOR MULTIVARIATE PARAMETRIC MODELS
    BERAN, R
    MILLAR, PW
    ANNALS OF STATISTICS, 1989, 17 (01): : 125 - 140
  • [30] Learning Neural Parametric Head Models
    Giebenhain, Simon
    Kirschstein, Tobias
    Georgopoulos, Markos
    Runz, Martin
    Agapito, Lourdes
    Niessner, Matthias
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21003 - 21012