Information theoretic perspective on sample complexity

被引:3
|
作者
Pereg, Deborah [1 ,2 ,3 ,4 ,5 ]
机构
[1] MGH, Wellman Ctr Photomed, Boston, MA USA
[2] Harvard Med Sch, Sch Med, Boston, MA USA
[3] MIT CSAIL, Cambridge, MA USA
[4] MIT MechE, Cambridge, MA 02139 USA
[5] Harvard Sch Engn & Appl Sci, Cambridge, MA 02138 USA
关键词
Information theory; Supervised learning; Sample complexity; Generalization;
D O I
10.1016/j.neunet.2023.08.032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The statistical supervised learning framework assumes an input-output set with a joint probability distribution that is reliably represented by the training dataset. The learning system is then required to output a prediction rule learned from the training dataset's input-output pairs. In this work, we investigate the relationship between the sample complexity, the empirical risk and the generalization error based on the asymptotic equipartition property (AEP) (Shannon, 1948). We provide theoretical guarantees for reliable learning under the information-theoretic AEP, with respect to the generalization error and the sample size in different settings.(c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页码:445 / 449
页数:5
相关论文
共 50 条
  • [1] INFORMATION-THEORETIC COMPUTATIONAL COMPLEXITY
    CHAITIN, GJ
    IEEE TRANSACTIONS ON INFORMATION THEORY, 1974, 20 (01) : 10 - 15
  • [2] Autonomy: An information theoretic perspective
    Bertschinger, Nils
    Olbrich, Eckehard
    Ay, Nihat
    Jost, Juergen
    BIOSYSTEMS, 2008, 91 (02) : 331 - 345
  • [3] Information theoretic complexity affects multisensory perception
    Ellis, Cameron T.
    Turk-Browne, Nicholas B.
    VISUAL COGNITION, 2015, 23 (07) : 825 - 829
  • [4] An information theoretic tradeoff between complexity and accuracy
    Gilad-Bachrach, R
    Navot, A
    Tishby, N
    LEARNING THEORY AND KERNEL MACHINES, 2003, 2777 : 595 - 609
  • [5] INFORMATION-THEORETIC COMPLEXITY OF PROGRAM SPECIFICATIONS
    COULTER, NS
    COOPER, RB
    SOLOMON, MK
    COMPUTER JOURNAL, 1987, 30 (03): : 223 - 227
  • [6] Information processing in dendrites - II. Information theoretic complexity
    Gurney, KN
    NEURAL NETWORKS, 2001, 14 (08) : 1005 - 1022
  • [7] Mean shift: An information theoretic perspective
    Rao, Sudhir
    Martins, Allan de Medeiros
    Principe, Jose C.
    PATTERN RECOGNITION LETTERS, 2009, 30 (03) : 222 - 230
  • [8] An information-theoretic perspective on teleconnections
    Greene, Arthur M.
    GEOPHYSICAL RESEARCH LETTERS, 2013, 40 (19) : 5258 - 5262
  • [9] Adaptive sorting: an information theoretic perspective
    Elmasry, Amr
    Fredman, Michael L.
    ACTA INFORMATICA, 2008, 45 (01) : 33 - 42
  • [10] Information theoretic perspective on genome clustering
    Veluchamy, Alaguraj
    Mehta, Preeti
    Srividhya, K. V.
    Vikram, Hirendra
    Govind, M. K.
    Gupta, Ramneek
    Bin Dukhyil, Abdul Aziz
    Alharbi, Raed Abdullah
    Aloyuni, Saleh Abdullah
    Hassan, Mohamed M.
    Krishnaswamy, S.
    SAUDI JOURNAL OF BIOLOGICAL SCIENCES, 2021, 28 (03) : 1867 - 1889