MACHINE LEARNING AND INFORMATION THEORY CONCEPTS TOWARDS AN AI MATHEMATICIAN

被引:0
|
作者
Bengio, Yoshua [1 ,2 ]
Malkin, Nikolay [1 ,2 ]
机构
[1] Univ Montreal, Mila Quebec AI Inst, Montreal, PQ, Canada
[2] Univ Montreal, Dept Informat & Operat Res, Montreal, PQ, Canada
关键词
D O I
10.1090/bull/1839
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
The current state of the art in artificial intelligence is impressive, especially in terms of mastery of language, but not so much in terms of mathematical reasoning. What could be missing? Can we learn something useful about that gap from how the brains of mathematicians go about their craft? This essay builds on the idea that current deep learning mostly succeeds at system 1 abilities-which correspond to our intuition and habitual behaviors-but still lacks something important regarding system 2 abilities-which include reasoning and robust uncertainty estimation. It takes an information-theoretical posture to ask questions about what constitutes an interesting mathematical statement, which could guide future work in crafting an AI mathematician. The focus is not on proving a given theorem but on discovering new and interesting conjectures. The central hypothesis is that a desirable body of theorems better summarizes the set of all provable statements, for example, by having a small description length while at the same time being close (in terms of number of derivation steps) to many provable statements.
引用
收藏
页码:457 / 469
页数:13
相关论文
共 50 条
  • [31] Simplifying AI and machine learning
    Siegel, Eliot
    [J]. APPLIED RADIOLOGY, 2018, 47 (05) : 26 - 28
  • [32] Towards a theory of practice in metaheuristics design: A machine learning perspective
    Birattari, Mauro
    Zlochin, Mark
    Dorigo, Marco
    [J]. RAIRO-THEORETICAL INFORMATICS AND APPLICATIONS, 2006, 40 (02): : 353 - 369
  • [33] Software Testing as A Problem of Machine Learning: Towards a Foundation on Computational Learning Theory
    Zhu, Hong
    [J]. 2018 IEEE/ACM 13TH INTERNATIONAL WORKSHOP ON AUTOMATION OF SOFTWARE TEST (AST), 2018, : 1 - 1
  • [34] Conjugate information systems: Learning cognitive concepts in rough set theory
    Semeniuk-Polkowska, M
    Polkowski, L
    [J]. ROUGH SETS, FUZZY SETS, DATA MINING, AND GRANULAR COMPUTING, 2003, 2639 : 255 - 258
  • [35] AERO-ENGINES AI - A MACHINE-LEARNING APP FOR AIRCRAFT ENGINE CONCEPTS ASSESSMENT
    Tong, Michael T.
    [J]. PROCEEDINGS OF ASME TURBO EXPO 2023: TURBOMACHINERY TECHNICAL CONFERENCE AND EXPOSITION, GT2023, VOL 1, 2023,
  • [36] Viewpoint: Ai as author - bridging the gap between machine learning and literary theory
    Van Heerden I.
    Bas A.
    [J]. Journal of Artificial Intelligence Research, 2021, 71 : 175 - 189
  • [37] Item response theory in AI: Analysing machine learning classifiers at the instance level
    Martinez-Plumed, Fernando
    Prudencio, Ricardo B. C.
    Martinez-Uso, Adolfo
    Hernandez-Orallo, Jose
    [J]. ARTIFICIAL INTELLIGENCE, 2019, 271 : 18 - 42
  • [38] Refining Concepts by Machine Learning
    Mensik, Marek
    Duzi, Marie
    Albert, Adam
    Patschka, Vojtech
    Pajr, Miroslav
    [J]. COMPUTACION Y SISTEMAS, 2019, 23 (03): : 943 - 958
  • [39] Energy use prediction with information theory and machine learning technique
    Tong, Y. W.
    Yang, W. Y.
    Zhan, D. L.
    [J]. 2019 3RD INTERNATIONAL CONFERENCE ON ENERGY AND ENVIRONMENTAL SCIENCE, 2019, 291
  • [40] Compressive Privacy: From Information/Estimation Theory to Machine Learning
    Kung, S. Y.
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (01) : 94 - +