Reinforcement Learning of Theorem Proving

被引:0
|
作者
Kaliszyk, Cezary [1 ]
Urban, Josef [2 ]
Michalewski, Henryk [3 ]
Olsak, Mirek [4 ]
机构
[1] Univ Innsbruck, Innsbruck, Austria
[2] Czech Tech Univ, Prague, Czech Republic
[3] Univ Warsaw, Inst Math, Polish Acad Sci, Warsaw, Poland
[4] Charles Univ Prague, Prague, Czech Republic
基金
欧洲研究理事会;
关键词
GAME; GO;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a theorem proving algorithm that uses practically no domain heuristics for guiding its connection-style proof search. Instead, it runs many Monte-Carlo simulations guided by reinforcement learning from previous proof attempts. We produce several versions of the prover, parameterized by different learning and guiding algorithms. The strongest version of the system is trained on a large corpus of mathematical problems and evaluated on previously unseen problems. The trained system solves within the same number of inferences over 40% more problems than a baseline prover, which is an unusually high improvement in this hard AI domain. To our knowledge this is the first time reinforcement learning has been convincingly applied to solving general mathematical problems on a large scale.
引用
收藏
页码:8836 / 8847
页数:12
相关论文
共 50 条
  • [41] On Interpolation in Automated Theorem Proving
    Maria Paola Bonacina
    Moa Johansson
    Journal of Automated Reasoning, 2015, 54 : 69 - 97
  • [42] Directed automated theorem proving
    Edelkamp, S
    Leven, P
    LOGIC FOR PROGRAMMING, ARTIFICIAL INTELLIGENCE, AND REASONING, 2002, 2514 : 145 - 159
  • [43] THEOREM-PROVING WITH ABSTRACTION
    PLAISTED, DA
    ARTIFICIAL INTELLIGENCE, 1981, 16 (01) : 47 - 108
  • [44] Applications of polytypism in theorem proving
    Slind, K
    Hurd, J
    THEOREM PROVING IN HIGHER ORDER LOGICS, PROCEEDINGS, 2003, 2758 : 103 - 119
  • [45] THEOREM PROVING BY COVERING EXPRESSIONS
    HENSCHEN, LJ
    JOURNAL OF THE ACM, 1979, 26 (03) : 385 - 400
  • [46] RESOLUTION THEOREM-PROVING
    STICKEL, ME
    ANNUAL REVIEW OF COMPUTER SCIENCE, 1988, 3 : 285 - 316
  • [47] THEOREM PROVING IN THE ONTOLOGY LIFECYCLE
    Katsumi, Megan
    Gruninger, Michael
    KEOD 2010: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE ENGINEERING AND ONTOLOGY DEVELOPMENT, 2010, : 37 - 49
  • [48] Modular redundancy for theorem proving
    Bofill, N
    Godoy, G
    Nieuwenhuis, R
    Rubio, A
    FRONTIERS OF COMBINING SYSTEMS, 2000, 1794 : 186 - 199
  • [49] Automated Theorem Proving in the Classroom
    Windsteiger, Wolfgang
    ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2021, (352): : 54 - 63
  • [50] Computer theorem proving in mathematics
    Simpson, C
    LETTERS IN MATHEMATICAL PHYSICS, 2004, 69 (1) : 287 - 315