Reinforcement Learning of Theorem Proving

被引:0
|
作者
Kaliszyk, Cezary [1 ]
Urban, Josef [2 ]
Michalewski, Henryk [3 ]
Olsak, Mirek [4 ]
机构
[1] Univ Innsbruck, Innsbruck, Austria
[2] Czech Tech Univ, Prague, Czech Republic
[3] Univ Warsaw, Inst Math, Polish Acad Sci, Warsaw, Poland
[4] Charles Univ Prague, Prague, Czech Republic
基金
欧洲研究理事会;
关键词
GAME; GO;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a theorem proving algorithm that uses practically no domain heuristics for guiding its connection-style proof search. Instead, it runs many Monte-Carlo simulations guided by reinforcement learning from previous proof attempts. We produce several versions of the prover, parameterized by different learning and guiding algorithms. The strongest version of the system is trained on a large corpus of mathematical problems and evaluated on previously unseen problems. The trained system solves within the same number of inferences over 40% more problems than a baseline prover, which is an unusually high improvement in this hard AI domain. To our knowledge this is the first time reinforcement learning has been convincingly applied to solving general mathematical problems on a large scale.
引用
收藏
页码:8836 / 8847
页数:12
相关论文
共 50 条
  • [31] Calculation by Tactic in Theorem Proving
    Li, Bing
    Zhang, Jian
    Su, Wei
    Li, Lian
    2012 WORLD AUTOMATION CONGRESS (WAC), 2012,
  • [32] Logical errors on proving theorem
    Sari, C. K.
    Waluyo, M.
    Ainur, C. M.
    Darmaningsih, E. N.
    1ST INTERNATIONAL CONFERENCE OF EDUCATION ON SCIENCES, TECHNOLOGY, ENGINEERING, AND MATHEMATICS (ICE-STEM), 2018, 948
  • [33] CONCEPT OF DEMODULATION IN THEOREM PROVING
    WOS, L
    ROBINSON, GA
    CARSON, DF
    SHALLA, L
    JOURNAL OF THE ACM, 1967, 14 (04) : 698 - &
  • [34] MODAL THEOREM-PROVING
    ABADI, M
    MANNA, Z
    LECTURE NOTES IN COMPUTER SCIENCE, 1986, 230 : 172 - 189
  • [35] Theorem proving for intensional logic
    1600, Kluwer Academic Publishers, Dordrecht, Netherlands (14):
  • [36] Automating Theorem Proving with SMT
    Leino, K. Rustan M.
    INTERACTIVE THEOREM PROVING, ITP 2013, 2013, 7998 : 2 - 16
  • [37] Mechanical theorem proving in geometry
    Jun-Yu, Gao
    Cheng-Dong, Zhang
    Telkomnika - Indonesian Journal of Electrical Engineering, 2012, 10 (07): : 1554 - 1559
  • [38] Theorem proving update - Response
    Sergei, S
    DR DOBBS JOURNAL, 1998, 23 (10): : 14 - 14
  • [39] COMPUTER THEOREM PROVING AND HOTT
    Leslie-Hurd, Joe
    Haworth, G. Mc C.
    ICGA JOURNAL, 2013, 36 (02) : 100 - 103
  • [40] ON AUTOMATED THEOREM-PROVING
    RUSSELL, S
    WHEELER, T
    ANNALS OF THE NEW YORK ACADEMY OF SCIENCES, 1992, 661 : 160 - 173