Autoformalization with Large Language Models

被引:0
|
作者
Wu, Yuhuai [1 ,2 ]
Jiang, Albert Q. [3 ]
Li, Wenda [3 ]
Rabe, Markus N. [1 ]
Staats, Charles [1 ]
Jamnik, Mateja [3 ]
Szegedy, Christian [1 ]
机构
[1] Google Res, Chicago, IL 60607 USA
[2] Stanford Univ, Stanford, CA 94305 USA
[3] Univ Cambridge, Cambridge CB2 1TN, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Autoformalization is the process of automatically translating from natural language mathematics to formal specifications and proofs. A successful autoformalization system could advance the fields of formal verification, program synthesis, and artificial intelligence. While the long-term goal of autoformalization seemed elusive for a long time, we show large language models provide new prospects towards this goal. We make the surprising observation that LLMs can correctly translate a significant portion (25:.%) of mathematical competition problems perfectly to formal specifications in Isabelle/HOL. We demonstrate the usefulness of this process by improving a previously introduced neural theorem prover via training on these autoformalized theorems. Our methodology results in a new state-of-the-art result on the MiniF2F theorem proving benchmark, improving the proof rate from 29.6% to 35.2%
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Large Language Models are Not Models of Natural Language: They are Corpus Models
    Veres, Csaba
    [J]. IEEE ACCESS, 2022, 10 : 61970 - 61979
  • [2] Large Language Models
    Vargas, Diego Collarana
    Katsamanis, Nassos
    [J]. ERCIM NEWS, 2024, (136): : 12 - 13
  • [3] Large Language Models
    Cerf, Vinton G.
    [J]. COMMUNICATIONS OF THE ACM, 2023, 66 (08) : 7 - 7
  • [4] Large Language Models in der WissenschaftLarge language models in science
    Karl-Friedrich Kowalewski
    Severin Rodler
    [J]. Die Urologie, 2024, 63 (9) : 860 - 866
  • [5] The Importance of Understanding Language in Large Language Models
    Youssef, Alaa
    Stein, Samantha
    Clapp, Justin
    Magnus, David
    [J]. AMERICAN JOURNAL OF BIOETHICS, 2023, 23 (10): : 6 - 7
  • [6] Dissociating language and thought in large language models
    Mahowald, Kyle
    Ivanova, Anna A.
    Blank, Idan A.
    Kanwisher, Nancy
    Tenenbaum, Joshua B.
    Fedorenko, Evelina
    [J]. TRENDS IN COGNITIVE SCIENCES, 2024, 28 (06) : 517 - 540
  • [7] Imitation and Large Language Models
    Boisseau, Éloïse
    [J]. Minds and Machines, 2024, 34 (04)
  • [8] The Smallness of Large Language Models
    Denning, Peter J.
    [J]. COMMUNICATIONS OF THE ACM, 2023, 66 (09) : 24 - 27
  • [9] Large language models in medicine
    Arun James Thirunavukarasu
    Darren Shu Jeng Ting
    Kabilan Elangovan
    Laura Gutierrez
    Ting Fang Tan
    Daniel Shu Wei Ting
    [J]. Nature Medicine, 2023, 29 : 1930 - 1940
  • [10] Large language models in medicine
    Thirunavukarasu, Arun James
    Ting, Darren Shu Jeng
    Elangovan, Kabilan
    Gutierrez, Laura
    Tan, Ting Fang
    Ting, Daniel Shu Wei
    [J]. NATURE MEDICINE, 2023, 29 (08) : 1930 - 1940