Evolving scientific discovery by unifying data and background knowledge with AI Hilbert

被引:0
|
作者
Cory-Wright, Ryan [1 ]
Cornelio, Cristina [2 ]
Dash, Sanjeeb [3 ]
El Khadir, Bachir [3 ]
Horesh, Lior [3 ]
机构
[1] Imperial Coll London, Dept Analyt Mkt & Operat, Business Sch, London, England
[2] Samsung AI, Cambridge, England
[3] IBM Thomas J Watson Res Ctr, Yorktown Hts, NY USA
关键词
INTERIOR-POINT METHODS; OPTIMIZATION; SQUARES; SUM; CONVERGENCE; POLYNOMIALS; REGRESSION; ALGORITHM; SELECTION; LAWS;
D O I
10.1038/s41467-024-50074-w
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The discovery of scientific formulae that parsimoniously explain natural phenomena and align with existing background theory is a key goal in science. Historically, scientists have derived natural laws by manipulating equations based on existing knowledge, forming new equations, and verifying them experimentally. However, this does not include experimental data within the discovery process, which may be inefficient. We propose a solution to this problem when all axioms and scientific laws are expressible as polynomials and argue our approach is widely applicable. We model notions of minimal complexity using binary variables and logical constraints, solve polynomial optimization problems via mixed-integer linear or semidefinite optimization, and prove the validity of our scientific discoveries in a principled manner using Positivstellensatz certificates. We demonstrate that some famous scientific laws, including Kepler's Law of Planetary Motion and the Radiated Gravitational Wave Power equation, can be derived in a principled manner from axioms and experimental data. Scientific discovery is a highly relevant task in natural sciences, however generating scientifically meaningful laws and determining their consistency remains challenging. The authors introduce an approach that exploits both experimental data and underlying theory in symbolic form to generate formulas that hold scientific significance by solving polynomial optimization problems.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Knowledge discovery in scientific data
    Rudolph, S
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY: THEORY, TOOLS, AND TECHNOLOGY II, 2000, 4057 : 250 - 258
  • [2] Knowledge discovery process for scientific and engineering data
    Barrios, LJ
    Rudolph, S
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY: THEORY, TOOLS AND TECHNOLOGY IV, 2002, 4730 : 118 - 125
  • [3] Evolving scientific knowledge
    Paetz, J
    [J]. COMPUTATIONAL INTELLIGENCE, THEORY AND APPLICATIONS, 2005, : 733 - 738
  • [4] Evolving scenario of big data and Artificial Intelligence (AI) in drug discovery
    Tripathi, Manish Kumar
    Nath, Abhigyan
    Singh, Tej P.
    Ethayathulla, A. S.
    Kaur, Punit
    [J]. MOLECULAR DIVERSITY, 2021, 25 (03) : 1439 - 1460
  • [5] Evolving scenario of big data and Artificial Intelligence (AI) in drug discovery
    Manish Kumar Tripathi
    Abhigyan Nath
    Tej P. Singh
    A. S. Ethayathulla
    Punit Kaur
    [J]. Molecular Diversity, 2021, 25 : 1439 - 1460
  • [6] Data-intensive architecture for scientific knowledge discovery
    Malcolm Atkinson
    Chee Sun Liew
    Michelle Galea
    Paul Martin
    Amrey Krause
    Adrian Mouat
    Oscar Corcho
    David Snelling
    [J]. Distributed and Parallel Databases, 2012, 30 : 307 - 324
  • [7] Data-intensive architecture for scientific knowledge discovery
    Atkinson, Malcolm
    Liew, Chee Sun
    Galea, Michelle
    Martin, Paul
    Krause, Amrey
    Mouat, Adrian
    Corcho, Oscar
    Snelling, David
    [J]. DISTRIBUTED AND PARALLEL DATABASES, 2012, 30 (5-6) : 307 - 324
  • [8] Unifying various knowledge discovery systems in logic of discovery
    Kikuchi, T
    Yamamoto, A
    [J]. INFORMATION MODELLING AND KNOWLEDGE BASES XIV, 2003, 94 : 118 - 127
  • [9] Combining data and theory for derivable scientific discovery with AI-Descartes
    Cornelio, Cristina
    Dash, Sanjeeb
    Austel, Vernon
    Josephson, Tyler R.
    Goncalves, Joao
    Clarkson, Kenneth L.
    Megiddo, Nimrod
    El Khadir, Bachir
    Horesh, Lior
    [J]. NATURE COMMUNICATIONS, 2023, 14 (01)
  • [10] Combining data and theory for derivable scientific discovery with AI-Descartes
    Cristina Cornelio
    Sanjeeb Dash
    Vernon Austel
    Tyler R. Josephson
    Joao Goncalves
    Kenneth L. Clarkson
    Nimrod Megiddo
    Bachir El Khadir
    Lior Horesh
    [J]. Nature Communications, 14