Discovering Mathematical Expressions Through DeepSymNet: A Classification-Based Symbolic Regression Framework

被引:1
|
作者
Wu, Min [1 ,2 ,3 ,4 ]
Li, Weijun [1 ,2 ,3 ,4 ]
Yu, Lina [2 ,3 ,4 ]
Sun, Linjun [1 ,2 ,3 ,4 ]
Liu, Jingyi [1 ,2 ,3 ,4 ]
Li, Wenqiang [1 ,2 ,3 ,4 ]
机构
[1] Chinese Acad Sci, Inst Semiconductors, AnnLab, Beijing 100083, Peoples R China
[2] Univ Chinese Acad Sci, Ctr Mat Sci & Optoelectron Engn, Beijing 100049, Peoples R China
[3] Univ Chinese Acad Sci, Sch Micro Elect, Beijing 100049, Peoples R China
[4] Beijing Key Lab Semicond Neural Network Intellige, Beijing 100083, Peoples R China
基金
中国国家自然科学基金;
关键词
Training; Prediction algorithms; Task analysis; Optimization; Deep learning; Robustness; Classification algorithms; AI for science; deep learning; symbolic network; symbolic regression (SR);
D O I
10.1109/TNNLS.2023.3332400
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Symbolic regression (SR) is the process of finding an unknown mathematical expression given the input and output and has important applications in interpretable machine learning and knowledge discovery. The major difficulty of SR is that finding the expression structure is an NP-hard problem, which makes the entire process time-consuming. In this study, the solution of expression structures was regarded as a classification problem and solved by supervised learning such that SR can be solved quickly by using the solving experience. Techniques for classification tasks, such as equivalent label merging and sample balance, were used to enhance the robustness of the algorithm. We proposed a symbolic network called DeepSymNet to represent symbolic expressions to improve the performance of the algorithm. DeepSymNet has been proven to have a strong representation ability with a shorter label compared to the current popular representation methods, reducing the search space when predicting. Moreover, DeepSymNet conveniently decomposes SR into two smaller subproblems, which makes solving the problem easier. The proposed algorithm was tested on artificially generated expressions and public datasets and compared with other algorithms. The results demonstrate the effectiveness of the proposed algorithm.
引用
收藏
页码:1356 / 1370
页数:15
相关论文
共 50 条
  • [31] CNN based spatial classification features for clustering offline handwritten mathematical expressions
    Cuong Tuan Nguyen
    Vu Tran Minh Khuong
    Hung Tuan Nguyen
    Nakagawa, Masaki
    PATTERN RECOGNITION LETTERS, 2020, 131 (131) : 113 - 120
  • [32] Classification-Based Framework for Remaining Useful Life Prediction With Limited Images and Unequal Time Intervals
    Zhu, Xiaoyan
    Lu, Chenxin
    Zhang, Ping
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [33] Supervised classification-based framework for rock mass discontinuity identification using point cloud data
    Gunen, Mehmet Akif
    Ozturk, Kalif Furkan
    Aliyazicioglu, Sener
    ENGINEERING GEOLOGY, 2025, 350
  • [34] Improving classification-based diagnosis of batch processes through data selection and appropriate pretreatment
    Gins, Geert
    Van den Kerkhof, Pieter
    Vanlaer, Jef
    Van Impe, Jan F. M.
    JOURNAL OF PROCESS CONTROL, 2015, 26 : 90 - 101
  • [35] Discovering the optimal relationship hypothesis of car-following behaviors with neural network-based symbolic regression☆
    Li, Tenglong
    Ngoduy, Dong
    Lee, Seunghyeon
    Pu, Ziyuan
    Viti, Francesco
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2025, 170
  • [36] MACHINE LEARNING - A MATHEMATICAL FRAMEWORK FOR NEURAL NETWORK, SYMBOLIC AND GENETICS-BASED LEARNING
    OOSTHUIZEN, GD
    PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON GENETIC ALGORITHMS, 1989, : 385 - 390
  • [37] IRIS DATA CLASSIFICATION BY MEANS OF PSEUDO NEURAL NETWORKS BASED ON EVOLUTIONARY SYMBOLIC REGRESSION
    Oplatkova, Zuzana Kominkova
    Senkerik, Roman
    PROCEEDINGS 27TH EUROPEAN CONFERENCE ON MODELLING AND SIMULATION ECMS 2013, 2013, : 355 - +
  • [38] A Semantics based Symbolic Regression Framework for Mining Explicit and Implicit Equations from Data
    Quang Nhat Huynh
    Singh, Hemant Kumar
    Ray, Tapabrata
    PROCEEDINGS OF THE 2016 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'16 COMPANION), 2016, : 103 - 104
  • [39] An empirical classification-based framework for the safety criticality assessment of energy production systems, in presence of inconsistent data
    Wang, Tai-Ran
    Mousseau, Vincent
    Pedroni, Nicola
    Zio, Enrico
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2017, 157 : 139 - 151
  • [40] Detecting the Media-adventitia Border in Intravascular Ultrasound Images through a Classification-based Approach
    Wang, Yuan-yuan
    Qiu, Chen-hui
    Jiang, Jun
    Xia, Shun-ren
    ULTRASONIC IMAGING, 2019, 41 (02) : 78 - 93