Discovering Mathematical Expressions Through DeepSymNet: A Classification-Based Symbolic Regression Framework

被引:1
|
作者
Wu, Min [1 ,2 ,3 ,4 ]
Li, Weijun [1 ,2 ,3 ,4 ]
Yu, Lina [2 ,3 ,4 ]
Sun, Linjun [1 ,2 ,3 ,4 ]
Liu, Jingyi [1 ,2 ,3 ,4 ]
Li, Wenqiang [1 ,2 ,3 ,4 ]
机构
[1] Chinese Acad Sci, Inst Semiconductors, AnnLab, Beijing 100083, Peoples R China
[2] Univ Chinese Acad Sci, Ctr Mat Sci & Optoelectron Engn, Beijing 100049, Peoples R China
[3] Univ Chinese Acad Sci, Sch Micro Elect, Beijing 100049, Peoples R China
[4] Beijing Key Lab Semicond Neural Network Intellige, Beijing 100083, Peoples R China
基金
中国国家自然科学基金;
关键词
Training; Prediction algorithms; Task analysis; Optimization; Deep learning; Robustness; Classification algorithms; AI for science; deep learning; symbolic network; symbolic regression (SR);
D O I
10.1109/TNNLS.2023.3332400
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Symbolic regression (SR) is the process of finding an unknown mathematical expression given the input and output and has important applications in interpretable machine learning and knowledge discovery. The major difficulty of SR is that finding the expression structure is an NP-hard problem, which makes the entire process time-consuming. In this study, the solution of expression structures was regarded as a classification problem and solved by supervised learning such that SR can be solved quickly by using the solving experience. Techniques for classification tasks, such as equivalent label merging and sample balance, were used to enhance the robustness of the algorithm. We proposed a symbolic network called DeepSymNet to represent symbolic expressions to improve the performance of the algorithm. DeepSymNet has been proven to have a strong representation ability with a shorter label compared to the current popular representation methods, reducing the search space when predicting. Moreover, DeepSymNet conveniently decomposes SR into two smaller subproblems, which makes solving the problem easier. The proposed algorithm was tested on artificially generated expressions and public datasets and compared with other algorithms. The results demonstrate the effectiveness of the proposed algorithm.
引用
收藏
页码:1356 / 1370
页数:15
相关论文
共 50 条
  • [41] Enhancing Natural Language Query to SQL Query Generation Through Classification-Based Table Selection
    Chopra, Ankush
    Azam, Rauful
    ENGINEERING APPLICATIONS OF NEURAL NETWORKS, EANN 2024, 2024, 2141 : 152 - 165
  • [42] A multitask classification framework based on vision transformer for predicting molecular expressions of glioma
    Xu, Qian
    Xu, Qian Qian
    Shi, Nian
    Dong, Li Na
    Zhu, Hong
    Xu, Kai
    EUROPEAN JOURNAL OF RADIOLOGY, 2022, 157
  • [43] Performance Evaluation of One-Class Classification-based Control Charts through an Industrial Application
    Gani, Walid
    Limam, Mohamed
    QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2013, 29 (06) : 841 - 854
  • [44] WDMA-UWB Indoor Positioning Through Channel Classification-Based NLOS Mitigation Approach
    Liu, Qingzhi
    Zhao, Yanlong
    Yin, Zhendong
    Wu, Zhilu
    IEEE SENSORS JOURNAL, 2024, 24 (18) : 28995 - 29005
  • [45] A Robust Approach for Multi Classification-Based Intrusion Detection through Stacking Deep Learning Models
    Chelloug, Samia Allaoua
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 79 (03): : 4845 - 4861
  • [46] Pathway activity inference for multiclass disease classification through a mathematical programming optimisation framework
    Yang, Lingjian
    Ainali, Chrysanthi
    Tsoka, Sophia
    Papageorgiou, Lazaros G.
    BMC BIOINFORMATICS, 2014, 15
  • [47] Pathway activity inference for multiclass disease classification through a mathematical programming optimisation framework
    Lingjian Yang
    Chrysanthi Ainali
    Sophia Tsoka
    Lazaros G Papageorgiou
    BMC Bioinformatics, 15
  • [48] Discovering explicit Reynolds-averaged turbulence closures for turbulent separated flows through deep learning-based symbolic regression with non-linear corrections
    Tang, Hongwei
    Wang, Yan
    Wang, Tongguang
    Tian, Linlin
    PHYSICS OF FLUIDS, 2023, 35 (02)
  • [49] A Hybrid SVC-CNN based Classification Model for Handwritten Mathematical Expressions(Numbers and Operators)
    Sakshi
    Kukreja, Vinay
    2022 INTERNATIONAL CONFERENCE ON DECISION AID SCIENCES AND APPLICATIONS (DASA), 2022, : 321 - 325
  • [50] EvoCC: An Open-Source Classification-Based Nature-Inspired Optimization Clustering Framework in Python']Python
    Dang, Anh T.
    Qaddoura, Raneem
    Al-Zoubi, Ala' M.
    Faris, Hossam
    Castillo, Pedro A.
    APPLICATIONS OF EVOLUTIONARY COMPUTATION (EVOAPPLICATIONS 2022), 2022, : 77 - 92